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FULL-LENGTH HUMAN CDNAS ENCODING POTENTIALLY SECRETED 



PROTEINS 

CROSS REFERENCE TO RELATED APPLICATIONS 

5 The present application is related to and claims priority from US provisional Application 

Nos. 60/197,873, filed April 18, 2000, 60/224,009, filed August 7, 2000, 60/260,328, filed January 
8, 2001, and 60/224,006, filed August 4, 2000, the entire disclosures of each of which is herein 
incorporated by reference. 

10 FIELD OF THE INVENTION 

The present invention is directed to GENSET polypeptides, fragments thereof, and the 
regulatory regions located in the 5'- and 3 '-ends of the genes encoding the polypeptides. The 
invention also concerns polypeptides encoded by GENSET genes and fragments thereof. The 
present invention also relates to recombinant vectors including the polynucleotides of the present 

1 5 invention, particularly recombinant vectors comprising a GENSET gene regulatory region or a 
sequence encoding a GENSET polypeptide, and to host cells containing the polynucleotides of the 
invention, as well as to methods of making such vectors and host cells. The present invention 
further relates to the use of these recombinant vectors and host cells in the production of the 
polypeptides of the invention. The invention further relates to antibodies that specifically bind to 

20 the polypeptides of the invention and to methods for producing such antibodies and fragments 

thereof. The invention also provides methods of detecting the presence of the polynucleotides and 
polypeptides of the present invention in a sample, methods of diagnosis and screening of abnormal 
GENSET polypeptide expression and/or biological activity, methods of screening compounds for 
their ability to modulate the activity or expression of the GENSET polypeptides, and uses of such 

25 compounds. 

BACKGROUND OF THE INVENTION 

The estimated 30,000-12,000 genes scattered along the human chromosomes offer 
tremendous promise for the understanding, diagnosis, and treatment of human diseases. In addition, 
probes capable of specifically hybridizing to loci distributed throughout the human genome find 
30 application in the construction of high resolution chromosome maps and in the identification of 
individuals. 

In the past, the characterization of even a single human gene was a painstaking process, 
requiring years of effort. Recent developments in the areas of cloning vectors, DNA sequencing, 
and computer technology have merged to greatly accelerate the rate at which human genes can be 
35 isolated, sequenced, mapped, and characterized. 
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Currently, two different approaches are being pursued for identifying and characterizing the 
genes distributed along the human genome. In one approach, large fragments of genomic DNA are 
isolated, cloned, and sequenced. Potential open reading frames in these genomic sequences are 
identified using bio-informatics software. However, this approach entails sequencing large 
5 stretches of human DNA which do not encode proteins in order to find the protein encoding 

sequences scattered throughout the genome. In addition to requiring extensive sequencing, the bio- 
informatics software may mischaracterize the genomic sequences obtained, i.e., labeling non-coding 
DNA as coding DNA and vice versa. 

An alternative approach takes a more direct route to identifying and characterizing human 

1 0 genes. In this approach, complementary DN As (cDNAs) are synthesized from isolated messenger 
RNAs (mRNAs) which encode human proteins. Using this approach, sequencing is only performed 
on DNA which is derived from protein coding fragments of the genome. Often, only short stretches 
of the cDNAs are sequenced to obtain sequences called expressed sequence tags (ESTs). The ESTs 
may then be used to isolate or purify cDNAs which include sequences adjacent to the EST 

1 5 sequences. The cDNAs may contain all of the sequence of the EST which was used to obtain them 
or only a fragment of the sequence of the EST which was used to obtain them. In addition, the 
cDNAs may contain the full coding sequence of the gene from which the EST was derived or, 
alternatively, the cDNAs may include fragments of the coding sequence of the gene from which the 
EST was derived. It will be appreciated that there may be several cDNAs which include the EST 

20 sequence as a result of alternate splicing or the activity of alternative promoters. 

In the past, these short EST sequences were often obtained from oligo-dT primed cDNA 
libraries. Accordingly, they mainly corresponded to the 3' untranslated region of the mRNA. In 
part, the prevalence of EST sequences derived from the 3' end of the mRNA is a result of the fact 
that typical techniques for obtaining cDNAs are not well suited for isolating cDNA sequences 

25 derived from the 5' ends of mRNAs (Adams et aL, Nature 377:3-174, 1996, Hillier et aL, Genome 
Res. 6:807-828, 1996). In addition, in those reported instances where longer cDNA sequences have 
been obtained, the reported sequences typically correspond to coding sequences and do not include 
the full 5' untranslated region (5'UTR) of the mRNA from which the cDNA is derived. Indeed, 
5'UTRs have been shown to affect either the stability or translation of mRNAs. Thus, regulation of 

30 gene expression may be achieved through the use of alternative 5'UTRs as shown, for instance, for 
the translation of the tissue inhibitor of metal loprotease mRNA in mitogenically activated cells 
(Waterhouse et aL, JBiolChem. 265:5585-9. 1990). Furthermore, modification of 5'UTR through 
mutation, insertion or translocation events may even be implied in pathogenesis. For instance, the 
Fragile X syndrome, the most common cause of inherited mental retardation, is partly due to an 

35 insertion of multiple CGG trinucleotides in the 5'UTR of the Fragile X mRNA resulting in the 
inhibition of protein synthesis via ribosome stalling (Feng et aL, Science 268:731-4, 1995). An 
aberrant mutation in regions of the 5'UTR known to inhibit translation of the proto-oncogene c-myc 
was shown to result in upregulation of c-myc protein levels in cells derived from patients with 
multiple myelomas (Willis et aL, Curr Top Microbiol Immunol 224:269-76, 1 997). In addition, the 
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use of oligo-dT primed cDNA libraries does not allow the isolation of complete 5'UTRs since such 
incomplete sequences obtained by this process may not include the first exon of the mRNA, 
particularly in situations where the first exon is short. Furthermore, they may not include some 
exons, often short ones, which are located upstream of splicing sites. Thus, there is a need to obtain 
5 sequences derived from the 5' ends of mRNAs. 

Moreover, despite the great amount of EST data that large-scale sequencing projects have 
yielded (Adams etal., Nature 377:174, 1996; Hillier et al y Genome Res. 6:807-828, 1996), 
information concerning the biological function of the mRNAs corresponding to such obtained 
cDNAs has revealed to be limited. Indeed, whereas the knowledge of the complete coding 

1 0 sequence is absolutely necessary to investigate the biological function of mRNAs, ESTs yield only 
partial coding sequences. So far, large-scale full-length cDNA cloning has been achieved only with 
limited success because of the poor efficiency of methods for constructing full-length cDNA 
libraries. Indeed, such methods require either a large amount of mRNA (Ederly etal., 1995), thus 
resulting in non representative full-length libraries when small amounts of tissue are available or 

1 5 require PCR amplification (Maruyama et al. , 1 994; CLONTECHniques, 1 996) to obtain a 

reasonable number of clones, thus yielding strongly biased cDNA libraries where rare and long 
cDNAs are lost. Thus, there is a need to obtain full-length cDNAs, i.e. cDNAs containing the full 
coding sequence of their corresponding mRNAs. 

While many sequences derived from human chromosomes have practical applications, 

20 approaches based on the identification and characterization of those chromosomal sequences which 
encode a protein product are particularly relevant to diagnostic and therapeutic uses. Of the 30,000- 
120,000 protein coding genes, those genes encoding proteins which are secreted from the cell in 
which they are synthesized, as well as the secreted proteins themselves, are particularly valuable as 
potential therapeutic agents. Such proteins are often involved in cell to cell communication and 

25 may be responsible for producing a clinically relevant response in their target cells. In fact, several 
secretory proteins, including tissue plasminogen activator, G-CSF, GM-CSF, erythropoietin, human 
growth hormone, insulin, interferon-ot, interferon-P, interferon-y, and interleukin-2, are currently in 
clinical use. These proteins are used to treat a wide range of conditions, including acute myocardial 
infarction, acute ischemic stroke, anemia, diabetes, growth hormone deficiency, hepatitis, kidney 

30 carcinoma, chemotherapy induced neutropenia and multiple sclerosis. For these reasons, cDNAs 
encoding secreted proteins or fragments thereof represent a particularly valuable source of 
therapeutic agents. Thus, there is a need for the identification and characterization of secreted 
proteins and the nucleic acids encoding them. 

In addition to being therapeutically useful themselves, secretory proteins include short 

35 peptides, called signal peptides, at their amino termini which direct their secretion. These signal 
peptides are encoded by the signal sequences located at the 5' ends of the coding sequences of genes 
encoding secreted proteins. Because these signal peptides will direct the extracellular secretion of 
any protein to which they are operably linked, the signal sequences may be exploited to direct the 
efficient secretion of any protein by operably linking the signal sequences to a gene encoding the 
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protein for which secretion is desired. In addition, fragments of the signal peptides called 
membrane-translocating sequences may also be used to direct the intracellular import of a peptide 
or protein of interest. This may prove beneficial in gene therapy strategies in which it is desired to 
deliver a particular gene product to cells other than the cells in which it is produced. Signal 
5 sequences encoding signal peptides also find application in simplifying protein purification 

techniques. In such applications, the extracellular secretion of the desired protein greatly facilitates 
purification by reducing the number of undesired proteins from which the desired protein must be 
selected. Thus, there exists a need to identify and characterize the 5* fragments of the genes for 
secretory proteins which encode signal peptides. 

10 Sequences coding for secreted proteins may also find application as therapeutics or 

diagnostics. In particular, such sequences may be used to determine whether an individual is likely 
to express a detectable phenotype, such as a disease, as a consequence of a mutation in the coding 
sequence for a secreted protein. In instances where the individual is at risk of suffering from a 
disease or other undesirable phenotype as a result of a mutation in such a coding sequence, the 

15 undesirable phenotype may be corrected by introducing a normal coding sequence using gene 
therapy. Alternatively, if the undesirable phenotype results from overexpression of the protein 
encoded by the coding sequence, expression of the protein may be reduced using antisense or triple 
helix based strategies. 

The secreted human polypeptides encoded by the coding sequences may also be used as 

20 therapeutics by administering them directly to an individual having a condition, such as a disease, 
resulting from a mutation in the sequence encoding the polypeptide. In such an instance, the 
condition can be cured or ameliorated by administering the polypeptide to the individual. 

In addition, the secreted human polypeptides or fragments thereof may be used to generate 
antibodies useful in determining the tissue type or species of origin of a biological sample. The 

25 antibodies may also be used to determine the cellular localization of the secreted human 

polypeptides or the cellular localization of polypeptides which have been fused to the human 
polypeptides. In addition, the antibodies may also be used in immunoaffinity chromatography 
techniques to isolate, purify, or enrich the human polypeptide or a target polypeptide which has 
been fused to the human polypeptide. 

30 Public information on the number of human genes for which the promoters and upstream 

regulatory regions have been identified and characterized is quite limited. In part, this may be due 
to the difficulty of isolating such regulatory sequences. Upstream regulatory sequences such as 
transcription factor binding sites are typically too short to be utilized as probes for isolating 
promoters from human genomic libraries. Recently, some approaches have been developed to 

35 isolate human promoters. One of them consists of making a CpG island library (Cross et al, Nature 
Genetics 6: 236-244, 1994). The second consists of isolating human genomic DNA sequences 
containing Spel binding sites by the use of Spel binding protein (Mortlock et a!., Genome Res. 
6:327-335, 1996). Both of these approaches have their limits due to a lack of specificity and of 



4 



WO 02/083898 PCT/IB01/00914 

comprehensiveness. Thus, there exists a need to identify and systematically characterize the 5* 
fragments of the genes. 

cDNAs including the 5' ends of their corresponding mRNA may be used to efficiently 
identify and isolate 5'UTRs and upstream regulatory regions which control the location, 
5 developmental stage, rate, and quantity of protein synthesis, as well as the stability of the mRNA 
(Theil et al. 9 BioFactors 4:87-93, (1993). Once identified and characterized, these regulatory 
regions may be utilized in gene therapy or protein purification schemes to obtain the desired amount 
and locations of protein synthesis or to inhibit, reduce, or prevent the synthesis of undesirable gene 
products. 

10 In addition, cDNAs containing the 5' ends of secretory protein genes may include 

sequences useful as probes for chromosome mapping and the identification of individuals. Thus, 
there is a need to identify and characterize the sequences upstream of the 5' coding sequences of 
genes encoding secretory proteins. 

SUMMARY OF THE INVENTION 

1 5 The present invention provides a purified or isolated polynucleotide comprising, consisting 

of, or consisting essentially of a nucleotide sequence selected from the group consisting of: (a) the 
sequences of SEQ ID NOs:l-169, 339-455, 561-784; (b) the sequences of clone inserts of the 
deposited clone pool; (c) the coding sequences of SEQ ID NOs: 1-169, 339-455, 561-784; (d) the 
coding sequences of the clone inserts of the deposited clone pool; (e) the sequences encoding one of 

20 the polypeptides of SEQ ID NOs: 170-33 8, 456-560, 785-918; (f) the sequences encoding one of the 
polypeptides encoded by the clone inserts of the deposited clone pool; (g) the genomic sequences 
coding for the GENSET polypeptides; (h) the 5 1 transcriptional regulatory regions of GENSET 
genes; (i) the 3' transcriptional regulatory regions of GENSET genes; (j) the polynucleotides 
comprising the nucleotide sequence of any combination of (g)-(i); (k) the variant polynucleotides of 

25 any of the polynucleotides of (a)-(j); (1) the polynucleotides comprising a nucleotide sequence of 
(a)-(k), wherein the polynucleotide is single stranded, double stranded, or a portion is single 
stranded and a portion is double stranded; (m) the polynucleotides comprising a nucleotide 
sequence complementary to any of the single stranded polynucleotides of (1). The invention further 
provides for fragments of the nucleic acid molecules of (a)-(m) described above. 

30 Further embodiments of the invention include purified or isolated polynucleotides that 

comprise, consist of, or consist essentially of a nucleotide sequence at least 70% identical, more 
preferably at least 75%, and even more preferably at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, 
or 99% identical, to any of the nucleotide sequences in (a)-(m) above, e.g. over a region of at least 
about 25, 50, 100, 150, 250, 500, 1000, or more contiguous nucleotides, or a polynucleotide which 

35 hybridizes under stringent hybridization conditions to a polynucleotide in (a)-(m) above. 

The present invention also relates to recombinant vectors, which include the purified or 
isolated polynucleotides of the present invention, and to host cells recombinant for the 
polynucleotides of the present invention, as well as to methods of making such vectors and host 
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cells. The present invention further relates to the use of these recombinant vectors and recombinant 
host cells in the production of GENSET polypeptides. 

The invention further provides a purified or isolated polypeptide comprising, consisting of, 
or consisting essentially of an amino acid sequence selected from the group consisting of: (a) the 
5 polypeptides of SEQ ID NOs: 170-338, 456-560, 785-918; (b) the polypeptides encoded by the 
clone inserts of the deposited clone pool; (c) the epitope-bearing fragments of the polypeptides of 
SEQ ID NOs: 170-338, 456-560, 785-91 8; (d) the epitope-bearing fragments of the polypeptides 
encoded by the clone inserts contained in the deposited clone pool; (e) the domains of the 
polypeptides of SEQ ID NOs:170-338, 456-560, 785-918; (f) the domains of the polypeptides 

10 encoded by the clone inserts contained in the deposited clone pool; and (g) the allelic variant 

polypeptides of any of the polypeptides of (a)-(f). The invention further provides for fragments of 
the polypeptides of (a)-(g) above, such as those having biological activity or comprising 
biologically functional domain(s). 

The present invention further includes polypeptides with an amino acid sequence with at 

15 least 70% similarity, and more preferably at least 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 
99% similarity to those polypeptides described in (a)-(g), as well as polypeptides having an amino 
acid sequence at least 70% identical, more preferably at least 75% identical, and still more 
preferably 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identical to those polypeptides described 
in (a)-(g), e.g. over a region of at least about 25, 50, 100, 150, 250, 500, 1000, or more amino acids. 

20 The invention further relates to methods of making the polypeptides of the present invention. 

The present invention further relates to transgenic plants or animals, wherein said 
transgenic plant or animal is transgenic for a polynucleotide of the present invention and expresses a 
polypeptide of the present invention. 

The invention further relates to antibodies that specifically bind to GENSET polypeptides 

25 of the present invention and fragments thereof as well as to methods for producing such antibodies 
and fragments thereof. 

The invention also provides kits, uses and methods for detecting GENSET gene expression 
and/or biological activity in a biological sample. One such method involves assaying for the 
expression of a GENSET polynucleotide in a biological sample using the polymerase chain reaction 

30 (PCR) to amplify and detect GENSET polynucleotides or Southern and Northern blot hybridization 
to detect GENSET genomic DNA, cDNA or mRNA. Alternatively, a method of detecting 
GENSET gene expression in a test sample can be accomplished using a compound which binds to a 
GENSET polypeptide of the present invention or a portion of a GENSET polypeptide. 
The present invention also relates to diagnostic methods and uses of GENSET 

35 polynucleotides and polypeptides for identifying individuals or non-human animals having elevated 
or reduced levels of GENSET gene products, which individuals are likely to benefit from therapies 
to suppress or enhance GENSET gene expression, respectively, and to methods of identifying 
individuals or non-human animals at increased risk for developing, or at present having, certain 
diseases/disorders associated with GENSET polypeptide expression or biological activity. 
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The present invention also relates to kits, uses and methods of screening compounds for 
their ability to modulate (e.g. increase or inhibit) the activity or expression of GENSET 
polypeptides including compounds that interact with GENSET gene regulatory sequences and 
compounds that interact directly or indirectly with a GENSET polypeptide. Uses of such 
5 compounds are also within the scope of the present invention. 

The present invention also relates to pharmaceutical or physiologically acceptable 
compositions comprising, an active agent, the polypeptides, polynucleotides or antibodies of the 
present invention, as well as, typically, a pharmaceutically acceptable carrier. 

The present invention also relates to computer systems containing cDN A codes and 

10 polypeptide codes of sequences of the invention and to computer-related methods of comparing 
sequences, identifying homology or features using GENSET polypeptides or GENSET 
polynucleotide sequences of the invention. 

In another aspect, the present invention provides an isolated polynucleotide, the 
polynucleotide comprising a nucleic acid sequence encoding: i) a polypeptide comprising an amino 

1 5 acid sequence having at least about 80% identity to any one of the sequences shown as SEQ ID 
NOs: 170-338, 456-560, 785-918 or any one of the sequences of polypeptides encoded by the clone 
inserts of the deposited clone pool; or a biologically active fragment of the polypeptide. 

In one embodiment, the polypeptide comprises any one of the sequences shown as SEQ ID 
NOs:170-338, 456-560, 785-918 or any one of the sequences of the polypeptides encoded by the 

20 clone inserts of the deposited clone pool. In another embodiment, the polypeptide comprises a 
signal peptide. In another embodiment, the polypeptide is a mature protein. In another 
embodiment, the nucleic acid sequence has at least about 80% identity over at least about 100 
contiguous nucleotides to any one of the sequences shown as SEQ ID NOs: 1-169, 339-455, 561- 
784 or any one of the sequences of the clone inserts of the deposited clone pool. In another 

25 embodiment, the polynucleotide hybridizes under stringent conditions to a polynucleotide 

comprising any one of the sequences shown as SEQ ID NOs: 1 -169, 339-455, 561-784 or any one of 
the sequences of the clone inserts of the deposited clone pool. In another embodiment, the nucleic 
acid sequence comprises any one of the sequences shown as SEQ ID NOs: 1-169, 339-455, 561-784 
or any one the sequences of the clone inserts of the deposited clone pool. In another embodiment, 

30 the polynucleotide is operably linked to a promoter. 

In another aspect, the present invention provides an expression vector comprising any of the 
herein-described polynucleotides, operably linked to a promoter. In another aspect, the present 
invention provides a host cell recombinant for any of the herein-described polynucleotides. In 
another aspect, the present invention provides a non-human transgenic animal comprising the host 

35 cell. 

In another aspect, the present invention provides a method of making a GENSET 
polypeptide, the method comprising a) providing a population of host cells comprising a herein- 
described polynucleotide and b) culturing the population of host cells under conditions conducive to 
the production of the polypeptide within said host cells. 
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In one embodiment, the method further comprises purifying the polypeptide from the 
population of host cells. 

In another aspect, the present invention provides a method of making a GENSET 
polypeptide, the method comprising a) providing a population of cells comprising a polynucleotide 
5 encoding a herein-described polypeptide; b) culturing the population of cells under conditions 
conducive to the production of the polypeptide within the cells; and c) purifying the polypeptide 
from the population of cells. 

In another aspect, the present invention provides an isolated polynucleotide, the 
polynucleotide comprising a nucleic acid sequence having at least about 80% identity over at least 
1 0 about 1 00 contiguous nucleotides to any one of the sequences shown as SEQ ID NOs: 1 -1 69, 339- 
455, 561-784 or any one of the sequences of the clone inserts of the deposited clone pool. 

In one embodiment, the polynucleotide hybridizes under stringent conditions to a 
polynucleotide comprising any one of the sequences shown as SEQ ID NOs: 1-169, 339-455, 561- 
784 or any one of the sequences of the clone inserts of the deposited clone pool. In another 
1 5 embodiment, the polynucleotide comprises any one of the sequences shown as SEQ ID NOs: 1-1 69, 
339-455, 561-784 or any one of the sequences of the clone inserts of the deposited clone pool. 

In another aspect, the present invention provides a biologically active polypeptide encoded 
by any of the herein-described polynucleotides. 

In another aspect, the present invention provides an isolated polypeptide or biologically 
20 active fragment thereof, the polypeptide comprising an amino acid sequence having at least about 
80% sequence identity to any one of the sequences shown as SEQ ID NOs: 1 70-338, 456-560, 785- 
91 8 or any one of the sequences of polypeptides encoded by the clone inserts of the deposited clone 
pool. 

In one embodiment, the polypeptide is selectively recognized by an antibody raised against 
25 an antigenic polypeptide, or an antigenic fragment thereof, the antigenic polypeptide comprising 
any one of the sequences shown as SEQ ID NOs: 170-338, 456-560, 785-918 or any one of the 
sequences of polypeptides encoded by the clone inserts of the deposited clone pool. In another 
embodiment, the polypeptide comprises any one of the sequences shown as SEQ ID NOs: 170-338, 
456-560, 785-91 8 or any one of the sequences of polypeptides encoded by the clone inserts of the 
30 deposited clone pool. In another embodiment, the polypeptide comprises a signal peptide. In 
another embodiment, the polypeptide is a mature protein. 

In another aspect, the present invention provides an antibody that specifically binds to any 
of ther herein-described polypeptides. 

In another aspect, the present invention provides a method of determining whether a 
35 GENSET gene is expressed within a mammal, the method comprising the steps of: a) providing a 
biological sample from said mammal; b) contacting said biological sample with either of: i) a 
polynucleotide that hybridizes under stringent conditions to any of the herein-described 
polynucleotides; or ii) a polypeptide that specifically binds to any of the herein-described 
polypeptides; and c) detecting the presence or absence of hybridization between the polynucleotide 
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and an RNA species within the sample, or the presence or absence of binding of the polypeptide to a 
protein within the sample; wherein a detection of the hybridization or of the binding indicates that 
the GENSET gene is expressed within the mammal. 

In one embodiment, the polynucleotide is a primer, and the hybridization is detected by 
5 detecting the presence of an amplification product comprising the sequence of the primer. In 
another embodiment, the polypeptide is an antibody. 

In another aspect, the present invention provides a method of determining whether a 
mammal has an elevated or reduced level of GENSET gene expression, the method comprising the 
steps of: a) providing a biological sample from the mammal; and b) comparing the amount of any of 

10 the herein-described polypeptides, or of an RNA species encoding the polypeptide, within the 

biological sample with a level detected in or expected from a control sample; wherein an increased 
amount of the polypeptide or the RNA species within the biological sample compared to the level 
detected in or expected from the control sample indicates that the mammal has an elevated level of 
the GENSET gene expression, and wherein a decreased amount of the polypeptide or the RNA 

15 species within the biological sample compared to the level detected in or expected from the control 
sample indicates that the mammal has a reduced level of the GENSET gene expression. 

In another aspect, the present invention provides a method of identifying a candidate 
modulator of a GENSET polypeptide, the method comprising: a) contacting any of the herein- 
described polypeptides with a test compound; and b) determining whether the compound 

20 specifically binds to the polypeptide; wherein a detection that the compound specifically binds to 
the polypeptide indicates that the compound is a candidate modulator of the GENSET polypeptide. 

In one embodiment, the method further comprises testing the biological activity of the 
GENSET polypeptide in the presence of the candidate modulator, wherein an alteration in the 
biological activity of the GENSET polypeptide in the presence of the compound in comparison to 

25 the activity in the absence of said compound indicates that the compound is a modulator of the 
GENSET polypeptide. 

In another aspect, the present invention provides a method for the production of a 
pharmaceutical composition, the method comprising a) identifying a modulator of a GENSET 
polypeptide using any of the herein-described methods; and b) combining the modulator with a 

30 pharmaceutical ly acceptable carrier. 

BRIEF DESCRIPTION OF DRAWINGS 

Figure 1 is a block diagram of an exemplary computer system. 

Figure 2 is a flow diagram illustrating one embodiment of a process 200 for comparing a 
35 new nucleotide or protein sequence with a database of sequences in order to determine the identity 
levels between the new sequence and the sequences in the database. 

Figure 3 is a flow diagram illustrating one embodiment of a process 250 in a computer for 
determining whether two sequences are homologous. 
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Figure 4 is a flow diagram illustrating one embodiment of an identifier process 300 for 
detecting the presence of a feature in a sequence. 



BRIEF DESCRIPTION OF THE TABLES 

5 Table I provides the SEQ ID Nos in the present application (with the SEQ ID Nos 

corresponding to nucleic acid sequences preceded by "NUC", and the SEQ ID Nos corresponding to 
the encoded polypeptide sequences preceded by "PRT") that correspond to a SEQ ID NO in priority 
application number 60/197,873. Applicants' internal designation number (Clone ID) corresponding 
to each sequence identification (SEQ ID) number is also provided. 

1 0 Table II lists the putative chromosomal location of the polynucleotides of the present 

invention. The SEQ ID NO listed for each polynucleotide is that from the priority application 
60/197,873; the corresponding SEQ ID NOs for the sequence in the present application can be 
determined by referring to Table I. 

Table III lists the number of hits in Genset's cDNA libraries of tissues and cell types for 

1 5 polynucleotides of the invention. The following abbreviations are used to refer to each cell or tissue 
type: A=Brain; B=Fetal brain; C=Fetal kidney; D=Fetal liver; E=Pituitary gland; F=Liver; 
G=Placenta; H=Prostate; I=Salivary gland; J=Stomach/Intestine; and K=Testis. The SEQ ID NO 
listed for each polynucleotide is that from the priority application 60/197,873; the corresponding 
SEQ ID NOs for the sequence in the present application can be determined by referring to Table I. 

20 Table IV lists the number of hits in publicly available library of tissues and cell types for 

polynucleotides of the invention. The SEQ ID NO listed for each polynucleotide is that from the 
priority application 60/197,873; the corresponding SEQ ID NOs for each sequence in the present 
application can be determined by referring to Table I. 

Table V lists the tissues and cell types in which the polynucleotide sequences of the present 

25 invention are over- or under-represented. The SEQ ID NO listed for each polynucleotide is that 
from the priority application 60/197,873; the corresponding SEQ ID NOs for each sequence in the 
present application can be determined by referring to Table I. 

BRIEF DESCRIPTION OF THE SEQUENCE LISTING 

30 SEQ ID NOs:l-169, 339-455, 561-784 are the nucleotide sequences of cDNAs, with open 

reading frames as indicated as features (CDS). When appropriate, the locations of the potential 
polyadenylation site and polyadenylation signal are also indicated. 

SEQ ID NOs: 170-338, 456-560, 785-918 are the amino acid sequences of proteins encoded 
by the cDNAs of SEQ ID NOs:l-169, 339-455, 561-784. 

35 SEQ ID NOs: 1-85, 339-400, 406-407, 413-415, 561-594, and 634-651 are the nucleotide 

sequences of cDNAs encoding a potentially secreted protein. The locations of the ORFs and 
sequences encoding signal peptides are listed in the accompanying Sequence Listing. In addition, 
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the von Heijne score of the signal peptide computed as described below is listed as the "score" in 
the accompanying Sequence Listing. The sequence of the signal-peptide is listed as "seq" in the 
accompanying Sequence Listing. The "/" in the signal peptide sequence indicates the location 
where proteolytic cleavage of the signal peptide occurs to generate a mature protein. When 
5 appropriate, the locations of the first and last nucleotides of the coding sequences, eventually the 
locations of the first and last nucleotides of the polyA and the locations of the first and last 
nucleotides of the polyA sites are indicated. 

SEQ ID NOs:86-169, 401-405, 408-412, 416-455, 595-633, 652-784 are the nucleotide 
sequences of cDNAs in which no sequence encoding a signal peptide has been identified to date. 

10 However, it remains possible that subsequent analysis will identify a sequence encoding a signal 
peptide in these nucleic acids. The locations of the ORFs are listed in the accompanying Sequence 
Listing. When appropriate, the locations of the first and last nucleotides of the coding sequences, 
eventually the locations of the first and last nucleotides of the polyA and the locations of the first 
and last nucleotides of the polyA sites are indicated. 

15 SEQ ID NOs: 170-254, 456-517, 520-521, 527-529, 785-818, and 858-875 are the amino 

acid sequences of polypeptides which contain a signal peptide. These polypeptides are encoded by 
the cDNAs of SEQ ID NOs: 1-85, 339-400, 406-407, 413-415, 563-594, and 634-651. The location 
of the signal peptide is listed in the accompanying Sequence Listing. 

SEQ IDNOs:255-338, 517-519, 522-526, 530-560, 819-857, 876-918 are the amino acid 

20 sequences of polypeptides in which no signal peptide has been identified to date. However, it 

remains possible that subsequent analysis will identify a signal peptide in these polypeptides. These 
polypeptides are encoded by the nucleic acids of SEQ ID NOs: 86-169, 401-405, 408-412, 416-455, 
595-633, 652-784. 

In accordance with the regulations relating to Sequence Listings, the following codes have 
25 been used in the Sequence Listing to describes nucleotide sequences. The code "r" in the sequences 
indicates that the nucleotide may be a guanine or an adenine. The code "y" in the sequences 
indicates that the nucleotide may be a thymine or a cytosine. The code "m" in the sequences 
indicates that the nucleotide may be an adenine or a cytosine. The code "k" in the sequences 
indicates that the nucleotide may be a guanine or a thymine. The code "s" in the sequences 
30 indicates that the nucleotide may be a guanine or a cytosine. The code "w" in the sequences 
indicates that the nucleotide may be an adenine or an thymine. In addition, all instances of the 
symbol "n" in the nucleic acid sequences mean that the nucleotide can be adenine, guanine, cytosine 
or thymine. 

In some instances, the polypeptide sequences in the Sequence Listing contain the symbol 
35 "Xaa." These "Xaa" symbols indicate either (1) a residue which cannot be identified because of 
nucleotide sequence ambiguity or (2) a stop codon in the determined sequence where applicants 
believe one should not exist (if the sequence were determined more accurately). In some instances, 
several possible identities of the unknown amino acids may be suggested by the genetic code. 
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In the case of secreted proteins, it should be noted that, in accordance with the regulations 
governing Sequence Listings, in the appended Sequence Listing the encoded protein (i.e. the protein 
containing the signal peptide and the mature protein or part thereof) extends from an amino acid 
residue having a negative number through a positively numbered amino acid residue. Thus, the first 
5 amino acid of the mature protein resulting from cleavage of the signal peptide is designated as 
amino acid number 1, and the first amino acid of the signal peptide is designated with the 
appropriate negative number. 

DETAILED DESCRIPTION OF THE INVENTION AND PREFERRED 
10 EMBODIMENTS 

Definitions 

Before describing the invention in greater detail, the following definitions are set forth to 
illustrate and define the meaning and scope of the terms used to describe the invention herein. 

The term "GENSET gene," when used herein, encompasses genomic, mRNA and cDNA 
15 sequences encoding a GENSET polypeptide, including the 5' and 3' untranslated regions of said 
sequences. 

The term "GENSET polypeptide biological activity" or "GENSET biological activity" is 
intended for polypeptides exhibiting any activity similar, but not necessarily identical, to an activity 
of a GENSET polypeptide of the invention. The GENSET polypeptide biological activity of a 
20 given polypeptide may be assessed using any suitable biological assay, a number of which are 
known to those skilled in the art. In contrast, the term "biological activity" refers to any activity 
that any polypeptide may have. 

The term "corresponding mRNA" refers to mRNA which was or can be a template for 
cDNA synthesis for producing a cDNA of the present invention. 
25 The term "corresponding genomic DNA" refers to genomic DNA which encodes an mRNA 

of interest, e.g. corresponding to a cDNA of the invention, which genomic DNA includes the 
sequence of one of the strands of the mRNA, in which thymidine residues in the sequence of the 
genomic DNA (or cDNA) are replaced by uracil residues in the mRNA. 

The term "deposited clone pool" is used herein to refer to the pool of clones entitled cDNA- 
30 8-2000, deposited with the ATCC on September 27, 2000, or the pool of clones entitled cDNA-1 1- 
2000, deposited with the ATCC on November 27, 2000, or any other deposited clone pool 
containing a clone corresponding to any of the herein-described sequences. 

The term "heterologous", when used herein, is intended to designate any polynucleotide or 
polypeptide other than a GENSET polynucleotide or GENSET polypeptide of the invention, 
35 respectively. 

"Providing" with respect to, e.g. a biological sample, population of cells, etc. indicates that 
the sample, population of cells, etc. is somehow used in a method or procedure. Significantly, 
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"providing" a biological sample or population of cells does not require that the sample or cells are 
specifically isolated or obtained for the purposes of the invention, but can instead refer, for 
example, to the use of a biological sample obtained by another individual, for another purpose. 

An "amplification product" refers to a product of any amplification reaction, e.g. PCR, RT- 
5 PCR,LCR, etc. 

A "modulator" of a protein or other compound refers to any agent that has a functional 
effect on the protein, including physical binding to the protein, alterations of the quantity or quality 
of expression of the protein, altering any measurable or detectable activity, property, or behavior of 
the protein, or in any way interacts with the protein or compound. 

1 0 "A test compound" can be any molecule that is evaluated for its ability to modulate a 

protein or other compound. 

An antibody or other compound that specifically binds to a polypeptide or polynucleotide of 
the invention is also said to "selectively recognize" the polypeptide or polynucleotide. 

The term "isolated" with respect to a molecule requires that the molecule be removed from 

1 5 its original environment (e. g., the natural environment if it is naturally occurring). For example, a 
naturally-occurring polynucleotide or polypeptide present in a living animal is not isolated, but the 
same polynucleotide or DNA or polypeptide, separated from some or all of the coexisting materials 
in the natural system, is isolated. Such polynucleotide could be part of a vector and/or such 
polynucleotide or polypeptide could be part of a composition, and still be isolated in that the vector 

20 or composition is not part of its natural environment. For example, a naturally-occurring 

polynucleotide present in a living animal is not isolated, but the same polynucleotide, separated 
from some or all of the coexisting materials in the natural system, is isolated. Specifically excluded 
from the definition of "isolated" are: naturally-occurring chromosomes (such as chromosome 
spreads), artificial chromosome libraries, genomic libraries, and cDNA libraries that exist either as 

25 an in vitro nucleic acid preparation or as a transfected/transformed host cell preparation, wherein the 
host cells are either an in vitro heterogeneous preparation or plated as a heterogeneous population of 
single colonies. Also specifically excluded are the above libraries wherein a specified 
polynucleotide makes up less than 5% of the number of nucleic acid inserts in the vector molecules. 
Further specifically excluded are whole cell genomic DNA or whole cell RNA preparations 

30 (including said whole cell preparations which are mechanically sheared or enzymatically digested). 
Further specifically excluded are the above whole cell preparations as either an in vitro preparation 
or as a heterogeneous mixture separated by electrophoresis (including blot transfers of the same) 
wherein the polynucleotide of the invention has not further been separated from the heterologous 
polynucleotides in the electrophoresis medium (e.g., further separating by excising a single band 

35 from a heterogeneous band population in an agarose gel or nylon blot). 

The term "purified" does not require absolute purity; rather, it is intended as a relative 
definition. Purification of starting material or natural material to at least one order of magnitude, 
preferably two or three orders, and more preferably four or five orders of magnitude is expressly 
contemplated. As an example, purification from 0.1% concentration to 10% concentration is two 
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orders of magnitude. To illustrate, individual cDNA clones isolated from a cDNA library have been 
conventionally purified to electrophoretic homogeneity. The sequences obtained from these clones 
could not be obtained directly either from the library or from total human DNA. The cDNA clones 
are not naturally occurring as such, but rather are obtained via manipulation of a partially purified 
5 naturally occurring substance (messenger RNA). The conversion of mRNA into a cDNA library 
involves the creation of a synthetic substance (cDNA) and pure individual cDNA clones can be 
isolated from the synthetic library by clonal selection. Thus, creating a cDNA library from 
messenger RNA and subsequently isolating individual clones from that library results in an 
approximately 10 4 -10 6 fold purification of the native message. 

10 The term "purified" is further used herein to describe a polypeptide or polynucleotide of the 

invention which has been separated from other compounds including, but not limited to, 
polypeptides or polynucleotides, carbohydrates, lipids, etc. The term "purified" may be used to 
specify the separation of monomeric polypeptides of the invention from oligomeric forms such as 
homo- or hetero- dimers, trimers, etc. The term "purified" may also be used to specify the 

15 separation of covalently closed (i.e. circular) polynucleotides from linear polynucleotides. A 
polynucleotide is substantially pure when at least about 50%, preferably 60 to 75% of a sample 
exhibits a single polynucleotide sequence and conformation (linear versus covalently close). A 
substantially pure polypeptide or polynucleotide typically comprises about 50%, preferably 60 to 
90% weight/weight of a polypeptide or polynucleotide sample, respectively, more usually about 

20 95%, and preferably is over about 99% pure. Polypeptide and polynucleotide purity, or 
homogeneity, is indicated by a number of means well known in the art, such as agarose or 
polyacrylamide gel electrophoresis of a sample, followed by visualizing a single band upon staining 
the gel. For certain purposes higher resolution can be provided by using HPLC or other means well 
known in the art. As an alternative embodiment, purification of the polypeptides and polynucleotides 

25 of the present invention may be expressed as "at least" a percent purity relative to heterologous 

polypeptides and polynucleotides (DNA, RNA or both). As a preferred embodiment, the polypeptides 
and polynucleotides of the present invention are at least; 10%, 20%, 30%, 40%, 50%, 60%, 70%, 
80%, 90%, 95%, 96%, 96%, 98%, 99%, or 100% pure relative to heterologous polypeptides and 
polynucleotides, respectively. As a further preferred embodiment the polypeptides and 

30 polynucleotides have a purity ranging from any number, to the thousandth position, between 90% and 
100% (e.g., a polypeptide or polynucleotide at least 99.995% pure) relative to either heterologous 
polypeptides or polynucleotides, respectively, or as a weight/weight ratio relative to all compounds 
and molecules other than those existing in the carrier. Each number representing a percent purity, 
to the thousandth position, may be claimed as individual species of purity. 

35 As used interchangeably herein, the terms " nucleic acid molecule(s) ", " oligonucleotide^) ", 

and " polvnucleotide(s) " include RNA or DNA (either single or double stranded, coding, 
complementary or antisense), or RNA/DNA hybrid sequences of more than one nucleotide in either 
single chain or duplex form (although each of the above species may be particularly specified). The 
term " nucleotide " is used herein as an adjective to describe molecules comprising RNA, DNA, or 
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RNA/DNA hybrid sequences of any length in single-stranded or duplex form. More precisely, the 
expression "nucleotide sequence" encompasses the nucleic material itself and is thus not restricted 
to the sequence information (i.e. the succession of letters chosen among the four base letters) that 
biochemically characterizes a specific DNA or RNA molecule. The term "nucleotide" is also used 
5 herein as a noun to refer to individual nucleotides or varieties of nucleotides, meaning a molecule, 
or individual unit in a larger nucleic acid molecule, comprising a purine or pyrimidine, a ribose or 
deoxyribose sugar moiety, and a phosphate group, or phosphodiester linkage in the case of 
nucleotides within an oligonucleotide or polynucleotide. The term "nucleotide" is also used herein 
to encompass "modified nucleotides" which comprise at least one modification such as (a) an 

10 alternative linking group, (b) an analogous form of purine, (c) an analogous form of pyrimidine, or 
(d) an analogous sugar. For examples of analogous linking groups, purine, pyrimidines, and sugars, 
see, for example, PCT publication No. WO 95/04064, which disclosure is hereby incorporated by 
reference in its entirety. Preferred modifications of the present invention include, but are not 
limited to, 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xantine, 4- 

15 acetylcytosine, 5 -(carboxy hydroxy lmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5- 
carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, N6- 
isopentenyladenine, 1-methylguanine, 1 -methyl inosine, 2,2-dimethylguanine, 2-methyladenine, 2- 
methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5- 
methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueosine, 5'- 

20 methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6-isopentenyIadenine, uracil-5- 
oxyacetic acid (v) ybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2- 
thiouracil, 4-thiouracil, 5-methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid, 
5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, and 2,6-diaminopurine. The 
polynucleotide sequences of the invention may be prepared by any known method, including 

25 synthetic, recombinant, ex vivo generation, or a combination thereof, as well as utilizing any 

purification methods known in the art. Methylenemethylimino linked oligonucleosides as well as 
mixed backbone compounds having, may be prepared as described in U.S. Pat. Nos. 5,378,825; 
5,386,023; 5,489,677; 5,602,240; and 5,610,289, which disclosures are hereby incorporated by 
reference in their entireties. Formacetal and thioformacetal linked oligonucleosides may be prepared 

30 as described in U.S. Pat. Nos. 5,264,562 and 5,264,564, which disclosures are hereby incorporated 
by reference in their entireties. Ethylene oxide linked oligonucleosides may be prepared as 
described in U.S. Pat. No. 5,223,618, which disclosure is hereby incorporated by reference in its 
entirety. Phosphinate oligonucleotides may be prepared as described in U.S. Pat. No. 5,508,270, 
which disclosure is hereby incorporated by reference in its entirety. Alkyl phosphonate 

35 oligonucleotides may be prepared as described in U.S. Pat. No. 4,469,863, which disclosure is 
hereby incorporated by reference in its entirety. 3 , -Deoxy-3 , -methylene phosphonate 
oligonucleotides may be prepared as described in U.S. Pat. Nos. 5,610,289 or 5,625,050 which 
disclosures are hereby incorporated by reference in their entireties. Phosphoramidite 
oligonucleotides may be prepared as described in U.S. Pat. No. 5,256,775 or U.S. Pat. No. 
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5,366,878 which disclosures are hereby incorporated by reference in their entireties. 
Alkylphosphonothioate oligonucleotides may be prepared as described in published PCT 
applications WO 94/17093 and WO 94/02499 which disclosures are hereby incorporated by 
reference in their entireties. 3'-Deoxy-3'-amino phosphoramidate oligonucleotides may be prepared 
5 as described in U.S. Pat. No. 5,476,925, which disclosure is hereby incorporated by reference in its 
entirety. Phosphotriester oligonucleotides may be prepared as described in U.S. Pat No. 5,023,243, 
which disclosure is hereby incorporated by reference in its entirety. Borano phosphate 
oligonucleotides may be prepared as described in U.S. Pat. Nos. 5,130,302 and 5,177,198 which 
disclosures are hereby incorporated by reference in their entireties. 

1 0 The term "upstream" is used herein to refer to a location which is toward the 5' end of the 

polynucleotide from a specific reference point. 

The terms "base paired" and "Watson & Crick base paired" are used interchangeably herein 
to refer to nucleotides which can be hydrogen bonded to one another by virtue of their sequence 
identities in a manner like that found in double-helical DNA with thymine or uracil residues linked 

15 to adenine residues by two hydrogen bonds and cytosine and guanine residues linked by three 
hydrogen bonds (see Stryer, 1995, which disclosure is hereby incorporated by reference in its 
entirety). 

The terms "complementary" or "complement thereof are used herein to refer to the 
sequences of polynucleotides which is capable of forming Watson & Crick base pairing with 

20 another specified polynucleotide throughout the entirety of the complementary region. For the 

purpose of the present invention, a first polynucleotide is deemed to be complementary to a second 
polynucleotide when each base in the first polynucleotide is paired with its complementary base. 
Complementary bases are, generally, A and T (or A and U), or C and G. "Complement" is used 
herein as a synonym from "complementary polynucleotide", "complementary nucleic acid" and 

25 "complementary nucleotide sequence". These terms are applied to pairs of polynucleotides based 
solely upon their sequences and not any particular set of conditions under which the two 
polynucleotides would actually bind. Unless otherwise stated, all complementary polynucleotides 
are fully complementary on the whole length of the considered polynucleotide. 

The terms "polypeptide" and "protein", used interchangeably herein, refer to a polymer of 

30 amino acids without regard to the length of the polymer; thus, peptides, oligopeptides, and proteins 
are included within the definition of polypeptide. This term also does not specify or exclude 
chemical or post-expression modifications of the polypeptides of the invention, although chemical 
or post-expression modifications of these polypeptides may be included excluded as specific 
embodiments. Therefore, for example, modifications to polypeptides that include the covalent 

35 attachment of glycosyl groups, acetyl groups, phosphate groups, lipid groups and the like are 
expressly encompassed by the term polypeptide. Further, polypeptides with these modifications 
may be specified as individual species to be included or excluded from the present invention. The 
natural or other chemical modifications, such as those listed in examples above can occur anywhere 
in a polypeptide, including the peptide backbone, the amino acid side-chains and the amino or 
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carboxyl termini. It will be appreciated that the same type of modification may be present in the 
same or varying degrees at several sites in a given polypeptide. Also, a given polypeptide may 
contain many types of modifications. Polypeptides may be branched, for example, as a result of 
ubiquitination, and they may be cyclic, with or without branching. Modifications include 
5 acetylation, acylation, ADP-ribosylation, amidation, covalent attachment of flavin, covalent 

attachment of a heme moiety, covalent attachment of a nucleotide or nucleotide derivative, covalent 
attachment of a lipid or lipid derivative, covalent attachment of phosphotidylinositol, cross-linking, 
cyclization, disulfide bond formation, demethylation, formation of covalent cross-links, formation 
of cysteine, formation of pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI 

10 anchor formation, hydroxylation, iodination, methylation, myristoylation, oxidation, pegylation, 
proteolytic processing, phosphorylation, prenylation, racemization, selenoylation, sulfation, 
transfer-RNA mediated addition of amino acids to proteins such as arginylation, and ubiquitination. 
(See, for instance Creighton (1993); Seifter et al, (1990); Rattan et ah, (1992)). Also included 
within the definition are polypeptides which contain one or more analogs of an amino acid 

15 (including, for example, non-naturally occurring amino acids, amino acids which only occur 

naturally in an unrelated biological system, modified amino acids from mammalian systems, etc.), 
polypeptides with substituted linkages, as well as other modifications known in the art, both 
naturally occurring and non-naturally occurring. 

As used herein, the terms "recombinant polynucleotide" and "polynucleotide construct" are 

20 used interchangeably to refer to linear or circular, purified or isolated polynucleotides that have 
been artificially designed and which comprise at least two nucleotide sequences that are not found 
as contiguous nucleotide sequences in their initial natural environment. In particular, these terms 
mean that the polynucleotide or cDNA is adjacent to "backbone" nucleic acid to which it is not 
adjacent in its natural environment. Additionally, to be "enriched" the cDNAs will represent 5% or 

25 more of the number of nucleic acid inserts in a population of nucleic acid backbone molecules. 
Backbone molecules according to the present invention include nucleic acids such as expression 
vectors, self-replicating nucleic acids, viruses, integrating nucleic acids, and other vectors or nucleic 
acids used to maintain or manipulate a nucleic acid insert of interest. Preferably, the enriched 
cDNAs represent 15% or more of the number of nucleic acid inserts in the population of 

30 recombinant backbone molecules. More preferably, the enriched cDNAs represent 50% or more of 
the number of nucleic acid inserts in the population of recombinant backbone molecules. In a 
highly preferred embodiment, the enriched cDNAs represent 90% or more (including any number 
between 90 and 100%, to the thousandth position, e.g., 99.5%) of the number of nucleic acid inserts 
in the population of recombinant backbone molecules. 

35 The term "recombinant polypeptide" is used herein to refer to polypeptides that have been 

artificially designed and which comprise at least two polypeptide sequences that are not found as 
contiguous polypeptide sequences in their initial natural environment, or to refer to polypeptides 
which have been expressed from a recombinant polynucleotide. 
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As used herein, the term "operably linked" refers to a linkage of polynucleotide elements in 
a functional relationship. A sequence which is "operably linked" to a regulatory sequence such as a 
promoter means that said regulatory element is in the correct location and orientation in relation to 
the nucleic acid to control RNA polymerase initiation and expression of the nucleic acid of interest. 
5 For instance, a promoter or enhancer is operably linked to a coding sequence if it affects the 
transcription of the coding sequence. 

As used herein, the term "non-human animal" refers to any non-human animal, including 
insects, birds, rodents and more usually mammals. Preferred non-human animals include: primates; 
farm animals such as swine, goats, sheep, donkeys, cattle, horses, chickens, rabbits; and rodents, 
10 preferably rats or mice. As used herein, the term "animal" is used to refer to any species in the 
animal kingdom, preferably vertebrates, including birds and fish, and more preferable a mammal. 
Both the terms "animal" and "mammal" expressly embrace human subjects unless preceded with 
the term "non-human". 

The term "domain" refers to an amino acid fragment with specific biological properties. 
15 This term encompasses all known structural and linear biological motifs. Examples of such motifs 
include but are not limited to leucine zippers, helix-turn-helix motifs, glycosylation sites, 
ubiquitination sites, alpha helices, and beta sheets, signal peptides which direct the secretion of 
proteins, sites for post-translational modification, enzymatic active sites, substrate binding sites, and 
enzymatic cleavage sites. 

20 Although each of these terms has a distinct meaning, the terms "comprising", "consisting 

of and "consisting essentially of may be interchanged for one another throughout the instant 
application. The term "having" has the same meaning as "comprising" and may be replaced with 
either the term "consisting of or "consisting essentially of. 

Unless otherwise specified in the application, nucleotides and amino acids of 

25 polynucleotides and polypeptides, respectively, of the present invention are contiguous and not 
interrupted by heterologous sequences. 

Identity Between Nucleic Acids Or Polypeptides 

The terms "percentage of sequence identity" and "percentage homology" are used 
interchangeably herein to refer to comparisons among polynucleotides and polypeptides, and are 

30 determined by comparing two optimally aligned sequences over a comparison window, wherein the 
portion of the polynucleotide or polypeptide sequence in the comparison window may comprise 
additions or deletions (i.e., gaps) as compared to the reference sequence (which does not comprise 
additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by 
determining the number of positions at which the identical nucleic acid base or amino acid residue 

35 occurs in both sequences to yield the number of matched positions, dividing the number of matched 
positions by the total number of positions in the window of comparison and multiplying the result 
by 100 to yield the percentage of sequence identity. Homology is evaluated using any of the variety 
of sequence comparison algorithms and programs known in the art. Such algorithms and programs 
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include, but are by no means limited to, TBLASTN, BLASTP, FASTA, TFASTA, CLUSTALW, 
FASTDB (Pearson and Lipman, 1988; Altschul et aL, 1990; Thompson et aL, 1994; Higgins et 
aL, 1996; Altschul et aL, 1990; Altschul et aL, 1993; Brutlag et al, 1990), the disclosures of which 
are incorporated by reference in their entireties. 
5 In a particularly preferred embodiment, protein and nucleic acid sequence homologies are 

evaluated using the Basic Local Alignment Search Tool ("BLAST") which is well known in the art 
(see, e.g., Karlin and Altschul, 1990; Altschul et aL, 1990, 1993, 1997), the disclosures of which 
are incorporated by reference in their entireties. In particular, five specific BLAST programs are 
used to perform the following task: 
10 (1) BLASTP and BLAST3 compare an amino acid query sequence against a protein 

sequence database; 

(2) BLASTN compares a nucleotide query sequence against a nucleotide sequence 
database; 

(3) BLASTX compares the six-frame conceptual translation products of a query nucleotide 
1 5 sequence (both strands) against a protein sequence database; 

(4) TBLASTN compares a query protein sequence against a nucleotide sequence database 
translated in all six reading frames (both strands); and 

(5) TBLASTX compares the six-frame translations of a nucleotide query sequence against 
the six-frame translations of a nucleotide sequence database. 

20 The BLAST programs identify homologous sequences by identifying similar segments, 

which are referred to herein as "high-scoring segment pairs," between a query amino or nucleic acid 
sequence and a test sequence which is preferably obtained from a protein or nucleic acid sequence 
database. High-scoring segment pairs are preferably identified (i.e., aligned) by means of a scoring 
matrix, many of which are known in the art. Preferably, the scoring matrix used is the BLOSUM62 

25 matrix (Gonnet et aL, 1992; Henikoff and Henikoff, 1993, the disclosures of which are 

incorporated by reference in their entireties). Less preferably, the PAM or PAM250 matrices may 
also be used (see, e.g., Schwartz and Dayhoff, eds., 1978, the disclosure of which is incorporated by 
reference in its entirety). The BLAST programs evaluate the statistical significance of all high- 
scoring segment pairs identified, and preferably selects those segments which satisfy a user- 

30 specified threshold of significance, such as a user-specified percent homology. Preferably, the 
statistical significance of a high-scoring segment pair is evaluated using the statistical significance 
formula of Karlin (see, e.g., Karlin and Altschul, 1990), the disclosure of which is incorporated by 
reference in its entirety. The BLAST programs may be used with the default parameters or with 
modified parameters provided by the user. 

35 Another preferred method for determining the best overall match between a query 

nucleotide sequence (a sequence of the present invention) and a subject sequence, also referred to as 
a global sequence alignment, can be determined using the FASTDB computer program based on the 
algorithm of Brutlag et aL (1990), the disclosure of which is incorporated by reference in its 
entirety. In a sequence alignment the query and subject sequences are both DMA sequences. An 
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RNA sequence can be compared by first converting U's to T's. The result of said global sequence 
alignment is in percent identity. Preferred parameters used in a FASTDB alignment of DNA 
sequences to calculate percent identity are: Matrix = Unitary, k-tuple = 4, Mismatch Penalty = 1 , 
Joining Penalty = 30, Randomization Group Length = 0, Cutoff Score = 1 , Gap Penalty = 5, Gap 
5 Size Penalty = 0.05, Window Size = 500 or the length of the subject nucleotide sequence, 

whichever is shorter. If the subject sequence is shorter than the query sequence because of 5* or 3* 
deletions, not because of internal deletions, a manual correction must be made to the results. This is 
because the FASTDB program does not account for 5' and 3' truncations of the subject sequence 
when calculating percent identity. For subject sequences truncated at the 5' or 3'ends, relative to the 

10 query sequence, the percent identity is corrected by calculating the number of bases of the query 
sequence that are 5* and 3* of the subject sequence, which are not matched/aligned, as a percent of 
the total bases of the query sequence. Whether a nucleotide is matched/aligned is determined by 
results of the FASTDB sequence alignment. This percentage is then subtracted from the percent 
identity, calculated by the above FASTDB program using 10, the specified parameters, to arrive at a 

15 final percent identity score. This corrected score is what is used for the purposes of the present 
invention. Only nucleotides outside the 5' and 3 r nucleotides of the subject sequence, as displayed 
by the FASTDB alignment, which are not matched/aligned with the query sequence, are calculated 
for the purposes of manually adjusting the percent identity score. For example, a 90 nucleotide 
subject sequence is aligned to a 100 nucleotide query sequence to determine percent identity. The 

20 deletions occur at the 5' end of the subject sequence and therefore, the FASTDB alignment does not 
show a matched/alignment of the first 10 nucleotides at 5' end. The 10 unpaired nucleotides 
represent 10% of the sequence (number of nucleotides at the 5' and 3' ends not matched/total 
number of nucleotides in the query sequence) so 10% is subtracted from the percent identity score 
calculated by the FASTDB program. If the remaining 90 nucleotides were perfectly matched the 

25 final percent identity would be 90%. In another example, a 90 nucleotide subject sequence is 
compared with a 1 00 nucleotide query sequence. This time the deletions are internal deletions so 
that there are no nucleotides on the 5' or 3 1 of the subject sequence which are not matched/aligned 
with the query. In this case the percent identity calculated by FASTDB is not manually corrected. 
Once again, only nucleotides 5' and 3* of the subject sequence which are not matched/aligned with 

30 the query sequence are manually corrected. No other manual corrections are made for the purposes 
of the present invention. 

Another preferred method for determining the best overall match between a query amino 
acid sequence (a sequence of the present invention) and a subject sequence, also referred to as a 
global sequence alignment, can be determined using the FASTDB computer program based on the 

35 algorithm of Brutlag et al. (1990). In a sequence alignment the query and subject sequences are 
both amino acid sequences. The result of said global sequence alignment is in percent identity. 
Preferred parameters used in a FASTDB amino acid alignment are: Matrix = PAM 0, k-tuple = 2, 
Mismatch Penalty = 1, Joining Penalty = 20, Randomization Group Length = 0, Cutoff Score = 1, 
Window Size = sequence length, Gap Penalty = 5, Gap Size Penalty = 0.05, Window Size = 500 or 
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the length of the subject amino acid sequence, whichever is shorter. If the subject sequence is 
shorter than the query sequence due to N-or C-terminal deletions, not because of internal deletions, 
the results, in percent identity, must be manually corrected. This is because the FASTDB program 
does not account for N- and C-terminal truncations of the subject sequence when calculating global 
5 percent identity. For subject sequences truncated at the N- and C-termini, relative to the query 
sequence, the percent identity is corrected by calculating the number of residues of the query 
sequence that are N- and C- terminal of the subject sequence, which are not matched/aligned with a 
corresponding subject residue, as a percent of the total bases of the query sequence. Whether a 
residue is matched/aligned is determined by results of the FASTDB sequence alignment. This 

10 percentage is then subtracted from the percent identity, calculated by the above FASTDB program 
using the specified parameters, to arrive at a final percent identity score. This final percent identity 
score is what is used for the purposes of the present invention. Only residues to the N- and C- 
termini of the subject sequence, which are not matched/aligned with the query sequence, are 
considered for the purposes of manually adjusting the percent identity score. That is, only query 

1 5 amino acid residues outside the farthest N- and C-terminal residues of the subject sequence. For 
example, a 90 amino acid residue subject sequence is aligned with a 100-residue query sequence to 
determine percent identity. The deletion occurs at the N-terminus of the subject sequence and 
therefore, the FASTDB alignment does not match/align with the first residues at the N-terminus. 
The 10 unpaired residues represent 10% of the sequence (number of residues at the N- and C- 

20 termini not matched/total number of residues in the query sequence) so 10% is subtracted from the 
percent identity score calculated by the FASTDB program. If the remaining 90 residues were 
perfectly matched the final percent identity would be 90%. In another example, a 90-residue 
subject sequence is compared with a 100-residue query sequence. This time the deletions are 
internal so there are no residues at the N- or C-termini of the subject sequence, which are not 

25 matched/aligned with the query. In this case the percent identity calculated by FASTDB is not 
manually corrected. Once again, only residue positions outside the N- and C-terminal ends of the 
subject sequence, as displayed in the FASTDB alignment, which are not matched/aligned with the 
query sequence are manually corrected. No other manual corrections are made for the purposes of 
the present invention. 

30 The term "percentage of sequence similarity" refers to comparisons between polypeptide 

sequences and is determined by comparing two optimally aligned sequences over a comparison 
window, wherein the portion of the polypeptide sequence in the comparison window may comprise 
additions or deletions (i.e., gaps) as compared to the reference sequence (which does not comprise 
additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by 

35 determining the number of positions at which an identical or equivalent amino acid residue occurs 
in both sequences to yield the number of matched positions, dividing the number of matched 
positions by the total number of positions in the window of comparison and multiplying the result 
by 100 to yield the percentage of sequence similarity. Similarity is evaluated using any of the 
variety of sequence comparison algorithms and programs known in the art, including those 
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Equivalent amino acid residues are defined herein in the "Mutated 



Polynucleotides of the invention 

5 The present invention concerns GENSET genomic and cDNA sequences. The present 

invention encompasses GENSET genes, polynucleotides comprising GENSET genomic and cDNA 
sequences, as well as fragments and variants thereof. These polynucleotides may be purified, 
isolated, or recombinant. 

Also encompassed by the present invention are allelic variants, orthologs, splice variants, 

10 and/or species homologues of the GENSET genes. Procedures known in the art can be used to 
obtain full-length genes and cDNAs, allelic variants, splice variants, full-length coding portions, 
orthologs, and/or species homologues of genes and cDNAs corresponding to a nucleotide sequence 
selected from the group consisting of sequences of SEQ ID NOs: 1-169, 339-455, 561-784 and 
sequences of clone inserts of the deposited clone pool, using information from the sequences 

1 5 disclosed herein or the clone pool deposited with the ATCC or other depositary authority. For 
example, allelic variants, orthologs and/or species homologues may be isolated and identified by 
making suitable probes or primers from the sequences provided herein and screening a suitable 
nucleic acid source for allelic variants and/or the desired homologue using any technique known to 
those skilled in the art including those described into the section entitled "To find similar 

20 sequences". 

In a specific embodiment, the polynucleotides of the invention are at least 15, 30, 50, 100, 
125, 500, or 1000 continuous nucleotides. In another embodiment, the polynucleotides are less than 
or equal to 300kb, 200kb, lOOkb, 50kb, lOkb, 7.5kb, 5kb, 2.5kb, 2kb, 1.5kb, or lkb in length. In a 
further embodiment, polynucleotides of the invention comprise a portion of the coding sequences, 
25 as disclosed herein, but do not comprise all or a portion of any intron. In another embodiment, the 
polynucleotides comprising coding sequences do not contain coding sequences of a genomic 
flanking gene (i.e., 5' or 3' to the gene of interest in the genome). In other embodiments, the 
polynucleotides of the invention do not contain the coding sequence of more than 1000, 500, 250, 
100, 75, 50, 25, 20, 15, 10, 5, 4, 3, 2, or 1 naturally occurring genomic flanking gene(s). 

30 Deposited clone pool of the invention 

Expression of GENSET genes has been shown to lead to the production of at least one 
mRNA species per GENSET gene, which cDNA sequence is set forth in the appended sequence 
listing as SEQ ID NOs:l-169, 339-455, 561-784. The cDNAs (SEQ ID NOs:l-169, 339-455, 561- 
784) corresponding to these GENSET mRNA species were cloned either in the vector pBluescriptll 
35 SK" (Stratagene) or in a vector called pPT. Cells containing the cloned cDNAs of the present 
invention are maintained in permanent deposit by the inventors at Genset, S.A., 24 Rue Royale, 
75008 Paris, France. Each cDNA can be removed from the Bluescript vector in which it was 
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inserted by performing a NotI Pst I double digestion, or from the pPT vector by performing a Muni 
Hindi II double digestion, to produce the appropriate fragment for each clone, provided the cDNA 
sequence does not contain any of the corresponding restriction sites within its sequence. 
Alternatively, other restriction enzymes of the multicloning site of the vector may be used to 
5 recover the desired insert as indicated by the manufacturer. 

Pools of cells containing certain cDNAs of the invention, from which the cells containing a 
particular polynucleotide is obtainable, have also been deposited with the American Tissue Culture 
Collection (ATCC), 10801 University Boulevard, Manassas, VA 201 10-2209, United States. These 
cDNA clones have been transfected into separate bacterial cells (E-coli) for these composite 
10 deposits. 

Bacterial cells containing a particular clone can be obtained from the composite deposit as 

follows: 

An oligonucleotide probe or probes should be designed to the sequence that is known for 
that particular clone. This sequence can be derived from the sequences provided herein, or from a 
15 combination of those sequences. The design of the oligonucleotide probe should preferably follow 
these parameters: 

(a) It should be designed to an area of the sequence which has the fewest ambiguous bases 
("N's"), if any; 

(b) Preferably, the probe is designed to have a Tm of approximately 80 degrees Celsius 
20 (assuming 2 degrees for each A or T and 4 degrees for each G or C). However, probes having 

melting temperatures between 40 degrees Celsius and 80 degrees Celsius may also be used provided 
that specificity is not lost. 

The oligonucleotide should preferably be labeled with gamma[ 32 P] ATP (specific activity 
6000 Ci/mmole) and T4 polynucleotide kinase using commonly employed techniques for labeling 
25 oligonucleotides. Other labeling techniques can also be used. Unincorporated label should 

preferably be removed by gel filtration chromatography or other established methods. The amount 
of radioactivity incorporated into the probe should be quantified by measurement in a scintillation 
counter. Preferably, specific activity of the resulting probe should be approximately 4x1 0 6 
dpm/pmole. 

30 The bacterial culture containing the pool of full-length clones should preferably be thawed 

and 100 ul of the stock used to inoculate a sterile culture flask containing 25 ml of sterile L-broth 
containing ampicillin at 100 ug/ml. The culture should preferably be grown to saturation at 37 
degrees Celsius, and the saturated culture should preferably be diluted in fresh L-broth. Aliquots of 
these dilutions should preferably be plated to determine the dilution and volume which will yield 

35 approximately 5000 distinct and well-separated colonies on solid bacteriological media containing 
L-broth containing ampicillin at 100 ug/ml and agar at 1.5% in a 150 mm petri dish when grown 
overnight at 37 degrees Celsius. Other known methods of obtaining distinct, well-separated 
colonies can also be employed. 
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Standard colony hybridization procedures should then be used to transfer the colonies to 
nitrocellulose filters and lyse, denature and bake them. 

The filter is then preferably incubated at 65 degrees Celsius for 1 hour with gentle agitation 
in 6X SSC (20X stock is 1 75.3 g NaC 1/liter, 88.2 g Na citrate/liter, adjusted to pH 7.0 with NaOH) 
5 containing 0.5% SDS, 100 pg/ml of yeast RNA, and 10 mM EDTA (approximately 10 ml per 150 
mm filter). Preferably, the probe is then added to the hybridization mix at a concentration greater 
than or equal to lxl 0 6 dpm/ml. The filter is then preferably incubated at 65 degrees Celsius with 
gentle agitation overnight. The filter is then preferably washed in 500 ml of 2X SSC/0.1% SDS at 
room temperature with gentle shaking for 15 minutes. A third wash with 0.1X SSC/0.5% SDS at 65 
10 degrees Celsius for 30 minutes to 1 hour is optional. The filter is then preferably dried and 

subjected to autoradiography for sufficient time to visualize the positives on the X-ray film. Other 
known hybridization methods can also be employed. 

The positive colonies are picked, grown in culture, and plasmid DNA isolated using 
standard procedures. The clones can then be verified by restriction analysis, hybridization analysis, 
1 5 or DNA sequencing. The plasmid DNA obtained using these procedures may then be manipulated 
using standard cloning techniques familiar to those skilled in the art. 

Alternatively, to recover cDNA inserts from the pool of bacteria, a PCR can be performed 
on plasmid DNA isolated using standard procedures and primers designed at both ends of the cDNA 
insertion, including primers designed in the multicloning site of the vector. If a specific cDNA of 
20 interest is to be recovered, primers may be designed in order to be specific for the 5' end and the 3' 
end of this cDNA using sequence information available from the appended sequence listing. The 
PCR product which corresponds to the cDNA of interest can then be manipulated using standard 
cloning techniques familiar to those skilled in the art. 

Therefore, an object of the invention is an isolated, purified, or recombinant polynucleotide 
25 comprising a nucleotide sequence selected from the group consisting of cDNA inserts of the 
deposited clone pool. Moreover, preferred polynucleotides of the invention include purified, 
isolated, or recombinant GENSET cDNAs consisting of, consisting essentially of, or comprising a 
nucleotide sequence selected from the group consisting of cDNA inserts of the deposited clone 
pool. 

30 cDNA sequences of the invention 

Another object of the invention is a purified, isolated, or recombinant polynucleotide 
comprising a nucleotide sequence selected from the group consisting of sequences of SEQ ID 
NOs:l-169, 339-455, 561-784, complementary sequences thereto, and fragments thereof. 
Moreover, preferred polynucleotides of the invention include purified, isolated, or recombinant 
35 GENSET cDNAs consisting of, consisting essentially of, or comprising a sequence selected from 
the group consisting of SEQ ID NOs:l-169, 339-455, 561-784. 

Accordingly, the coding sequence (CDS) or open reading frame (ORF) of each cDNA of 
the invention refers to the nucleotide sequence beginning with the first nucleotide of the start codon 
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and ending with the last nucleotide of the stop codon. Similarly, the 5 5 untranslated region (or 
5'UTR) of each cDNA of the invention refers to the nucleotide sequence starting at nucleotide 1 
and ending at the nucleotide immediately 5' to the first nucleotide of the start codon. The 3' 
untranslated region (or 3'UTR) of each cDNA of the invention refers to the nucleotide sequence 
5 starting at the nucleotide immediately 3' to the last nucleotide of the stop codon and ending at the 
last nucleotide of the cDNA. 

Untranslated regions 

In addition, the invention concerns a purified, isolated, and recombinant nucleic acid 
comprising a nucleotide sequence selected from the group consisting of the 5'UTRs of sequences of 

10 SEQ ID NOs: 1-1 69, 339-455, 561-784 and sequences of clone inserts of the deposited clone pool, 
sequences complementary thereto, and allelic variants thereof. The invention also concerns a 
purified, isolated, and/or recombinant nucleic acid comprising a nucleotide sequence selected from 
the group consisting of the 3'UTRs of sequences of SEQ ID NOs: 1-169, 339-455, 561-784 and 
sequences of clone inserts of the deposited clone pool, sequences complementary thereto, and allelic 

1 5 variants thereof. 

These polynucleotides may be used to detect the presence of GENSET mRNA species in a 
biological sample using either hybridization or RT-PCR techniques well known to those skilled in 
the art. 

In addition, these polynucleotides may be used as regulatory molecules able to affect the 
20 processing and maturation of any polynucleotide including them (either a GENSET polynucleotide 
or an heterologous polynucleotide), preferably the localization, stability and/or translation of said 
polynucleotide including them (for a review on UTRs see Decker and Parker, 1995, Derrigo et al, 
2000). In particular, 3'UTRs may be used in order to control the stability of heterologous mRNAs 
in recombinant vectors using any methods known to those skilled in the art including Makrides 
25 ((1999) Protein Expr Purif 1999 Nov; 1 7(2): 1 83-202), US Patents 5,925,564; 5,807,707 and 
5,756,264, which disclosures are hereby incorporated by reference in their entireties. 

Coding sequences 

Another object of the invention is an isolated, purified or recombinant polynucleotide 
comprising the coding sequence of a sequence selected from the group consisting of sequences of 
30 SEQ ID NOs: 1-169, 339-455, 561-784, clone inserts of the deposited clone pool, and variants 
thereof. 

A further object of the invention is an isolated, purified or recombinant polynucleotide 
encoding a polypeptide comprising a sequence selected from the group consisting of sequences of 
SEQ ID NOs: 170-338, 456-560, 785-918 and allelic variants thereof Another object of the 
35 invention is an isolated, purified or recombinant polynucleotide encoding a polypeptide comprising 
a sequence selected from the group consisting of polypeptides encoded by cDNA inserts of the 
deposited clone pool and allelic variants thereof. 
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It will be appreciated that should the extent of the coding sequence differ from that 
indicated in the appended sequence listing as a result of a sequencing error, reverse transcription or 
amplification error, mRNA splicing, post-translational modification of the encoded protein, 
enzymatic cleavage of the encoded protein, or other biological factors, one skilled in the art would 
5 be readily able to identify the extent of the coding sequences in the sequences of SEQ ID NOsil- 
169, 339-455, 561-784. Accordingly, the scope of any claims herein relating to nucleic acids 
containing the coding sequence of one of SEQ ID NOs: 1-169, 339-455, 561-784 is not to be 
construed as excluding any readily identifiable variations from or equivalents to the coding 
sequences described in the appended sequence listing. Equivalents includes any alterations in a 

10 nucleotide coding sequence that does not result in an amino acid change, or that results in a 
conservative amino acid substitution, as defined below, in the polypeptide encoded by the 
nucleotide sequence. Similarly, should the extent of the polypeptides differ from those indicated in 
the appended sequence listing as a result of any of the preceding factors, the scope of claims 
relating to polypeptides comprising the amino acid sequence of the polypeptides of SEQ ID 

15 NOs: 170-338, 456-560, 785-918 is not to be construed as excluding any readily identifiable 
variations from or equivalents to the sequences described in the appended sequence listing. 

The above-disclosed polynucleotides that contain the coding sequence of the GENSET 
genes may be expressed in a desired host cell or a desired host organism, when this polynucleotide 
is placed under the control of suitable expression signals. The expression signals may be either the 

20 expression signals contained in the regulatory regions in the GENSET genes of the invention or, in 
contrast, the signals may be exogenous regulatory nucleic sequences. Such a polynucleotide, when 
placed under the suitable expression signals, may also be inserted in a vector for its expression 
and/or amplification. 

Further included in the present invention are polynucleotides encoding the polypeptides of 
25 the present invention that are fused in frame to the coding sequences for additional heterologous 
amino acid sequences. Also included in the present invention are nucleic acids encoding 
polypeptides of the present invention together with additional, non-coding sequences, including, but 
not limited to, non-coding 5' and 3' sequences, vector sequence, sequences used for purification, 
probing, or priming. For example, heterologous sequences include transcribed, untranslated 
30 sequences that may play a role in transcription and mRNA processing, such as ribosome binding 
and stability of mRNA. The heterologous sequences may alternatively comprise additional coding 
sequences that provide additional functionalities. Thus, a nucleotide sequence encoding a 
polypeptide may be fused to a tag sequence, such as a sequence encoding a peptide that facilitates 
purification or detection of the fused polypeptide. In certain preferred embodiments of this aspect 
35 of the invention, the tag amino acid sequence is a hexa-histidine peptide, such as the tag provided in 
a pQE vector (QIAGEN), or in any of a number of additional, commercially available vectors. For 
instance, hexa-histidine provides for the convenient purification of the fusion protein (see, Gentz et 
a/., 1989, Proc Natl Acad Sci U S A Feb;86(3):821-4, the disclosure of which is incorporated by 
reference in its entirety). The "HA" tag is another peptide useful for purification which corresponds 
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to an epitope derived from the influenza hemagglutinin protein (see, Wilson et ah, 1984, Cell 
Jul;37(3):767-78, the disclosure of which is incorporated by reference in its entirety). As discussed 
below, other such fusion proteins include a GENSET polypeptide fused to Fc at the N- or 
C-terminus. 

5 Suitable recombinant vectors that contain a polynucleotide such as described herein are 

disclosed elsewhere in the specification. Expression vectors encoding GENSET polypeptides or 
fragments thereof are described in the section entitled "Preparation of the polypeptides". 

Regulatory sequences of the invention 

As mentioned, the genomic sequence of GENSET genes contain regulatory sequences in 
10 the non-coding 5'-flanking region and possibly in the non-coding 3'-flanking region that border the 
GENSET polypeptide coding regions containing the exons of these genes. 

Polynucleotides derived from GENSET polynucleotide 5' and 3' regulatory regions are 
useful in order to detect the presence of at least a copy of a genomic nucleotide sequence of the 
GENSET gene or a fragment thereof in a test sample. 

15 Preferred regulatory sequences 

Polynucleotides carrying the regulatory elements located at the 5' end and at the 3' end of 
GENSET polypeptide coding regions may be advantageously used to control, e.g., the 
transcriptional and translational activity of a heterologous polynucleotide of interest. 

Thus, the present invention also concerns a purified or isolated nucleic acid comprising a 

20 polynucleotide which is selected from the group consisting of the 5' and 3' GENSET 

polynucleotide regulatory regions, sequences complementary thereto, regulatory active fragments 
and variants thereof. The invention also pertains to a purified or isolated nucleic acid comprising a 
polynucleotide having at least 95% nucleotide identity with a polynucleotide selected from the 
group consisting of GENSET polynucleotide 5' and 3' regulatory regions, advantageously 99 % 

25 nucleotide identity, preferably 99.5% nucleotide identity and most preferably 99.8% nucleotide 

identity with a polynucleotide selected from the group consisting of GENSET polynucleotide 5' and 
3' regulatory regions, sequences complementary thereto, variants and regulatory active fragments 
thereof. 

Another object of the invention consists of purified, isolated or recombinant nucleic acids 
30 comprising a polynucleotide that hybridizes, under the stringent hybridization conditions defined 
herein, with a polynucleotide selected from the group consisting of the nucleotide sequences of 
GENSET polynucleotide 5' and 3' regulatory regions, sequences complementary thereto, variants 
and regulatory active fragments thereof. 

Preferred fragments of 5' regulatory regions have a length of about 1500 or 1000 
35 nucleotides, preferably of about 500 nucleotides, more preferably about 400 nucleotides, even more 
preferably 300 nucleotides and most preferably about 200 nucleotides. 
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Preferred fragments of 3' regulatory regions are at least 20, 50, 100, 150, 200, 300 or 400 
bases in length. 

"Regulatory active" polynucleotide derivatives of the 5' or 3' regulatory region are 
polynucleotides comprising or alternatively consisting of a fragment of said polynucleotide which is 
5 functional as a regulatory region for expressing a recombinant polypeptide or a recombinant 

polynucleotide in a recombinant cell host. It could act either as an enhancer or as a repressor. For 
the purpose of the invention, a nucleic acid or polynucleotide is "functional" as a regulatory region 
for expressing a recombinant polypeptide or a recombinant polynucleotide if said regulatory 
polynucleotide contains nucleotide sequences which contain transcriptional and translational 

10 regulatory information, and such sequences are "operably linked" to nucleotide sequences which 
encode the desired polypeptide or the desired polynucleotide. 

The regulatory polynucleotides of the invention may be prepared from the nucleotide 
sequence of GENSET genomic or cDNA sequence, for example, by cleavage using suitable 
restriction enzymes, or by PCR. The regulatory polynucleotides may also be prepared by digestion 

15 of a GENSET gene-containing genomic clone by an exonuclease enzyme, such as Bal3 1 (Wabiko et 
aL, DNA 5(4): 305- 14 (1986), the disclosure of which is incorporated by reference in its entirety). 
These regulatory polynucleotides can also be prepared by nucleic acid chemical synthesis, as 
described elsewhere in the specification. 

The regulatory polynucleotides according to the invention may be part of a recombinant 

20 expression vector that may be used to express a coding sequence in a desired host cell or host 

organism. The recombinant expression vectors according to the invention are described elsewhere 
in the specification. 

Preferred 5'-regulatory polynucleotides of the invention include 5'-UTRs of GENSET 
cDNAs, or regulatory active fragments or variants thereof. More preferred 5 '-regulatory 
25 polynucleotides of the invention include sequences selected from the group consisting of 5'-UTRs 
of sequences of SEQ ID NOs:l-169, 339-455, 561-784, 5'-UTRs of clone inserts of the deposited 
clone pool, regulatory active fragments and variants thereof. 

Preferred 3 '-regulatory polynucleotide of the invention include 3'-UTRs of GENSET 
cDNAs, or regulatory active fragments or variants thereof. More preferred 3 '-regulatory 
30 polynucleotides of the invention include sequences selected from the group consisting of 3'-UTRs 
of sequences of SEQ ID NOs:l-169, 339-455, 561-784, 3'-UTRs of clone inserts of the deposited 
clone pool, regulatory active fragments and variants thereof. 

A further object of the invention consists of a purified or isolated nucleic acid comprising: 
a) a polynucleotide comprising a 5' regulatory nucleotide sequence selected from the group 
35 consisting of: 

(i) a nucleotide sequence comprising a polynucleotide of a GENSET polynucleotide 5' 
regulatory region or a complementary sequence thereto; 
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(ii) a nucleotide sequence comprising a polynucleotide having at least 95% of nucleotide 
identity with the nucleotide sequence of a GEN SET polynucleotide 5' regulatory region or a 
complementary sequence thereto; 

(iii) a nucleotide sequence comprising a polynucleotide that hybridizes under stringent 

5 hybridization conditions with the nucleotide sequence of a GENSET polynucleotide 5' regulatory 
region or a complementary sequence thereto; and 

(iv) a regulatory active fragment or variant of the polynucleotides in (i), (ii) and (iii); 

b) a nucleic acid molecule encoding a desired polypeptide or a nucleic acid molecule of 
interest, wherein said nucleic acid molecule is operably linked to the polynucleotide defined in (a); 

10 and 

c) optionally, a polynucleotide comprising a 3'- regulatory polynucleotide, preferably a 3'- 
regulatory polynucleotide of a GENSET gene. 

In a specific embodiment, the nucleic acid defined above includes the 5'-UTR of a 

GENSET cDNA, or a regulatory active fragment or variant thereof. 
15 In a second specific embodiment, the nucleic acid defined above includes the 3 5 -UTR of a 

GENSET cDNA, or a regulatory active fragment or variant thereof. 

The regulatory polynucleotide of the 5' regulatory region, or its regulatory active fragments 

or variants, is operably linked at the 5'-end of the nucleic acid molecule encoding the desired 

polypeptide or nucleic acid molecule of interest. 
20 The regulatory polynucleotide of the 3' regulatory region, or its regulatory active fragments 

or variants, is advantageously operably linked at the 3 '-end of the nucleic acid molecule encoding 

the desired polypeptide or nucleic acid molecule of interest. 

The desired polypeptide encoded by the above-described nucleic acid may be of various 

nature or origin, encompassing proteins of prokaryotic viral or eukaryotic origin. Among the 
25 polypeptides expressed under the control of a GENSET polynucleotide regulatory region include 

bacterial, fungal or viral antigens. Also encompassed are eukaryotic proteins such as intracellular 

proteins, such as "house-keeping" proteins, membrane-bound proteins, such as mitochondrial 

membrane-bound proteins and cell surface receptors, and secreted proteins such as endogenous 

mediators such as cytokines. The desired polypeptide may be an heterologous polypeptide or a 
30 GENSET polypeptide, especially a protein with an amino acid sequence selected from the group 

consisting of sequences of SEQ ID NOs: 170-338, 456-560, 785-918, fragments and variants 

thereof. 

The desired nucleic acids encoded by the above-described polynucleotides, usually an RNA 
molecule, may be complementary to a desired coding polynucleotide, for example to a GENSET 
35 coding sequence, and thus useful as an antisense polynucleotide. Such a polynucleotide may be 
included in a recombinant expression vector in order to express the desired polypeptide or the 
desired nucleic acid in host cell or in a host organism. Suitable recombinant vectors that contain a 
polynucleotide such as described herein are disclosed elsewhere in the specification. 
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Polynucleotide variants 

The invention also relates to variants of the polynucleotides described herein and fragments 
thereof. "Variants" of polynucleotides, as the term is used herein, are polynucleotides that differ 
from a reference polynucleotide. Generally, differences are limited so that the nucleotide sequences 
5 of the reference and the variant are closely similar overall and, in many regions, identical. The 
present invention encompasses both allelic variants and degenerate variants. 

Examples of variant sequences of polynucleotides of the invention are given in the 
appended sequence listing. Specifically, Table I includes sequences for which a plurality of closely 
related sequences, e.g. variants, are provided. 

10 Allelic variants 

A variant of a polynucleotide may be a naturally occurring variant such as a naturally 
occurring allelic variant, or it may be a variant that is not known to occur naturally. By an "allelic 
variant" is intended one of several alternate forms of a gene occupying a given locus on a 
chromosome of an organism (see Lewin, 1990), the disclosure of which is incorporated by reference 
1 5 in its entirety. Diploid organisms may be homozygous or heterozygous for an allelic form. Non- 
naturally occurring variants of the polynucleotide may be made by art-known mutagenesis 
techniques, including those applied to polynucleotides, cells or organisms. See, for example, Table 
I, which includes sequences for which a plurality of closely related sequences, e.g. allelic variants of 
a single gene, are provided. 

20 Degenerate variant 

In addition to the isolated polynucleotides of the present invention, and fragments thereof, 
the invention further includes polynucleotides which comprise a sequence substantially different 
from those described above but which, due to the degeneracy of the genetic code, still encode a 
GENSET polypeptide of the present invention. These polynucleotide variants are referred to as 

25 "degenerate variants" throughout the instant application. That is, all possible polynucleotide 

sequences that encode the GENSET polypeptides of the present invention are contemplated. This 
includes the genetic code and species-specific codon preferences known in the art. Thus, it would 
be routine for one skilled in the art to generate the degenerate variants described above, for instance, 
to optimize codon expression for a particular host (e.g., change codons in the human mRNA to 

30 those preferred by other mammalian or bacterial host cells). 

Nucleotide changes present in a variant polynucleotide may be silent, which means that 
they do not alter the amino acids encoded by the polynucleotide. However, nucleotide changes may 
also result in amino acid substitutions, additions, deletions, fusions and truncations in the 
polypeptide encoded by the reference sequence. The substitutions, deletions or additions may 

35 involve one or more nucleotides. The variants may be altered in coding or non-coding regions or 
both. Alterations in the coding regions may produce conservative or non-conservative amino acid 
substitutions, deletions or additions. In the context of the present invention, preferred embodiments 
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are those in which the polynucleotide variants encode polypeptides which retain substantially the 
same biological properties or activities as the GENSET protein. More preferred polynucleotide 
variants are those containing conservative substitutions. 

Similar polynucleotides 

5 Other embodiments of the present invention provide a purified, isolated or recombinant 

polynucleotide which is at least 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to a 
polynucleotide selected from the group consisting of sequences of SEQ ID NOs:l-169, 339-455, 
561-784 and the clone inserts of the deposited clone pool. The above polynucleotides are included 
regardless of whether they encode a polypeptide having a GENSET biological activity. This is 

10 because even where a particular nucleic acid molecule does not encode a polypeptide having 

activity, one of skill in the art would still know how to use the nucleic acid molecule, for instance, 
as a hybridization probe or primer. Uses of the nucleic acid molecules of the present invention that 
do not encode a polypeptide having GENSET activity include, inter alia, isolating a GENSET gene 
or allelic variants thereof from a DNA library, and detecting GENSET mRNA expression in 

1 5 biological samples suspected of containing GENSET mRNA or DNA, e.g., by Northern Blot or 
PCR analysis. 

The present invention is further directed to polynucleotides having sequences at least 50%. 
60%, 70%, 80%>, 90%, 95%, 96%, 97%, 98% or 99% identity to a polynucleotide selected from the 
group consisting of sequences of SEQ ID NOs:l-169, 339-455, 561-784 and clone inserts of the 

20 deposited clone pool, where said polynucleotide do, in fact, encode a polypeptide having a 

GENSET biological activity. Of course, due to the degeneracy of the genetic code, one of ordinary 
skill in the art will immediately recognize that a large number of the polynucleotides at least 50%. 
60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, or 99% identical to a polynucleotide selected from 
the group consisting of sequences of SEQ ID NOs:l-169, 339-455, 561-784 and clone inserts of the 

25 deposited clone pool will encode a polypeptide having biological activity. In fact, since degenerate 
variants of these nucleotide sequences all encode the same polypeptide, this will be clear to the 
skilled artisan even without performing the above described comparison assay. It will be further 
recognized in the art that, for such nucleic acid molecules that are not degenerate variants, a 
reasonable number will also encode a polypeptide having biological activity. This is because the 

30 skilled artisan is fully aware of amino acid substitutions that are either less likely or not likely to 
significantly affect protein function (e.g., replacing one aliphatic amino acid with a second aliphatic 
amino acid), as further described below. By a polynucleotide having a nucleotide sequence at least, 
for example, 95% "identical" to a reference nucleotide sequence of the present invention, it is 
intended that the nucleotide sequence of the polynucleotide is identical to the reference sequence 

35 except that the polynucleotide sequence may include up to five point mutations per each 100 
nucleotides of the reference nucleotide sequence encoding the GENSET polypeptide. In other 
words, to obtain a polynucleotide having a nucleotide sequence at least 95% identical to a reference 
nucleotide sequence, up to 5% of the nucleotides in the reference sequence may be deleted, inserted, 
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or substituted with another nucleotide. The query sequence may be an entire sequence selected 
from the group consisting of sequences of SEQ ID NOs:l-169, 339-455, 561-784 and sequences of 
clone inserts of the deposited clone pool, or the ORF (open reading frame) of a polynucleotide 
sequence selected from said group, or any fragment specified as described herein. 

5 Hybridizing Polynucleotides 

In another aspect, the invention provides an isolated or purified nucleic acid molecule 
comprising a polynucleotide which hybridizes under stringent hybridization conditions to any 
polynucleotide of the present invention using any methods known to those skilled in the art 
including those disclosed herein and in particular in the "To find similar sequences" section. Also 

10 contemplated are nucleic acid molecules that hybridize to the polynucleotides of the present 
invention at lower stringency hybridization conditions, preferably at moderate or low stringency 
conditions as defined herein. Such hybridizing polynucleotides may be of at least 15,18, 20, 23, 25, 
28, 30, 35, 40, 50, 75, 100, 200, 300, 500 or 1000 nucleotides in length. 

Of particular interest are polynucleotides hybridizing to any polynucleotide of the invention 

15 and encoding GENSET polypeptides, particularly GENSET polypeptides exhibiting a GENSET 
biological activity. 

Of course, a polynucleotide which hybridizes only to polyA+ sequences (such as any 3' 
terminal polyA+ tract of a cDNA shown in the sequence listing), or to a 5' complementary stretch 
of T (or U) residues, would not be included in the definition of "polynucleotide," since such a 
20 polynucleotide would hybridize to any nucleic acid molecule containing a poly(A) stretch or the 
complement thereof (e.g., practically any double-stranded cDNA clone generated using oligo dT as 
a primer). 

Complementary polynucleotides 

The invention further provides isolated nucleic acid molecules having a nucleotide 
25 sequence fully complementary to any polynucleotide of the invention. The present invention 
encompasses a purified, isolated or recombinant polynucleotide having a nucleotide sequence 
complementary to a sequence selected from the group consisting of sequences of SEQ ID NOs: 1 - 
169, 339-455, 561-784, sequences of clone inserts of the deposited clone pool and fragments 
thereof. Such isolated molecules, particularly DNA molecules, are useful as probes for gene 
30 mapping and for identifying GENSET mRNA in a biological sample, for instance, by PCR or 
Northern blot analysis. 

Polynucleotide fragments 

The present invention is further directed to polynucleotides encoding portions or fragments 
of the nucleotide sequences described herein. Uses for the polynucleotide fragments of the present 
35 invention include probes, primers, molecular weight markers and for expressing the polypeptide 
fragments of the present invention. Fragments include portions of polynucleotides selected from 
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the group consisting of a) the sequences of SEQ ID NOs: 1-169, 339-455, 561-784, b) genomic 
GENSET sequences, c) the polynucleotides encoding a polypeptide selected from the group 
consisting of the sequences of SEQ ID NOs: 170-338, 456-560, 785-918, d) the sequences of clone 
inserts of the deposited clone pool, and e) the polynucleotides encoding the polypeptides encoded 
5 by the clone inserts of the deposited clone pool. Particularly included in the present invention is a 
purified or isolated polynucleotide comprising at least 8 consecutive bases of a polynucleotide of 
the present invention. In one aspect of this embodiment, the polynucleotide comprises at least 10, 
12, 15, 18, 20, 25,28,30,35,40, 50, 75, 100, 150, 200,300, 400, 500, 800, 1000, 1500, or 2000 
consecutive nucleotides of a polynucleotide of the present invention. 

10 In addition to the above preferred polynucleotide sizes, further preferred sub-genuses of 

polynucleotides comprise at least 8 nucleotides, wherein "at least 8" is defined as any integer 
between 8 and the integer representing the 3' most nucleotide position as set forth in the sequence 
listing or elsewhere herein. Further included as preferred polynucleotides of the present invention 
are polynucleotide fragments at least 8 nucleotides in length, as described above, that are further 

1 5 specified in terms of their 5' and 3' position. The 5' and 3' positions are represented by the position 
numbers set forth in the appended sequence listing. For allelic, degenerate and other variants, 
position 1 is defined as the 5 f most nucleotide of the ORF, i.e., the nucleotide "A" of the start codon 
with the remaining nucleotides numbered consecutively. Therefore, every combination of a 5' and 
3* nucleotide position that a polynucleotide fragment of the present invention, at least 8 contiguous 

20 nucleotides in length, could occupy on a polynucleotide of the invention is included in the invention 
as an individual species. The polynucleotide fragments specified by 5 f and 3 ? positions can be 
immediately envisaged and are therefore not individually listed solely for the purpose of not 
unnecessarily lengthening the specification. 

It is noted that the above species of polynucleotide fragments of the present invention may 

25 alternatively be described by the formula "a to b"; where "a" equals the 5' most nucleotide position 
and "b" equals the 3' most nucleotide position of the polynucleotide; and further where "a" equals 
an integer between 1 and the number of nucleotides of the polynucleotide sequence of the present 
invention minus 8, and where "b" equals an integer between 9 and the number of nucleotides of the 
polynucleotide sequence of the present invention; and where "a" is an integer smaller then "b" by at 

30 least 8. 

Therefore, the present invention encompasses isolated, purified, or recombinant 
polynucleotides which consist of, consist essentially of, or comprise a contiguous span of at least 8, 
10, 12, 15, 18, 20, 25,28, 30,35,40, 50, 75, 100, 150, 200,300,400,500, 1000 or 2000 nucleotides 
of a sequence selected from the group consisting of the sequences of SEQ ID NOs: 1-169, 339-455, 
35 561-784 and sequences fully complementary thereto. 

Other preferred fragments of the invention are polynucleotides comprising polynucleotides 
encoding domains of polypeptides. Such fragments may be used to obtain other polynucleotides 
encoding polypeptides having similar domains using hybridization or RT-PCR techniques. 
Alternatively, these fragments may be used to express a polypeptide domain which may have a 
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specific biological property. Thus, another object of the invention is an isolated, purified or 
recombinant polynucleotide encoding a polypeptide consisting of, consisting essentially of, or 
comprising a contiguous span of at least 5, 6, 8, 10, 12, 15,20, 25,30,35, 40, 50, 60, 75, 100, 150 
or 200 consecutive amino acids of a sequence selected from the group consisting of the sequences 
5 of SEQ ID NOs: 1 70-33 8, 456-560, 785-9 1 8, to the extent that a contiguous span of these lengths is 
consistent with the lengths of said selected sequence, where said contiguous span comprises at least 
1, 2, 3, 5, or 10 of the amino acid positions of a domain of said selected sequence. The present 
invention also encompasses isolated, purified or recombinant polynucleotides encoding a 
polypeptide comprising a contiguous span of at least 5, 6, 8, 10, 12, 15, 20, 25, 30, 35, 40, 50, 60, 

10 75, 100, 150 or 200 consecutive amino acids of a sequence selected from the group consisting of 
sequences of SEQ ID NOs: 170-338, 456-560, 785-91 8, to the extent that a contiguous span of these 
lengths is consistent with the lengths of said selected sequence, where said contiguous span is a 
domain of said selected sequence. The present invention also encompasses isolated, purified or 
recombinant polynucleotides encoding a polypeptide comprising a domain of a sequence selected 

15 from the group consisting of the sequences of SEQ ID NOs: 170-338, 456-560, 785-918. 

The present invention further encompasses any combination of the polynucleotide 
fragments listed in this section. 

Oligonucleotide primers and probes 

The present invention also encompasses fragments of GENSET polynucleotides for use as 
20 primers and probes. Polynucleotides derived from the GENSET genomic and cDNA sequences are 
useful in order to detect the presence of at least a copy of a GENSET polynucleotide or fragment, 
complement, or variant thereof in a test sample. 

Structural definition 

Any polynucleotide of the invention may be used as a primer or probe. Particularly 
25 preferred probes and primers of the invention include isolated, purified, or recombinant 

polynucleotides comprising a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 
80, 90, 100, 150, 200, 500, or 1000 nucleotides of a sequence selected from the group consisting of 
the GENSET genomic sequences, the cDNA sequences and the sequences fully complementary 
thereto. Another object of the invention is a purified, isolated, or recombinant polynucleotide 
30 comprising the nucleotide sequence of a sequence selected from the group consisting of the 

sequences of SEQ ID NOs: 1-1 69, 339-455, 561-784, sequences of clone inserts of the deposited 
clone pool, sequences fully complementary thereto, allelic variants thereof, and fragments thereof. 
Moreover, preferred probes and primers of the invention include purified, isolated, or recombinant 
GENSET cDNAs consisting of, consisting essentially of, or comprising the sequences of SEQ ID 
35 NOs:l-169, 339-455, 561-784 and sequences of clone inserts of the deposited clone pool. 

Particularly preferred probes and primers of the invention include isolated, purified, or recombinant 
polynucleotides comprising a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 
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80, 90, 100, 150, 200, 500, or 1000 nucleotides of a sequence selected from the group consisting of 
the sequences of SEQ ID NOs:l-169, 339-455, 561-784 and the sequences fully complementary 
thereto. 

Design of primers and probes 

5 A probe or a primer according to the invention has between 8 and 1000 nucleotides in 

length, or is specified to be at least 12, 15, 18, 20, 25, 35, 40, 50, 60, 70, 80, 100, 250, 500 or 1000 
nucleotides in length. More particularly, the length of these probes and primers can range from 8, 
10, 15, 20, or 30 to 100 nucleotides, preferably from 10 to 50, more preferably from 15 to 30 
nucleotides. Shorter probes and primers tend to lack specificity for a target nucleic acid sequence 

10 and generally require cooler temperatures to form sufficiently stable hybrid complexes with the 
template. Longer probes and primers are expensive to produce and can sometimes self-hybridize to 
form hairpin structures. The appropriate length for primers and probes under a particular set of 
assay conditions may be empirically determined by one of skill in the art. The formation of stable 
hybrids depends on the melting temperature (Tm) of the DNA. The Tm depends on the length of 

15 the primer or probe, the ionic strength of the solution and the G+C content. The higher the G+C 
content of the primer or probe, the higher is the melting temperature because G:C pairs are held by 
three H bonds whereas A:T pairs have only two. The GC content in the probes of the invention 
usually ranges between 10 and 75%, preferably between 35 and 60%, and more preferably between 
40 and 55%. 

20 For amplification purposes, pairs of primers with approximately the same Tm are 

preferable. Primers may be designed using the OSP software (Hillier and Green, 1991), the 
disclosure of which is incorporated by reference in its entirety, based on GC content and melting 
temperatures of oligonucleotides, or using PC-Rare (http:// 

bioinformatics.weizmann.ac.il/software/PC-Rare/doc/manuel.html) based on the octamer frequency 
25 disparity method (Griffais et al. 9 1991), the disclosure of which is incorporated by reference in its 
entirety. DNA amplification techniques are well known to those skilled in the art. Amplification 
techniques that can be used in the context of the present invention include, but are not limited to, the 
ligase chain reaction (LCR) described in EP-A- 320 308, WO 9320227 and EP-A-439 182, the 
polymerase chain reaction (PCR, RT-PCR) and techniques such as the nucleic acid sequence based 
30 amplification (NASBA) described in Guatelli et a/.(1990) and in Compton (1991), Q-beta 
amplification as described in European Patent Application No 4544610, strand displacement 
amplification as described in Walker et aL (1996) and EP A 684 315 and, target mediated 
amplification as described in PCT Publication WO 9322461, the disclosures of which are 
incorporated by reference in their entireties. 
35 LCR and Gap LCR are exponential amplification techniques, both depending on DNA 

ligase to join adjacent primers annealed to a DNA molecule. In Ligase Chain Reaction (LCR), 
probe pairs are used which include two primary (first and second) and two secondary (third and 
fourth) probes, all of which are employed in molar excess to target. The first probe hybridizes to a 
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first segment of the target strand and the second probe hybridizes to a second segment of the target 
strand, the first and second segments being contiguous so that the primary probes abut one another 
in 5' phosphate-3'hydroxyl relationship, and so that a ligase can covalently fuse or ligate the two 
probes into a fused product. In addition, a third (secondary) probe can hybridize to a portion of the 
5 first probe and a fourth (secondary) probe can hybridize to a portion of the second probe in a similar 
abutting fashion. Of course, if the target is initially double stranded, the secondary probes also will 
hybridize to the target complement in the first instance. Once the ligated strand of primary probes 
is separated from the target strand, it will hybridize with the third and fourth probes, which can be 
ligated to form a complementary, secondary ligated product. It is important to realize that the 

10 ligated products are functionally equivalent to either the target or its complement. By repeated 

cycles of hybridization and ligation, amplification of the target sequence is achieved. A method for 
multiplex LCR has also been described (WO 9320227), the disclosure of which is incorporated by 
reference in its entirety. Gap LCR (GLCR) is a version of LCR where the probes are not adjacent 
but are separated by 2 to 3 bases. 

1 5 For amplification of mRNAs, it is within the scope of the present invention to reverse 

transcribe mRNA into cDNA followed by polymerase chain reaction (RT-PCR); or, to use a single 
enzyme for both steps as described in U.S. Patent No. 5,322,770 or, to use Asymmetric Gap LCR 
(RT-AGLCR) as described by Marshall et a/.(1994), the disclosures of which are incorporated by 
reference in its entireties. AGLCR is a modification of GLCR that allows the amplification of 

20 RNA. 

PCR technology is the preferred amplification technique used in the present invention. A 
variety of PCR techniques are familiar to those skilled in the art. For a review of PCR technology, 
see White (1997), Erlich (1992) and the publication entitled "PCR Methods and Applications" 
((1991) Cold Spring Harbor Laboratory Press), the disclosures of which are incorporated by 

25 reference in their entireties. In each of these PCR procedures, PCR primers on either side of the 
nucleic acid sequences to be amplified are added to a suitably prepared nucleic acid sample along 
with dNTPs and a thermostable polymerase such as Taq polymerase, Pfu polymerase, Tth 
polymerase or Vent polymerase. The nucleic acid in the sample is denatured and the PCR primers 
are specifically hybridized to complementary nucleic acid sequences in the sample. The hybridized 

30 primers are extended. Thereafter, another cycle of denaturation, hybridization, and extension is 
initiated. The cycles are repeated multiple times to produce an amplified fragment containing the 
nucleic acid sequence between the primer sites. PCR has further been described in several patents 
including US Patent Nos. 4,683,195; 4,683,202; and 4,965,188, the disclosures of which are 
incorporated herein by reference in their entireties. 

35 Preparation of primers and probes 

Primers and probes can be prepared by any suitable method, including, for example, 
cloning and restriction of appropriate sequences and direct chemical synthesis by a method such as 
the phosphodiester method of Narang et ar/.(1979), the phosphodiester method of Brown et 
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al. (1979), the diethylphosphoramidite method of Beaucage et #7.(1981) and the solid support 
method described in EP 0 707 592, which disclosures are hereby incorporated by reference in their 
entireties. 

Detection probes are generally nucleic acid sequences or uncharged nucleic acid analogs 
5 such as, for example peptide nucleic acids which are disclosed in International Patent Application 
WO 92/20702, morpholino analogs which are described in U.S. Patent Nos. 5,1 85,444; 5,034,506 
and 5,142,047, which disclosures are hereby incorporated by reference in their entireties. The probe 
may have to be rendered "non-extendable" in that additional dNTPs cannot be added to the probe. 
In and of themselves analogs usually are non-extendable and nucleic acid probes can be rendered 

10 non-extendable by modifying the 3' end of the probe such that the hydroxy 1 group is no longer 
capable of participating in elongation. For example, the 3' end of the probe can be functionalized 
with the capture or detection label to thereby consume or otherwise block the hydroxyl group. 
Alternatively, the 3 1 hydroxyl group simply can be cleaved, replaced or modified, U.S. Patent 
Application Serial No. 07/049,061 filed April 19, 1993, which disclosure is hereby incorporated by 

15 reference in its entirety, describes modifications, which can be used to render a probe non- 
extendable. 

Labeling of probes 

Any of the polynucleotides of the present invention can be labeled, if desired, by 

20 incorporating any label known in the art to be detectable by spectroscopic, photochemical, 

biochemical, immunochemical, or chemical means. For example, useful labels include radioactive 
substances (including, 32 P, 35 S, 3 H, 125 I), fluorescent dyes (including, 5-bromodesoxyuridin, 
fluorescein, acetylaminofluorene, digoxigenin) or biotin. Preferably, polynucleotides are labeled at 
their 3' and 5' ends. Examples of non-radioactive labeling of nucleic acid fragments are described 

25 in the French patent No. FR-78 10975 or by Urdea et al (1988) or Sanchez-Pescador et al (1988), 
which disclosures are hereby incorporated by reference in their entireties. In addition, the probes 
according to the present invention may have structural characteristics such that they allow the signal 
amplification, such structural characteristics being, for example, branched DNA probes as those 
described by Urdea et al. in 1991 or in the European patent No. EP 0 225 807 (Chiron), which 

30 disclosures are hereby incorporated by reference in their entireties. 

The detectable probe may be single stranded or double stranded and may be made using 
techniques known in the art, including in vitro transcription, nick translation, or kinase reactions. A 
nucleic acid sample containing a sequence capable of hybridizing to the labeled probe is contacted 
with the labeled probe. If the nucleic acid in the sample is double stranded, it may be denatured 

35 prior to contacting the probe. In some applications, the nucleic acid sample may be immobilized on 
a surface such as a nitrocellulose or nylon membrane. The nucleic acid sample may comprise 
nucleic acids obtained from a variety of sources, including genomic DNA, cDNA libraries, RNA, or 
tissue samples. 
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Procedures used to detect the presence of nucleic acids capable of hybridizing to the 
detectable probe include well known techniques such as Southern blotting, Northern blotting, dot 
blotting, colony hybridization, and plaque hybridization. In some applications, the nucleic acid 
capable of hybridizing to the labeled probe may be cloned into vectors such as expression vectors, 
5 sequencing vectors, or in vitro transcription vectors to facilitate the characterization and expression 
of the hybridizing nucleic acids in the sample. For example, such techniques may be used to isolate 
and clone sequences in a genomic library or cDNA library which are capable of hybridizing to the 
detectable probe as described herein. 

Immobilization of probes 

10 A label can also be used to capture the primer, so as to facilitate the immobilization of 

either the primer or a primer extension product, such as amplified DNA, on a solid support. A 
capture label is attached to the primers or probes and can be a specific binding member which forms 
a binding pair with the solid phase reagent's specific binding member (e.g. biotin and streptavidin). 
Therefore depending upon the type of label carried by a polynucleotide or a probe, it may be 

1 5 employed to capture or to detect the target DNA. Further, it will be understood that the 

polynucleotides, primers or probes provided herein, may, themselves, serve as the capture label. 
For example, in the case where a solid phase reagent's binding member is a nucleic acid sequence, 
it may be selected such that it binds a complementary portion of a primer or probe to thereby 
immobilize the primer or probe to the solid phase. In cases where a polynucleotide probe itself 

20 serves as the binding member, those skilled in the art will recognize that the probe will contain a 
sequence or "tail" that is not complementary to the target. In the case where a polynucleotide 
primer itself serves as the capture label, at least a portion of the primer will be free to hybridize with 
a nucleic acid on a solid phase. DNA Labeling techniques are well known to the skilled technician. 
The probes of the present invention are useful for a number of purposes. They can notably 

25 be used in Southern hybridization to genomic DNA. The probes can also be used to detect PCR 
amplification products. They may also be used to detect mismatches in the GENSET gene or 
mRNA using other techniques. They may also be used for in situ hybridization. 

Any of the polynucleotides, primers and probes of the present invention can be 
conveniently immobilized on a solid support. The solid support is not critical and can be selected 

30 by one skilled in the art. Thus, latex particles, microparticles, magnetic beads, non-magnetic beads 
(including polystyrene beads), membranes (including nitrocellulose strips), plastic tubes, walls of 
microtiter wells, glass or silicon chips, sheep (or other suitable animal's) red blood cells and 
duracytes are all suitable examples. Suitable methods for immobilizing nucleic acids on solid 
phases include ionic, hydrophobic, covalent interactions and the like. A solid support, as used 

35 herein, refers to any material which is insoluble, or can be made insoluble by a subsequent reaction. 
The solid support can be chosen for its intrinsic ability to attract and immobilize the capture reagent. 
Alternatively, the solid phase can retain an additional receptor which has the ability to attract and 
immobilize the capture reagent. The additional receptor can include a charged substance that is 
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oppositely charged with respect to the capture reagent itself or to a charged substance conjugated to 
the capture reagent. As yet another alternative, the receptor molecule can be any specific binding 
member which is immobilized upon (attached to) the solid support and which has the ability to 
immobilize the capture reagent through a specific binding reaction. The receptor molecule enables 
5 the indirect binding of the capture reagent to a solid support material before the performance of the 
assay or during the performance of the assay. The solid phase thus can be a plastic, derivatized 
plastic, magnetic or non-magnetic metal, glass or silicon surface of a test tube, microtiter well, 
sheet, bead, microparticle, chip, sheep (or other suitable animal's) red blood cells, duracytes® and 
other configurations known to those of ordinary skill in the art. The polynucleotides of the 
10 invention can be attached to or immobilized on a solid support individually or in groups of at least 
2, 5, 8, 10, 12, 15, 20, or 25 distinct polynucleotides of the invention to a single solid support. In 
addition, polynucleotides other than those of the invention may be attached to the same solid 
support as one or more polynucleotides of the invention. 

15 Oligonucleotide array 

A substrate comprising a plurality of oligonucleotide primers or probes of the invention 
may be used either for detecting or amplifying targeted sequences in GENSET genes, may be used 
for detecting mutations in the coding or in the non-coding sequences of GENSET genes, and may 
also be used to determine GENSET gene expression in different contexts such as in different 

20 tissues, at different stages of a process (embryo development, disease treatment), and in patients 
versus healthy individuals as described elsewhere in the application. 

As used herein, the term "array" means a one dimensional, two dimensional, or 
multidimensional arrangement of nucleic acids of sufficient length to permit specific detection of 
gene expression. For example, the array may contain a plurality of nucleic acids derived from 

25 genes whose expression levels are to be assessed. The array may include a GENSET genomic 
DNA, a GENSET cDNA, sequences complementary thereto or fragments thereof. Preferably, the 
fragments are at least 12, 15, 18, 20, 25, 30, 35, 40 or 50 nucleotides in length. More preferably, the 
fragments are at least 100 nucleotides in length. Even more preferably, the fragments are more than 
100 nucleotides in length. In some embodiments the fragments may be more than 500 nucleotides 

30 in length. 

Any polynucleotide provided herein may be attached in overlapping areas or at random 
locations on the solid support. Alternatively the polynucleotides of the invention may be attached 
in an ordered array wherein each polynucleotide is attached to a distinct region of the solid support 
which does not overlap with the attachment site of any other polynucleotide. Preferably, such an 
35 ordered array of polynucleotides is designed to be "addressable" where the distinct locations are 
recorded and can be accessed as part of an assay procedure. Addressable polynucleotide arrays 
typically comprise a plurality of different oligonucleotide probes that are coupled to a surface of a 
substrate in different known locations. The knowledge of the precise location of each 
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polynucleotides location makes these "addressable" arrays particularly useful in hybridization 
assays. Any addressable array technology known in the art can be employed with the 
polynucleotides of the invention. One particular embodiment of these polynucleotide arrays is 
known as the Genechips™, and has been generally described in US Patent No. 5,143,854; PCT 
5 publications WO 90/15070 and 92/10092, which disclosures are hereby incorporated by reference in 
their entireties. These arrays may generally be produced using mechanical synthesis methods or 
light directed synthesis methods which incorporate a combination of photolithographic methods and 
solid phase oligonucleotide synthesis (Fodor et aL, 1991), which disclosure is hereby incorporated 
by reference in its entirety. The immobilization of arrays of oligonucleotides on solid supports has 

1 0 been rendered possible by the development of a technology generally identified as "Very Large 
Scale Immobilized Polymer Synthesis" (VLSIPS™) in which, typically, probes are immobilized in 
a high density array on a solid surface of a chip. Examples of VLSIPS™ technologies are provided 
in US Patents 5,143,854; and 5,412,087 and in PCT Publications WO 90/15070, WO 92/10092 and 
WO 95/1 1995, which disclosures are hereby incorporated by reference in their entireties, which 

15 describe methods for forming oligonucleotide arrays through techniques such as light-directed 

synthesis techniques. In designing strategies aimed at providing arrays of nucleotides immobilized 
on solid supports, further presentation strategies were developed to order and display the 
oligonucleotide arrays on the chips in an attempt to maximize hybridization patterns and sequence 
information. Examples of such presentation strategies are disclosed in PCT Publications WO 

20 94/12305, WO 94/1 1530, WO 97/29212 and WO 97/3 1256, the disclosures of which are 
incorporated herein by reference in their entireties. 

Consequently, the invention concerns an array of nucleic acid molecules comprising at least 
one polynucleotide of the invention, particularly a probe or primer as described herein. Preferably, 
the invention concerns an array of nucleic acids comprising at least two polynucleotides of the 

25 invention, particularly probes or primers as described herein. Preferably, the invention concerns an 
array of nucleic acids comprising at least five polynucleotides of the invention, particularly probes 
or primers as described herein. 

A preferred embodiment of the present invention is an array of polynucleotides of at least 
12, 15, 18, 20, 25, 30, 35, 40, 50, 100 or 500 nucleotides in length which includes at least 1, 2, 5, 

30 10, 15, 20, 35, 50 or 100 sequences selected from the group consisting of the sequences of SEQ ID 
NOs: 1-169, 339-455, 561-784 and sequences of clone inserts of the deposited clone pool, sequences 
fully complementary thereto, and fragments thereof. 

Methods of making the polynucleotides of the invention 

The present invention also comprises methods of making the polynucleotides of the 
35 invention, including the polynucleotides of SEQ ID NOs: 1-1 69, 339-455, 561-784, genomic DNA 
obtainable therefrom, or fragments thereof. These methods comprise sequentially linking together 
nucleotides to produce the nucleic acids having the preceding sequences. Polynucleotides of the 
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invention may be synthesized either enzymatically using techniques well known to those skilled in 
the art including amplification or hybridization-based methods as described herein, or chemically. 

A variety of chemical methods of synthesizing nucleic acids are known to those skilled in 
the art. In many of these methods, synthesis is conducted on a solid support. These included the 3' 
5 phosphoramidite methods in which the 3' terminal base of the desired oligonucleotide is 

immobilized on an insoluble carrier. The nucleotide base to be added is blocked at the 5' hydroxyl 
and activated at the 3' hydroxyl so as to cause coupling with the immobilized nucleotide base. 
Deblocking of the new immobilized nucleotide compound and repetition of the cycle will produce 
the desired polynucleotide. Alternatively, polynucleotides may be prepared as described in U.S. 
10 Patent No. 5,049,656, which disclosure is hereby incorporated by reference in its entirety. In some 
embodiments, several polynucleotides prepared as described above are ligated together to generate 
longer polynucleotides having a desired sequence. 

Polypeptides of the invention 

The term " GENSET polypeptides" is used herein to embrace all of the proteins and 
15 polypeptides of the present invention. The present invention encompasses GENSET polypeptides, 
including recombinant, isolated or purified GENSET polypeptides consisting of, consisting 
essentially of, or comprising a sequence selected from the group consisting of SEQ ID NOs: 1 70- 
338, 456-560, 785-918 and the polypeptides encoded by human cDNAs contained in the deposited 
clones. Other objects of the invention are polypeptides encoded by the polynucleotides of the 
20 invention as well as fusion polypeptides comprising such polypeptides. 

Polypeptide variants 

The present invention further provides for GENSET polypeptides encoded by allelic and 
splice variants, orthologs, and/or species homologues. Procedures known in the art can be used to 
obtain, allelic variants, splice variants, orthologs, and/or species homologues of polynucleotides 

25 encoding by polypeptides of the group consisting of SEQ ID NOs: 1 70-338, 456-560, 785-91 8 and 
polypeptides encoded by the clone inserts of the deposited clone pool, using information from the 
sequences disclosed herein or the clones deposited with the ATCC. 

The polypeptides of the present invention also include polypeptides having an amino acid 
sequence at least 50% identical, more preferably at least 60% identical, and still more preferably 

30 70%, 80%, 90%, 95%, 96%, 97%, 98% or 99% identical to a polypeptide selected from the group 
consisting of the sequences of SEQ ID NOs: 170-338, 456-560, 785-918 and those encoded by the 
clone inserts of the deposited clone pool. By a polypeptide having an amino acid sequence at least, 
for example, 95% "identical" to a query amino acid sequence of the present invention, it is intended 
that the amino acid sequence of the subject polypeptide is identical to the query sequence except 

35 that the subject polypeptide sequence may include up to five amino acid alterations per each 100 
amino acids of the query amino acid sequence. In other words, to obtain a polypeptide having an 
amino acid sequence at least 95% identical to a query amino acid sequence, up to 5% (5 of 100) of 
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the amino acid residues in the subject sequence may be inserted, deleted, (indels) or substituted with 
another amino acid. 

Further polypeptides of the present invention include polypeptides which have at least 90% 
similarity, more preferably at least 95% similarity, and still more preferably at least 96%, 97%, 98% 
5 or 99% similarity to those described above. By a polypeptide having an amino acid sequence at 
least, for example, 95% "similar" to a query amino acid sequence of the present invention, it is 
intended that the amino acid sequence of the subject polypeptide is similar (i.e. contains identical or 
equivalent amino acid residues) to the query sequence except that the subject polypeptide sequence 
may include up to five amino acid alterations per each 100 amino acids of the query amino acid 
10 sequence. In other words, to obtain a polypeptide having an amino acid sequence at least 95% 
similar to a query amino acid sequence, up to 5% (5 of 100) of the amino acid residues in the 
subject sequence may be inserted, deleted, (indels) or substituted with another non-equivalent 
amino acid. 

These alterations of the reference sequence may occur at the amino or carboxy terminal 
15 positions of the reference amino acid sequence or anywhere between those terminal positions, 
interspersed either individually among residues in the reference sequence or in one or more 
contiguous groups within the reference sequence. The query sequence may be an entire amino acid 
sequence selected from the group consisting of sequences of SEQ ID NOs: 1 70-338, 456-560, 785- 
918 and those encoded by the clone inserts of the deposited clone pool or any fragment specified as 
20 described herein. 

The variant polypeptides described herein are included in the present invention regardless 
of whether they have their normal biological activity. This is because even where a particular 
polypeptide molecule does not have biological activity, one of skill in the art would still know how 
to use the polypeptide, for instance, as a vaccine or to generate antibodies. Other uses of the 

25 polypeptides of the present invention that do not have GENSET biological activity include, inter 
alia, as epitope tags, in epitope mapping, and as molecular weight markers on SDS-PAGE gels or 
on molecular sieve gel filtration columns using methods known to those of skill in the art. As 
described below, the polypeptides of the present invention can also be used to raise polyclonal and 
monoclonal antibodies, which are useful in assays for detecting GENSET protein expression or as 

30 agonists and antagonists capable of enhancing or inhibiting GENSET protein function. Further, 

such polypeptides can be used in the yeast two-hybrid system to "capture" GENSET protein binding 
proteins, which are also candidate agonists and antagonists according to the present invention (see, 
e.g., Fields et ah 1989, which disclosure is hereby incorporated by reference in its entirety). 

Preparation of the polypeptides of the invention 

35 The polypeptides of the present invention can be prepared in any suitable manner. Such 

polypeptides include isolated naturally occurring polypeptides, recombinantly produced 
polypeptides, synthetically produced polypeptides, or polypeptides produced by a combination of 
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these methods. The polypeptides of the present invention are preferably provided in an isolated 
form, and may be partially or preferably substantially purified. 

Consequently, the present invention also comprises methods of making the polypeptides of 
the invention, particularly polypeptides encoded by the cDNAs of SEQ ID NOs: 1 -1 69, 339-455, 
5 561-784 or by the clone inserts of the deposited clone pool, genomic DNA obtainable therefrom, or 
fragments thereof and methods of making the polypeptides of SEQ ID NOs: 170-338, 456-560, 785- 
91 8 or fragments thereof. The methods comprise sequentially linking together amino acids to 
produce the nucleic polypeptides having the preceding sequences. In some embodiments, the 
polypeptides made by these methods are 150 amino acids or less in length. In other embodiments, 
10 the polypeptides made by these methods are 120 amino acids or less in length. 

Isolation 

From natural sources 

The GENSET proteins of the invention may be isolated from natural sources, including 
bodily fluids, tissues and cells, whether directly isolated or cultured cells, of humans or non-human 

15 animals. Methods for extracting and purifying natural proteins are known in the art, and include the 
use of detergents or chaotropic agents to disrupt particles followed by differential extraction and 
separation of the polypeptides by ion exchange chromatography, affinity chromatography, 
sedimentation according to density, and gel electrophoresis. See, for example, "Methods in 
Enzymology, Academic Press, 1993" for a variety of methods for purifying proteins, which 

20 disclosure is hereby incorporated by reference in its entirety. Polypeptides of the invention also can 
be purified from natural sources using antibodies directed against the polypeptides of the invention, 
such as those described herein, in methods which are well known in the art of protein purification. 

From recombinant sources 

Preferably, the GENSET polypeptides of the invention are recombinantly produced using 
25 routine expression methods known in the art. The polynucleotide encoding the desired polypeptide 
is operably linked to a promoter into an expression vector suitable for any convenient host. Both 
eukaryotic and prokaryotic host systems are used in forming recombinant polypeptides. The 
polypeptide is then isolated from lysed cells or from the culture medium and purified to the extent 
needed for its intended use. 
30 Any GENSET polynucleotide, including those described in SEQ ID NOs: 1-1 69, 339-455, 

561-784, those of clone inserts of the deposited clone pool, and allelic variants thereof may be used 
to express GENSET polypeptides. The nucleic acid encoding the GENSET polypeptide to be 
expressed is operably linked to a promoter in an expression vector using conventional cloning 
technology. The GENSET insert in the expression vector may comprise the full coding sequence 
35 for the GENSET protein or a portion thereof. For example, the GENSET derived insert may encode 
a polypeptide comprising at least 6, 8, 10, 12, 15, 20, 25, 30, 35, 40, 50, 60, 75, 100, 150 or 200 
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consecutive amino acids of a GENSET protein selected from the group consisting of sequences of 
SEQ ID NOs: 170-338, 456-560, 785-918 and polypeptides encoded by the clone inserts of the 
deposited clone pool. 

Consequently, a further embodiment of the present invention is a method of making a 
5 polypeptide comprising a protein selected from the group consisting of sequences of SEQ ID 
NOs: 170-338, 456-560, 785-918 and polypeptides encoded by the clone inserts of the deposited 
clone pool, said method comprising the steps of: 

a) obtaining a cDNA comprising a sequence selected from the group consisting of i) the 
sequences SEQ ID NOs: 1-1 69, 339-455, 561-784, ii) the sequences of clone inserts of the deposited 

10 clone pool one, iii) sequences encoding one of the polypeptide of SEQ ID NOs: 170-338, 456-560, 
785-918, and iv) sequences of polynucleotides encoding a polypeptide which is encoded by one of 
the clone insert of the deposited clone pool; 

b) inserting said cDNA in an expression vector such that the cDNA is operably linked to a 
promoter; and 

15 c) introducing said expression vector into a host cell whereby said host cell produces said 

polypeptide. 

In one aspect of this embodiment, the method further comprises the step of isolating the 
polypeptide. Another embodiment of the present invention is a polypeptide obtainable by the 
method described in the preceding paragraph. 

20 The expression vector is any of the mammalian, yeast, insect or bacterial expression 

systems known in the art. Commercially available vectors and expression systems are available 
from a variety of suppliers including Genetics Institute (Cambridge, MA), Stratagene (La Jolla, 
California), Promega (Madison, Wisconsin), and Invitrogen (San Diego, California). If desired, to 
enhance expression and facilitate proper protein folding, the codon context and codon pairing of the 

25 sequence is optimized for the particular expression organism in which the expression vector is 
introduced, as explained in U.S. Patent No. 5,082,767, which disclosure is hereby incorporated by 
reference in its entirety. 

In one embodiment, the entire coding sequence of a GENSET cDNA and the 3'UTR 
through the poly A signal of the cDNA is operably linked to a promoter in the expression vector. 

30 Alternatively, if the nucleic acid encoding a portion of the GENSET protein lacks a methionine to 
serve as the initiation site, an initiating methionine can be introduced next to the first codon of the 
nucleic acid using conventional techniques. Similarly, if the insert from the GENSET cDNA lacks 
a poly A signal, this sequence can be added to the construct by, for example, splicing out the Poly A 
signal from pSG5 (Stratagene) using Bgll and Sail restriction endonuclease enzymes and 

35 incorporating it into the mammalian expression vector pXTl (Stratagene). pXTl contains the LTRs 
and a portion of the gag gene from Moloney Murine Leukemia Virus. The position of the LTRs in 
the construct allow efficient stable transfection. The vector includes the Herpes Simplex Thymidine 
Kinase promoter and the selectable neomycin gene. The nucleic acid encoding the GENSET 
protein or a portion thereof is obtained by PCR from a vector containing a GENSET cDNA selected 
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from the group consisting of the sequences of SEQ IDNOs:l-169, 339-455, 561-784 and the clone 
inserts of the deposited clone pool using oligonucleotide primers complementary to the GENSET 
cDNA or portion thereof and containing restriction endonuclease sequences for Pst I incorporated 
into the 5* primer and Bglll at the 5' end of the corresponding cDNA 3' primer, taking care to ensure 
5 that the sequence encoding the GENSET protein or a portion thereof is positioned properly with 
respect to the poly A signal. The purified fragment obtained from the resulting PCR reaction is 
digested with PstI, blunt ended with an exonuclease, digested with Bgl II, purified and ligated to 
pXTl, now containing a poly A signal and digested with Bglll. 

In another embodiment, it is often advantageous to add to the recombinant polynucleotide 
10 additional nucleotide sequence which codes for secretory or leader sequences, pro-sequences, 
sequences which aid in purification, such as multiple histidine residues, or an additional sequence 
for stability during recombinant production. 

As a control, the expression vector lacking a cDNA insert is introduced into host cells or 
organisms. 

1 5 Transfection of a GENSET expression vector into mouse NTH 3T3 cells is but one 

embodiment of introducing polynucleotides into host cells. Introduction of a polynucleotide 
encoding a polypeptide into a host cell can be effected by calcium phosphate transfection, 
DEAE-dextran mediated transfection, cationic lipid-mediated transfection, electroporation, 
transduction, infection, or other methods. Such methods are described in many standard laboratory 

20 manuals, such as Davis et ah (1986), which disclosure is hereby incorporated by reference in its 

entirety. It is specifically contemplated that the polypeptides of the present invention may in fact be 
expressed by a host cell lacking a recombinant vector. 

Recombinant cell extracts, or proteins from the culture medium if the expressed polypeptide 
is secreted, are then prepared and proteins separated by gel electrophoresis. If desired, the proteins 

25 may be ammonium sulfate precipitated or separated based on size or charge prior to electrophoresis. 
The proteins present are detected using techniques such as Coomassie or silver staining or using 
antibodies against the protein encoded by the GENSET cDNA of interest. Coomassie and silver 
staining techniques are familiar to those skilled in the art. 

Proteins from the host cells or organisms containing an expression vector which contains 

30 the GENSET cDNA or a fragment thereof are compared to those from the control cells or organism. 
The presence of a band from the cells containing the expression vector which is absent in control 
cells indicates that the GENSET cDNA is expressed. Generally, the band corresponding to the 
protein encoded by the GENSET cDNA will have a mobility near that expected based on the 
number of amino acids in the open reading frame of the cDNA. However, the band may have a 

35 mobility different than that expected as a result of modifications such as glycosylation, 
ubiquitination, or enzymatic cleavage. 

Alternatively, the GENSET polypeptide to be expressed may also be a product of 
transgenic animals, i.e., as a component of the milk of transgenic cows, goats, pigs or sheep which 
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are characterized by somatic or germ cells containing a nucleotide sequence encoding the protein of 
interest. 

A polypeptide of this invention can be recovered and purified from recombinant cell 
cultures by well-known methods including differential extraction, ammonium sulfate or ethanol 
5 precipitation, acid extraction, anion or cation exchange chromatography, phosphocellulose 
chromatography, hydrophobic interaction chromatography, affinity chromatography, 
hydroxylapatite chromatography and lectin chromatography. See, for example, "Methods in 
Enzymology", supra for a variety of methods for purifying proteins. Most preferably, high 
performance liquid chromatography ("HPLC") is employed for purification. A recombinantly 

10 produced version of a GENSET polypeptide can be substantially purified using techniques 
described herein or otherwise known in the art, such as, for example, by the one-step method 
described in Smith and Johnson (1988), which disclosure is hereby incorporated by reference in its 
entirety. Polypeptides of the invention also can be purified from recombinant sources using 
antibodies directed against the polypeptides of the invention, such as those described herein, in 

1 5 methods which are well known in the art of protein purification. 

Preferably, the recombinantly expressed GENSET polypeptide is purified using standard 
immunochromatography techniques such as the one described in the section entitled 
"Immunoaffinity Chromatography". In such procedures, a solution containing the protein of 
interest, such as the culture medium or a cell extract, is applied to a column having antibodies 

20 against the protein attached to the chromatography matrix. The recombinant protein is allowed to 
bind the immunochromatography column. Thereafter, the column is washed to remove non- 
specifically bound proteins. The specifically bound secreted protein is then released from the 
column and recovered using standard techniques. 

If antibody production is not possible, the GENSET cDNA sequence or fragment thereof 

25 may be incorporated into expression vectors designed for use in purification schemes employing 
chimeric polypeptides. In such strategies the coding sequence of the GENSET cDNA or fragment 
thereof is inserted in frame with the gene encoding the other half of the chimera. The other half of 
the chimera may be beta-globin or a nickel binding polypeptide encoding sequence. A 
chromatography matrix having antibody to beta-globin or nickel attached thereto is then used to 

30 purify the chimeric protein. Protease cleavage sites may be engineered between the beta-globin 
gene or the nickel binding polypeptide and the GENSET cDNA or fragment thereof. Thus, the two 
polypeptides of the chimera may be separated from one another by protease digestion. 

One useful expression vector for generating beta-globin chimerics is pSG5 (Stratagene), 
which encodes rabbit beta-globin. Intron II of the rabbit beta-globin gene facilitates splicing of the 

35 expressed transcript, and the polyadenylation signal incorporated into the construct increases the 
level of expression. These techniques as described are well known to those skilled in the art of 
molecular biology. Standard methods are published in methods texts such as Davis et al. 9 (1986) 
and many of the methods are available from Stratagene, Life Technologies, Inc., or Promega. 
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Polypeptide may additionally be produced from the construct using in vitro translation systems such 
as the In vitro Express™ Translation Kit (Stratagene). 

Depending upon the host employed in a recombinant production procedure, the 
polypeptides of the present invention may be glycosylated or may be non-glycosylated. In addition, 
5 polypeptides of the invention may also include an initial modified methionine residue, in some 
cases as a result of host-mediated processes. Thus, it is well known in the art that the N-terminal 
methionine encoded by the translation initiation codon generally is removed with high efficiency 
from any protein after translation in all eukaryotic cells. While the N-terminal methionine on most 
proteins also is efficiently removed in most prokaryotes, for some proteins, this prokaryotic removal 
1 0 process is inefficient, depending on the nature of the amino acid to which the N-terminal 
methionine is covalently linked. 

From chemical synthesis 

In addition, polypeptides of the invention, especially short protein fragments, can be 
chemically synthesized using techniques known in the art (See, e.g., Creighton, 1983; and 

1 5 Hunkapiller et al. 9 1984), which disclosures are hereby incorporated by reference in their entireties. 
For example, a polypeptide corresponding to a fragment of a polypeptide sequence of the invention 
can be synthesized by use of a peptide synthesizer. A variety of methods of making polypeptides 
are known to those skilled in the art, including methods in which the carboxyl terminal amino acid 
is bound to polyvinyl benzene or another suitable resin. The amino acid to be added possesses 

20 blocking groups on its amino moiety and any side chain reactive groups so that only its carboxyl 
moiety can react. The carboxyl group is activated with carbodiimide or another activating agent 
and allowed to couple to the immobilized amino acid. After removal of the blocking group, the 
cycle is repeated to generate a polypeptide having the desired sequence. Alternatively, the methods 
described in U.S. Patent No. 5,049,656, which disclosure is hereby incorporated by reference in its 

25 entirety, may be used. 

Furthermore, if desired, nonclassical amino acids or chemical amino acid analogs can be 
introduced as a substitution or addition into the polypeptide sequence. Non-classical amino acids 
include, but are not limited to, to the D-isomers of the common amino acids, 2,4-diaminobutyric 
acid, a-amino isobutyric acid, 4-aminobutyric acid, Abu, 2-amino butyric acid, g-Abu, e-Ahx, 6- 

30 amino hexanoic acid, Aib, 2-amino isobutyric acid, 3-amino propionic acid, ornithine, norleucine, 
norvaline, hydroxy proline, sarcosine, citrulline, homocitrulline, cysteic acid, t-butylglycine, t- 
butylalanine, phenylglycine, cyclohexylalanine, b-alanine, fluoroamino acids, designer amino acids 
such as b-methyl amino acids, Ca-methyl amino acids, Na-methyl amino acids, and amino acid 
analogs in general. Furthermore, the amino acid can be D (dextrorotary) or L (levorotary). 

35 Modifications 

The invention encompasses polypeptides which are differentially modified during or after 
translation, e.g., by glycosylation, acetylation, phosphorylation, amidation, derivatization by known 
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protecting/blocking groups, proteolytic cleavage, linkage to an antibody molecule or other cellular 
ligand, etc. Any of numerous chemical modifications may be carried out by known techniques, 
including but not limited, to specific chemical cleavage by cyanogen bromide, trypsin, 
chymotrypsin, papain, V8 protease, NaBH4; acetylation, formylation, oxidation, reduction; 
5 metabolic synthesis in the presence of tunicamycin; etc. 

Additional post-translational modifications encompassed by the invention include, for 
example, e.g., N-linked or O-linked carbohydrate chains, processing of N-terminal or C-terminal 
ends), attachment of chemical moieties to the amino acid backbone, chemical modifications of 
N-linked or O-linked carbohydrate chains, and addition or deletion of an N-terminal methionine 

10 residue as a result of prokaryotic host cell expression. The polypeptides may also be modified with 
a detectable label, such as an enzymatic, fluorescent, isotopic or affinity label to allow for detection 
and isolation of the protein. 

Also provided by the invention are chemically modified derivatives of the polypeptides of 
the invention which may provide additional advantages such as increased solubility, stability and 

15 circulating time of the polypeptide, or decreased immunogenicity. See U.S. Patent No: 4,1 79,337. 
The chemical moieties for derivatization may be selected See U.S. Patent No: 4,179,337, which 
disclosure is hereby incorporated by reference in its entirety. The chemical moieties for 
derivatization may be selected from water soluble polymers such as polyethylene glycol, ethylene 
glycol/propylene glycol copolymers, carboxymethylcellulose, dextran, polyvinyl alcohol and the 

20 like. The polypeptides may be modified at random positions within the molecule, or at 

predetermined positions within the molecule and may include one, two, three or more attached 
chemical moieties. 

The polymer may be of any molecular weight, and may be branched or unbranched. For 
polyethylene glycol, the preferred molecular weight is between about 1 kDa and about 100 kDa (the 

25 term "about" indicating that in preparations of polyethylene glycol, some molecules will weigh 
more, some less, than the stated molecular weight) for ease in handling and manufacturing. Other 
sizes may be used, depending on the desired therapeutic profile (e.g., the duration of sustained 
release desired, the effects, if any on biological activity, the ease in handling, the degree or lack of 
antigenicity and other known effects of the polyethylene glycol to a therapeutic protein or analog). 

30 The polyethylene glycol molecules (or other chemical moieties) should be attached to the 

protein with consideration of effects on functional or antigenic domains of the protein. There are a 
number of attachment methods available to those skilled in the art, e.g., EP 0 401 384, (coupling 
PEG to G-CSF), and Malik et al. (1992) (reporting pegylation of GM-CSF using tresyl chloride), 
which disclosures are hereby incorporated by reference in their entireties. For example, 

35 polyethylene glycol may be covalently bound through amino acid residues via a reactive group, 
such as, a free amino or carboxyl group. Reactive groups are those to which an activated 
polyethylene glycol molecule may be bound. The amino acid residues having a free amino group 
may include lysine residues and the N-terminal amino acid residues; those having a free carboxyl 
group may include aspartic acid residues glutamic acid residues and the C-terminal amino acid 
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residue. Sulfhydryl groups may also be used as a reactive group for attaching the polyethylene 
glycol molecules. Preferred for therapeutic purposes is attachment at an amino group, such as 
attachment at the N-terminus or lysine group. 

One may specifically desire proteins chemically modified at the N-terminus. Using 
5 polyethylene glycol as an illustration of the present composition, one may select from a variety of 
polyethylene glycol molecules (by molecular weight, branching, etc.), the proportion of 
polyethylene glycol molecules to protein (polypeptide) molecules in the reaction mix, the type of 
pegylation reaction to be performed, and the method of obtaining the selected N-terminally 
pegylated protein. The method of obtaining the N-terminally pegylated preparation (i.e., separating 

10 this moiety from other monopegylated moieties if necessary) may be by purification of the 
N-terminally pegylated material from a population of pegylated protein molecules. Selective 
proteins chemically modified at the N-terminus modification may be accomplished by reductive 
alkylation, which exploits differential reactivity of different types of primary amino groups (lysine 
versus the N-terminal) available for derivatization in a particular protein. Under the appropriate 

1 5 reaction conditions, substantially selective derivatization of the protein at the N-terminus with a 
carbonyl group containing polymer is achieved. 

Multimerization 

The polypeptides of the invention may be in monomers or multimers (i.e., dimers, trimers, 
tetramers and higher multimers). Accordingly, the present invention relates to monomers and 

20 multimers of the polypeptides of the invention, their preparation, and compositions containing them. 
In specific embodiments, the polypeptides of the invention are monomers, dimers, trimers or 
tetramers. In additional embodiments, the multimers of the invention are at least dimers, at least 
trimers, or at least tetramers. 

Multimers encompassed by the invention may be homomers or heteromers. As used herein, 

25 the term "homomer", refers to a multimer containing only polypeptides corresponding to the amino 
acid sequences of SEQ ID NOs: 170-338, 456-560, 785-918 or encoded by the clone inserts of the 
deposited clone pool (including fragments, variants, splice variants, and fusion proteins, 
corresponding to these polypeptides as described herein). These homomers may contain 
polypeptides having identical or different amino acid sequences. In a specific embodiment, a 

30 homomer of the invention is a multimer containing only polypeptides having an identical amino 
acid sequence. In another specific embodiment, a homomer of the invention is a multimer 
containing polypeptides having different amino acid sequences. In specific embodiments, the 
multimer of the invention is a homodimer (e.g., containing polypeptides having identical or 
different amino acid sequences) or a homotrimer (e.g., containing polypeptides having identical 

35 and/or different amino acid sequences). In additional embodiments, the homomenc multimer of the 
invention is at least a homodimer, at least a homotrimer, or at least a homotetramer. 

As used herein, the term "heteromer" refers to a multimer containing one or more 
heterologous polypeptides (i.e., polypeptides of different proteins) in addition to the polypeptides of 
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the invention. In a specific embodiment, the multimer of the invention is a heterodimer, a 
heterotrimer, or a heterotetramer. In additional embodiments, the heteromeric multimer of the 
invention is at least a heterodimer, at least a heterotrimer, or at least a heterotetramer. 

Multimers of the invention may be the result of hydrophobic, hydrophilic, ionic and/or 
5 covalent associations and/or may be indirectly linked, by for example, liposome formation. Thus, 
in one embodiment, multimers of the invention, such as, for example, homodimers or homotrimers, 
are formed when polypeptides of the invention contact one another in solution. In another 
embodiment, heteromultimers of the invention, such as, for example, heterotrimers or 
heterotetramers, are formed when polypeptides of the invention contact antibodies to the 

10 polypeptides of the invention (including antibodies to the heterologous polypeptide sequence in a 
fusion protein of the invention) in solution. In other embodiments, multimers of the invention are 
formed by covalent associations with and/or between the polypeptides of the invention. Such 
covalent associations may involve one or more amino acid residues contained in the polypeptide 
sequence (e.g., that recited in the sequence listing, or contained in the polypeptide encoded by a 

15 deposited clone). In one instance, the covalent associations are cross-linking between cysteine 
residues located within the polypeptide sequences, which interact in the native (i.e., naturally 
occurring) polypeptide. In another instance, the covalent associations are the consequence of 
chemical or recombinant manipulation. Alternatively, such covalent associations may involve one 
or more amino acid residues contained in the heterologous polypeptide sequence in a fusion protein 

20 of the invention. 

In one example, covalent associations are between the heterologous sequence contained in a 
fusion protein of the invention (see, e.g., US Patent Number 5,478,925, which disclosure is hereby 
incorporated by reference in its entirety). In a specific example, the covalent associations are 
between the heterologous sequence contained in an Fc fusion protein of the invention (as described 

25 herein). In another specific example, covalent associations of fusion proteins of the invention are 
between heterologous polypeptide sequence from another protein that is capable of forming 
covalently associated multimers, such as for example, oseteoprotegerin (see, e.g., International 
Publication No: WO 98/49305, the contents of which are herein incorporated by reference in its 
entirety). In another embodiment, two or more polypeptides of the invention are joined through 

30 peptide linkers. Examples include those peptide linkers described in U.S. Pat. No. 5,073,627 
(hereby incorporated by reference). Proteins comprising multiple polypeptides of the invention 
separated by peptide linkers may be produced using conventional recombinant DNA technology. 

Another method for preparing multimer polypeptides of the invention involves the use of 
polypeptides of the invention fused to a leucine zipper or isoleucine zipper polypeptide sequence. 

35 Leucine zipper and isoleucine zipper domains are polypeptides that promote multimerization of the 
proteins in which they are found. Leucine zippers were originally identified in several 
DNA-binding proteins, and have since been found in a variety of different proteins (Landschulz et 
al., 1988). Among the known leucine zippers are naturally occurring peptides and derivatives 
thereof that dimerize or trimerize. Examples of leucine zipper domains suitable for producing 
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soluble multimeric proteins of the invention are those described in PCT application WO 94/10308, 
hereby incorporated by reference. Recombinant fusion proteins comprising a polypeptide of the 
invention fused to a polypeptide sequence that dimerizes or trimerizes in solution are expressed in 
suitable host cells, and the resulting soluble multimeric fusion protein is recovered from the culture 
5 supernatant using techniques known in the art. 

Trimeric polypeptides of the invention may offer the advantage of enhanced biological 
activity. Preferred leucine zipper moieties and isoleucine moieties are those that preferentially form 
trimers. One example is a leucine zipper derived from lung surfactant protein D (SPD), as 
described in Hoppe et ah (1994) and in U.S. patent application Ser. No. 08/446,922, which 

10 disclosure is hereby incorporated by reference in its entirety. Other peptides derived from naturally 
occurring trimeric proteins may be employed in preparing trimeric polypeptides of the invention. In 
another example, proteins of the invention are associated by interactions between Flag® 
polypeptide sequence contained in fusion proteins of the invention containing Flag® polypeptide 
sequence. In a further embodiment, associations proteins of the invention are associated by 

1 5 interactions between heterologous polypeptide sequence contained in Flag® fusion proteins of the 
invention and anti Flag® antibody. 

The multimers of the invention may be generated using chemical techniques known in the 
art. For example, polypeptides desired to be contained in the multimers of the invention may be 
chemically cross-linked using linker molecules and linker molecule length optimization techniques 

20 known in the art (see, e.g., US Patent Number 5,478,925, which is herein incorporated by reference 
in its entirety). Additionally, multimers of the invention may be generated using techniques known 
in the art to form one or more inter-molecule cross-links between the cysteine residues located 
within the sequence of the polypeptides desired to be contained in the multimer (see, e.g., US Patent 
Number 5,478,925, which is herein incorporated by reference in its entirety). Further, polypeptides 

25 of the invention may be routinely modified by the addition of cysteine or biotin to the C terminus or 
N-terminus of the polypeptide and techniques known in the art may be applied to generate 
multimers containing one or more of these modified polypeptides (see, e.g., US Patent Number 
5,478,925, which is herein incorporated by reference in its entirety). Additionally, other techniques 
known in the art may be applied to generate liposomes containing the polypeptide components 

30 desired to be contained in the multimer of the invention (see, e.g., US Patent Number 5,478,925, 
which is herein incorporated by reference in its entirety). 

Alternatively, multimers of the invention may be generated using genetic engineering 
techniques known in the art. In one embodiment, polypeptides contained in multimers of the 
invention are produced recombinantly using fusion protein technology described herein or 

35 otherwise known in the art (see, e.g., US Patent Number 5,478,925, which is herein incorporated by 
reference in its entirety). In a specific embodiment, polynucleotides coding for a homodimer of the 
invention are generated by ligating a polynucleotide sequence encoding a polypeptide of the 
invention to a sequence encoding a linker polypeptide and then further to a synthetic polynucleotide 
encoding the translated product of the polypeptide in the reverse orientation from the original 
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C-terminus to the N-terminus (lacking the leader sequence) (see, e.g., US Patent Number 5,478,925, 
which is herein incorporated by reference in its entirety). In another embodiment, recombinant 
techniques described herein or otherwise known in the art are applied to generate recombinant 
polypeptides of the invention which contain a transmembrane domain (or hydrophobic or signal 
5 peptide) and which can be incorporated by membrane reconstitution techniques into liposomes (see, 
e.g., US Patent Number 5,478,925, which is herein incorporated by reference in its entirety). 

Mutated polypeptides 

To improve or alter the characteristics of GENSET polypeptides of the present invention, 
protein engineering may be employed. Recombinant DNA technology known to those skilled in the 

10 art can be used to create novel mutant proteins or muteins including single or multiple amino acid 
substitutions, deletions, additions, or fusion proteins. Such modified polypeptides can show, e.g., 
increased/decreased biological activity or increased/decreased stability. In addition, they may be 
purified in higher yields and show better solubility than the corresponding natural polypeptide, at 
least under certain purification and storage conditions. Further, the polypeptides of the present 

1 5 invention may be produced as multimers including dimers, trimers and tetramers. Multimerization 
may be facilitated by linkers or recombinantly though heterologous polypeptides such as Fc regions. 

N- and C-terminal deletions 

It is known in the art that one or more amino acids may be deleted from the N-terminus or 
C-terminus without substantial loss of biological function. For instance, Ron et al. (1993), reported 

20 modified KGF proteins that had heparin binding activity even if 3, 8, or 27 N-terminal amino acid 
residues were missing. Accordingly, the present invention provides polypeptides having one or 
more residues deleted from the amino terminus of the polypeptides of SEQ ID NOs: 1 70-338, 456- 
560, 785-91 8 or that encoded by the clone inserts of the deposited clone pool. Similarly, many 
examples of biologically functional C-terminal deletion mutants are known. For instance, 

25 Interferon gamma shows up to ten times higher activities by deleting 810 amino acid residues from 
the C-terminus of the protein {See, e.g., Dobeli, et al. 1988), which disclosure is hereby 
incorporated by reference in its entirety. Accordingly, the present invention provides polypeptides 
having one or more residues deleted from the carboxy terminus of the polypeptides shown of SEQ 
ID NOs: 170-338, 456-560, 785-918 or encoded by the clone inserts of the deposited clone pool. 

30 The invention also provides polypeptides having one or more amino acids deleted from both the 
amino and the carboxyl termini as described below. 

Other mutations 

Other mutants in addition to N- and C-terminal deletion forms of the protein discussed 
above are included in the present invention. It also will be recognized by one of ordinary skill in 
35 the art that some amino acid sequences of the GENSET polypeptides of the present invention can be 
varied without significant effect of the structure or function of the protein. If such differences in 
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sequence are contemplated, it should be remembered that there will be critical areas on the protein 
which determine activity. Thus, the invention further includes variations of the GENSET 
polypeptides which show substantial GENSET polypeptide activity. Such mutants include 
deletions, insertions, inversions, repeats, and substitutions selected according to general rules 
5 known in the art so as to have little effect on activity. For example, guidance concerning how to 
make phenotypically silent amino acid substitutions is provided. 

There are two main approaches for studying the tolerance of an amino acid sequence to 
change (see, Bowie et al. 1994, which disclosure is hereby incorporated by reference in its entirety). 
The first method relies on the process of evolution, in which mutations are either accepted or 

10 rejected by natural selection. 

The second approach uses genetic engineering to introduce amino acid changes at specific 
positions of a cloned gene and selections or screens to identify sequences that maintain 
functionality. These studies have revealed that proteins are surprisingly tolerant of amino acid 
substitutions. The studies indicate which amino acid changes are likely to be permissive at a certain 

1 5 position of the protein. For example, most buried amino acid residues require nonpolar side chains, 
whereas few features of surface side chains are generally conserved. Other such phenotypically 
silent substitutions are described by Bowie et ah (supra) and the references cited therein. 

Typically seen as conservative substitutions are the replacements, one for another, among 
the aliphatic amino acids Ala, Val, Leu and Phe; interchange of the hydroxyl residues Ser and Thr, 

20 exchange of the acidic residues Asp and Glu, substitution between the amide residues Asn and Gin, 
exchange of the basic residues Lys and Arg and replacements among the aromatic residues Phe, 
Tyr. Thus, the fragment, derivative, analog, or homologue of the polypeptide of the present 
invention may be, for example: (i) one in which one or more of the amino acid residues are 
substituted with a conserved or non-conserved amino acid residue (preferably a conserved amino 

25 acid residue) and such substituted amino acid residue may or may not be one encoded by the genetic 
code: or (ii) one in which one or more of the amino acid residues includes a substituent group: or 
(iii) one in which the GENSET polypeptide is fused with another compound, such as a compound to 
increase the half-life of the polypeptide (for example, polyethylene glycol): or (iv) one in which the 
additional amino acids are fused to the above form of the polypeptide, such as an IgG Fc fusion 

30 region peptide or leader or secretory sequence or a sequence which is employed for purification of 
the above form of the polypeptide or a pro-protein sequence. Such fragments, derivatives and 
analogs are deemed to be within the scope of those skilled in the art from the teachings herein. 

Thus, the GENSET polypeptides of the present invention may include one or more amino 
acid substitutions, deletions, or additions, either from natural mutations or human manipulation. As 

35 indicated, changes are preferably of a minor nature, such as conservative amino acid substitutions 
that do not significantly affect the folding or activity of the protein. The following groups of amino 
acids generally represent equivalent changes: (1) Ala, Pro, Gly, Glu, Asp, Gin, Asn, Ser, Thr; (2) 
Cys, Ser, Tyr, Thr; (3) Val, He, Leu, Met, Ala, Phe; (4) Lys, Arg, His; (5) Phe, Tyr, Trp, His. 
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A specific embodiment of a modified GENSET peptide molecule of interest according to 
the present invention, includes, but is not limited to, a peptide molecule which is resistant to 
proteolysis, is a peptide in which the -CONH- peptide bond is modified and replaced by a (CH2NH) 
reduced bond, a (NHCO) retro inverso bond, a (CH2-0) methylene-oxy bond, a (CH2-S) 
5 thiomethylene bond, a (CH2CH2) carba bond, a (CO-CH2) cetomethylene bond, a (CHOH-CH2) 
hydroxyethylene bond), a (N-N) bound, a E-alcene bond or also a -CH=CH- bond. The invention 
also encompasses a human GENSET polypeptide or a fragment or a variant thereof in which at least 
one peptide bond has been modified as described above. 

Amino acids in the GENSET proteins of the present invention that are essential for function 

1 0 can be identified by methods known in the art, such as site-directed mutagenesis or 

alanine-scanning mutagenesis (see, e.g., Cunningham et aL 1989, which disclosure is hereby 
incorporated by reference in its entirety). The latter procedure introduces single alanine mutations 
at every residue in the molecule. The resulting mutant molecules are then tested for biological 
activity using assays appropriate for measuring the function of the particular protein. Of special 

1 5 interest are substitutions of charged amino acids with other charged or neutral amino acids which 
may produce proteins with highly desirable improved characteristics, such as less aggregation. 
Aggregation may not only reduce activity but also be problematic when preparing pharmaceutical 
formulations, because aggregates can be immunogenic, (see, e.g., Pinckard et aL, 1967; Robbins, et 
aL, 1987; and Cleland, et aL, 1993). 

20 A further embodiment of the invention relates to a polypeptide which comprises the amino 

acid sequence of a GENSET polypeptide having an amino acid sequence which contains at least one 
conservative amino acid substitution, but not more than 50 conservative amino acid substitutions, 
not more than 40 conservative amino acid substitutions, not more than 30 conservative amino acid 
substitutions, and not more than 20 conservative amino acid substitutions. Also provided are 

25 polypeptides which comprise the amino acid sequence of a GENSET polypeptide, having at least 
one, but not more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 conservative amino acid substitutions. 

Polypeptide fragments 

Structural definition 

The present invention is further directed to fragments of the amino acid sequences 
30 described herein such as the polypeptides of SEQ ID NOs: 1 70-338, 456-560, 785-91 8 or those 
encoded by the clone inserts of the deposited clone pool. More specifically, the present invention 
embodies purified, isolated, and recombinant polypeptides comprising at least 6, preferably at least 
8 to 10, more preferably 12, 15, 20, 25, 30, 35, 40, 50, 60, 75, 100, 125, 150, 175, 200, 225, 250, 
275, or 300 consecutive amino acids of a polypeptide selected from the group consisting of the 
35 sequences of SEQ ID NOs: 1 70-338, 456-560, 785-91 8, the polypeptides encoded by the clone 
inserts of the deposited clone pool, and other polypeptides of the present invention. 
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In addition to the above polypeptide fragments, further preferred sub-genuses of 
polypeptides comprise at least 6 amino acids, wherein "at least 6" is defined as any integer between 
6 and the integer representing the C-terminal amino acid of the polypeptide of the present invention 
including the polypeptide sequences of the sequence listing below. Further included are species of 
5 polypeptide fragments at least 6 amino acids in length, as described above, that are further specified 
in terms of their N-terminal and C-terminal positions. However, included in the present invention 
as individual species are all polypeptide fragments, at least 6 amino acids in length, as described 
above, and may be particularly specified by a N-terminal and C-terminal position. That is, every 
combination of a N-terminal and C-terminal position that a fragment at least 6 contiguous amino 

10 acid residues in length could occupy, on any given amino acid sequence of the sequence listing or 
of the present invention is included in the present invention 

The present invention also provides for the exclusion of any fragment species specified by 
N-terminal and C-terminal positions or of any fragment sub-genus specified by size in amino acid 
residues as described above. Any number of fragments specified by N-terminal and C-terminal 

1 5 positions or by size in amino acid residues as described above may be excluded as individual 
species. 

The above polypeptide fragments of the present invention can be immediately envisaged 
using the above description and are therefore not individually listed solely for the purpose of not 
unnecessarily lengthening the specification. Moreover, the above fragments need not have a 

20 GENSET biological activity, although polypeptides having these activities are preferred 

embodiments of the invention, since they would be useful, for example, in immunoassays, in 
epitope mapping, epitope tagging, as vaccines, and as molecular weight markers. The above 
fragments may also be used to generate antibodies to a particular portion of the polypeptide. These 
antibodies can then be used in immunoassays well known in the art to distinguish between human 

25 and non-human cells and tissues or to determine whether cells or tissues in a biological sample are 
or are not of the same type which express the polypeptides of the present invention. 

It is noted that the above species of polypeptide fragments of the present invention may 
alternatively be described by the formula "a to b"; where "a" equals the N-terminal most amino acid 
position and "b" equals the C-terminal most amino acid position of the polynucleotide; and further 

30 where "a" equals an integer between 1 and the number of amino acids of the polypeptide sequence 
of the present invention minus 6, and where "b" equals an integer between 7 and the number of 
amino acids of the polypeptide sequence of the present invention; and where "a" is an integer 
smaller then "b M by at least 6. 

The present invention also provides for the exclusion of any species of polypeptide 

35 fragments of the present invention specified by 5' and 3' positions or sub-genuses of polypeptides 
specified by size in amino acids as described above. Any number of fragments specified by 5 ? and 
3' positions or by size in amino acids, as described above, may be excluded. 
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Domains 

Preferred polynucleotide fragments of the invention are domains of polypeptides of the 
invention. Such domains may eventually comprise linear or structural motifs and signatures 
5 including, but not limited to, leucine zippers, helix-turn-helix motifs, post-translational modification 
sites such as glycosylation sites, ubiquitination sites, alpha helices, and beta sheets, signal sequences 
encoding signal peptides which direct the secretion of the encoded proteins, sequences implicated in 
transcription regulation such as homeoboxes, acidic stretches, enzymatic active sites, substrate 
binding sites, and enzymatic cleavage sites. Such domains may present a particular biological 

10 activity such as DNA or RNA-binding, secretion of proteins, transcription regulation, enzymatic 
activity, substrate binding activity, etc. 

A domain has a size generally comprised between 3 and 1 000 amino acids. In a preferred 
embodiment, domains comprise a number of amino acids that is any integer between 6 and 200. 
Domains may be synthesized using any methods known to those skilled in the art, including those 

1 5 disclosed herein, particularly in the section entitled "Preparation of the polypeptides of the 

invention". Methods for determining the amino acids which make up a domain with a particular 
biological activity include mutagenesis studies and assays to determine the biological activity to be 
tested. 

Alternatively, the polypeptides of the invention may be scanned for motifs, domains and/or 

20 signatures in databases using any computer method known to those skilled in the art. Searchable 
databases include Prosite (Hofmann et ah, 1999; Bucher and Bairoch 1994), Pfam (Sonnhammer et 
ah, 1997; Henikoff et ah, 2000; Bateman et ah, 2000), Blocks (Henikoff et ah, 2000), Print 
(Attwood et ah, 1996), Prodom (Sonnhammer and Kahn, 1994; Corpet et ah 2000), Sbase (Pongor 
et ah, 1993; Murvai et ah, 2000), Smart (Schultz et ah, 1998), Dali/FSSP (Holm and Sander, 1996, 

25 1997 and 1999), HSSP (Sander and Schneider 1991), CATH (Orengo et ah, 1997; Pearl et ah, 
2000), SCOP (Murzin et ah, 1995; Lo Conte et ah, 2000), COG (Tatusov et ah, 1997 and 2000), 
specific family databases and derivatives thereof (Nevill-Manning et ah, 1998; Yona et ah, 1 999; 
Attwood et ah, 2000), each of which disclosures are hereby incorporated by reference in their 
entireties. For a review on available databases, see issue 1 of volume 28 of Nucleic Acid Research 

30 (2000), which disclosure is hereby incorporated by reference in its entirety. 

The domains of the present invention preferably comprises 6 to 200 amino acids (i.e. any 
integer between 6 and 200, inclusive) of a polypeptide of the present invention. Also, included in 
the present invention are domain fragments between the integers of 6 and the full length GENSET 
sequence of the sequence listing. All combinations of sequences between the integers of 6 and the 

35 full-length sequence of a GENSET polypeptide are included. The domain fragments may be 

specified by either the number of contiguous amino acid residues (as a sub-genus) or by specific N- 
terminal and C-terminal positions (as species) as described above for the polypeptide fragments of 
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the present invention. Any number of domain fragments of the present invention may also be 
excluded in the same manner. 



Epitopes and Antibody Fusions: 

A preferred embodiment of the present invention is directed to epitope-bearing polypeptides 
5 and epitope-bearing polypeptide fragments. These epitopes may be "antigenic epitopes" or both an 
"antigenic epitope" and an "immunogenic epitope". An "immunogenic epitope" is defined as a part 
of a protein that elicits an antibody response in vivo when the polypeptide is the immunogen. On 
the other hand, a region of polypeptide to which an antibody binds is defined as an "antigenic 
determinant" or "antigenic epitope." The number of immunogenic epitopes of a protein generally is 

10 less than the number of antigenic epitopes (See, e.g., Geysen, et al., 1984), which disclosure is 
hereby incorporated by reference in its entirety. It is particularly noted that although a particular 
epitope may not be immunogenic, it is nonetheless useful since antibodies can be made to both 
immunogenic and antigenic epitopes. 

An epitope can comprise as few as 3 amino acids in a spatial conformation, which is unique 

1 5 to the epitope. Generally an epitope consists of at least 6 such amino acids, and more often at least 
8-10 such amino acids. In preferred embodiment, antigenic epitopes comprise a number of amino 
acids that is any integer between 3 and 50. Fragments which function as epitopes may be produced 
by any conventional means (See, e.g., Houghten, 1985), also further described in U.S. Patent No. 
4,63 1,21, which disclosures are hereby incorporated by reference in their entireties. Methods for 

20 determining the amino acids which make up an epitope include x-ray crystallography, 2- 

dimensional nuclear magnetic resonance, and epitope mapping, e.g., the Pepscan method described 
by Geysen et al. (1984); PCT Publication No. WO 84/03564; and PCT Publication No. WO 
84/03506, which disclosures are hereby incorporated by reference in their entireties. Another 
example is the algorithm of Jameson and Wolf, (1988) (said reference incorporated by reference in 

25 its entirety). The Jameson- Wolf antigenic analysis, for example, may be performed using the 

computer program PROTEAN, using default parameters (Version 4.0 Windows, DNASTAR, Inc., 
1228 South Park Street Madison, WI. 

The epitope-bearing fragments of the present invention preferably comprise 6 to 50 amino 
acids (i.e. any integer between 6 and 50, inclusive) of a polypeptide of the present invention. Also, 

30 included in the present invention are antigenic fragments between the integers of 6 and the full 
length GENSET sequence of the sequence listing. All combinations of sequences between the 
integers of 6 and the full-length sequence of a GENSET polypeptide are included. The epitope- 
bearing fragments may be specified by either the number of contiguous amino acid residues (as a 
sub-genus) or by specific N-terminal and C-terminal positions (as species) as described above for 

35 the polypeptide fragments of the present invention. Any number of epitope-bearing fragments of 
the present invention may also be excluded in the same manner. 

Antigenic epitopes are useful, for example, to raise antibodies, including monoclonal 
antibodies that specifically bind the epitope (see, Wilson et al., 1984; and Sutcliffe, et al., 1983, 



57 



WO 02/083898 PCT/IBO 1/009 14 

which disclosures are hereby incorporated by reference in their entireties). The antibodies are then 
used in various techniques such as diagnostic and tissue/cell identification techniques, as described 
herein, and in purification methods such as immunoaffinity chromatography. 

Similarly, immunogenic epitopes can be used to induce antibodies according to methods 
5 well known in the art (See, Sutcliffe et al., supra; Wilson et al., supra; Chow et a/.(1985); and 
Bittle, et al., (1985), which disclosures are hereby incorporated by reference in their entireties). A 
preferred immunogenic epitope includes the natural GENSET protein. The immunogenic epitopes 
may be presented together with a carrier protein, such as an albumin, to an animal system (such as 
rabbit or mouse) or, if it is long enough (at least about 25 amino acids), without a carrier. However, 

10 immunogenic epitopes comprising as few as 8 to 10 amino acids have been shown to be sufficient 
to raise antibodies capable of binding to, at the very least, linear epitopes in a denatured polypeptide 
(e.g., in Western blotting.). 

Epitope-bearing polypeptides of the present invention are used to induce antibodies 
according to methods well known in the art including, but not limited to, in vivo immunization, in 

15 vitro immunization, and phage display methods (See, e.g., Sutcliffe, et ah, supra; Wilson, etal., 
supra, and Bittle, et al, supra). If in vivo immunization is used, animals may be immunized with 
free peptide; however, anti-peptide antibody titer may be boosted by coupling of the peptide to a 
macromolecular carrier, such as keyhole limpet hemacyanin (KLH) or tetanus toxoid. For instance, 
peptides containing cysteine residues may be coupled to a carrier using a linker such as 

20 -maleimidobenzoyl- N-hydroxysuccinimide ester (MBS), while other peptides may be coupled to 
carriers using a more general linking agent such as glutaraldehyde. Animals such as rabbits, rats 
and mice are immunized with either free or carrier-coupled peptides, for instance, by intraperitoneal 
and/or intradermal injection of emulsions containing about 100 ngs of peptide or carrier protein and 
Freund's adjuvant. Several booster injections may be needed, for instance, at intervals of about two 

25 weeks, to provide a useful titer of anti-peptide antibody, which can be detected, for example, by 
ELISA assay using free peptide adsorbed to a solid surface. The titer of anti-peptide antibodies in 
serum from an immunized animal may be increased by selection of anti-peptide antibodies, for 
instance, by adsorption to the peptide on a solid support and elution of the selected antibodies 
according to methods well known in the art. 

30 As one of skill in the art will appreciate, and discussed above, the polypeptides of the 

present invention comprising an immunogenic or antigenic epitope can be fused to heterologous 
polypeptide sequences. For example, the polypeptides of the present invention may be fused with 
the constant domain of immunoglobulins (IgA, IgE, IgG, IgM), or portions thereof (CHI, CH2, 
CH3, any combination thereof including both entire domains and portions thereof) resulting in 

35 chimeric polypeptides. These fusion proteins facilitate purification, and show an increased half-life 
in vivo. This has been shown, e.g., for chimeric proteins consisting of the first two domains of the 
human CD4-polypeptide and various domains of the constant regions of the heavy or light chains of 
mammalian immunoglobulins (See, e.g., EPA 0,394,827; and Traunecker et al., 1988, which 
disclosures are hereby incorporated by reference in their entireties). Fusion proteins that have a 
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disulflde-linked dimeric structure due to the IgG portion can also be more efficient in binding and 
neutralizing other molecules than monomeric polypeptides or fragments thereof alone (See, e.g., 
Fountoulakis et aL, 1995, which disclosure is hereby incorporated by reference in its entirety). 
Nucleic acids encoding the above epitopes can also be recombined with a gene of interest as an 
5 epitope tag to aid in detection and purification of the expressed polypeptide. 

Additional fusion proteins of the invention may be generated through the techniques of 
gene-shuffling, motif-shuffling, exon-shuffling, or codon-shuffling (collectively referred to as 
"DNA shuffling"). DNA shuffling may be employed to modulate the activities of polypeptides of 
the present invention thereby effectively generating agonists and antagonists of the polypeptides. 

10 See, for example, U.S. Patent Nos.: 5,605,793; 5,81 1,238; 5,834,252; 5,837,458; and Patten, et aL, 
(1997); Harayama, (1998); Hansson, et al (1999); and Lorenzo and Blasco, (1998). (Each of these 
documents are hereby incorporated by reference). In one embodiment, one or more components, 
motifs, sections, parts, domains, fragments, etc., of coding polynucleotides of the invention, or the 
polypeptides encoded thereby may be recombined with one or more components, motifs, sections, 

1 5 parts, domains, fragments, etc. of one or more heterologous molecules. 

The present invention further encompasses any combination of the polypeptide fragments 
listed in this section. 

Antibodies 

20 Definitions 

The present invention further relates to antibodies and T-cell antigen receptors (TCR), 
which specifically bind the polypeptides, and more specifically, the epitopes of the polypeptides of 
the present invention. The antibodies of the present invention include IgG (including IgGl, IgG2, 
IgG3, and IgG4), IgA (including IgAl and IgA2), IgD, IgE, or IgM, and IgY. The term "antibody" 

25 (Ab) refers to a polypeptide or group of polypeptides which are comprised of at least one binding 
domain, where a binding domain is formed from the folding of variable domains of an antibody 
molecule to form three-dimensional binding spaces with an internal surface shape and charge 
distribution complementary to the features of an antigenic determinant of an antigen, which allows 
an immunological reaction with the antigen. As used herein, the term "antibody" is meant to 

30 include whole antibodies, including single-chain whole antibodies, and antigen binding fragments 
thereof. In a preferred embodiment the antibodies are human antigen binding antibody fragments of 
the present invention include, but are not limited to, Fab, Fab' F(ab)2 and F(ab')2, Fd, single-chain 
Fvs (scFv), single-chain antibodies, disulfide-linked Fvs (sdFv) and fragments comprising either a 
V L or V H domain. The antibodies may be from any animal origin including birds and mammals. 

35 Preferably, the antibodies are human, murine, rabbit, goat, guinea pig, camel, horse, or chicken. 

Antigen-binding antibody fragments, including single-chain antibodies, may comprise the 
variable region(s) alone or in combination with the entire or partial of the following: hinge region, 
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CHI , CH2, and CH3 domains. Also included in the invention are any combinations of variable 
region(s) and hinge region, CHI, CH2, and CH3 domains. The present invention further includes 
chimeric, humanized, and human monoclonal and polyclonal antibodies, which specifically bind the 
polypeptides of the present invention. The present invention further includes antibodies that are 
5 anti-idiotypic to the antibodies of the present invention. 

The antibodies of the present invention may be monospecific, bispecific, and trispecific or 
have greater multispecificity. Multispecific antibodies may be specific for different epitopes of a 
polypeptide of the present invention or may be specific for both a polypeptide of the present 
invention as well as for heterologous compositions, such as a heterologous polypeptide or solid 

10 support material. See, e.g., WO 93/1 771 5; WO 92/08802; WO 91/00360; WO 92/05793; Tutt, et al. 
(1991); US Patents 5,573,920, 4,474,893, 5,601,819, 4,714,681, 4,925,648; Kostelny et aL (1992), 
which disclosures are hereby incorporated by reference in their entireties. 

Antibodies of the present invention may be described or specified in terms of the epitope(s) 
or epitope-bearing portion(s) of a polypeptide of the present invention, which are recognized or 

1 5 specifically bound by the antibody. The antibodies may specifically bind a complete protein encoded 
by a nucleic acid of the present invention, or a fragment thereof. Therefore, the epitope(s) or epitope 
bearing polypeptide portion(s) may be specified as described herein, e.g., by N-terminal and C- 
terminal positions, by size in contiguous amino acid residues, or otherwise described herein 
(including the sequence listing). Antibodies which specifically bind any epitope or polypeptide of 

20 the present invention may also be excluded as individual species. Therefore, the present invention 
includes antibodies that specifically bind specified polypeptides of the present invention, and allows 
for the exclusion of the same. 

Thus, another embodiment of the present invention is a purified or isolated antibody 
capable of specifically binding to a polypeptide comprising a sequence selected from the group 

25 consisting of the sequences of SEQ ID NOs: 1 70-338, 456-560, 785-91 8 and the sequences of the 
clone inserts of the deposited clone pool. In one aspect of this embodiment, the antibody is capable 
of binding to an epitope-containing polypeptide comprising at least 6 consecutive amino acids, 
preferably at least 8 to 10 consecutive amino acids, more preferably at least 12, 15, 20, 25, 30, 40, 
50, or 100 consecutive amino acids of a sequence selected from the group consisting of SEQ ID 

30 NOs: 170-338, 456-560, 785-918 and sequences of the clone inserts of the deposited clone pool. 

Antibodies of the present invention may also be described or specified in terms of their 
cross-reactivity. Antibodies that do not specifically bind any other analog, ortholog, or homologue 
of the polypeptides of the present invention are included. Antibodies that do not bind polypeptides 
with less than 95%, less than 90%, less than 85%, less than 80%, less than 75%, less than 70%, less 

35 than 65%, less than 60%, less than 55%, and less than 50% identity (as calculated using methods 
known in the art and described herein, e.g., using FASTDB and the parameters set forth herein) to a 
polypeptide of the present invention are also included in the present invention. Further included in 
the present invention are antibodies, which only bind polypeptides encoded by polynucleotides, 
which hybridize to a polynucleotide of the present invention under stringent hybridization 
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conditions (as described herein). Antibodies of the present invention may also be described or 
specified in terms of their binding affinity. Preferred binding affinities include those with a 
dissociation constant or Kd less than 5X1 0 6 M, 10" 6 M, 5X10" 7 M, 10" 7 M, 5X10" 8 M, 10" 8 M, 5X10" 9 M, 
10" 9 M, 5X10" 10 M, 10 10 M, 5X10 n M, 10 n M, 5X10 12 M, 10 12 M, 5X10' 13 M, 10 ,3 M, 5X10" ,4 M, 10' 
5 ,4 M, 5X10" 15 M, and 10- ,5 M. 

The invention also concerns a purified or isolated antibody capable of specifically binding 
to a mutated GENSET protein or to a fragment or variant thereof comprising an epitope of the 
mutated GENSET protein. 



10 Preparation of antibodies 

The antibodies of the present invention may be prepared by any suitable method known in 
the art. Some of these methods are described in more detail in the example entitled "Preparation of 
Antibody Compositions to the GENSET protein". For example, a polypeptide of the present 
invention or an antigenic fragment thereof can be administered to an animal in order to induce the 

15 production of sera containing "polyclonal antibodies". As used herein, the term "monoclonal 

antibody" is not limited to antibodies produced through hybridoma technology but it rather refers to 
an antibody that is derived from a single clone, including eukaryotic, prokaryotic, or phage clone, 
and not the method by which it is produced. Monoclonal antibodies can be prepared using a wide 
variety of techniques known in the art including the use of hybridoma, recombinant, and phage 

20 display technology. 

Hybridoma techniques include those known in the art {See, e.g., Harlow et al. 1988; 
Hammerling, et al, 1981). (Said references incorporated by reference in their entireties). Fab and 
F(ab')2 fragments may be produced, for example, from hybridoma-produced antibodies by 
proteolytic cleavage, using enzymes such as papain (to produce Fab fragments) or pepsin (to 

25 produce F(ab')2 fragments). 

Alternatively, antibodies of the present invention can be produced through the application 
of recombinant DNA technology or through synthetic chemistry using methods known in the art. 
For example, the antibodies of the present invention can be prepared using various phage display 
methods known in the art. In phage display methods, functional antibody domains are displayed on 

30 the surface of a phage particle, which carries polynucleotide sequences encoding them. Phage with 
a desired binding property are selected from a repertoire or combinatorial antibody library (e.g. 
human or murine) by selecting directly with antigen, typically antigen bound or captured to a solid 
surface or bead. Phage used in these methods are typically filamentous phage including fd and Ml 3 
with Fab, Fv or disulfide stabilized Fv antibody domains recombinantly fused to either the phage 

35 gene III or gene VIII protein. Examples of phage display methods that can be used to make the 
antibodies of the present invention include those disclosed in Brinkman et al. (1995); Ames, et al. 
(1995); Kettleborough, et al. (1994); Persic, et al. (1997); Burton et al (1994); PCT/GB91/01 134; 
WO 90/02809; WO 91/10737; WO 92/01047; WO 92/18619; WO 93/1 1236; WO 95/15982; WO 
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95/20401; and US Patents 5,698,426, 5,223,409, 5,403,484, 5,580,717, 5,427,908, 5,750,753, 
5,821,047, 5,571,698, 5,427,908, 5,516,637, 5,780,225, 5,658,727 and 5,733,743 (said references 
incorporated by reference in their entireties). 

As described in the above references, after phage selection, the antibody coding regions 
5 from the phage can be isolated and used to generate whole antibodies, including human antibodies, 
or any other desired antigen binding fragment, and expressed in any desired host including 
mammalian cells, insect cells, plant cells, yeast, and bacteria. For example, techniques to 
recombinantly produce Fab, Fab ? F(ab)2 and F(ab')2 fragments can also be employed using 
methods known in the art such as those disclosed in WO 92/22324; Mullinax et al. (1992); and 
10 Sawai et al. (1995); and Better et al. (1988) (said references incorporated by reference in their 
entireties). 

Examples of techniques which can be used to produce single-chain Fvs and antibodies 
include those described in U.S. Patents 4,946,778 and 5,258,498; Huston et al. (1991); Shu et al. 
(1993); and Skerra et ah (1988), which disclosures are hereby incorporated by reference in their 

15 entireties. For some uses, including in vivo use of antibodies in humans and in vitro detection 
assays, it may be preferable to use chimeric, humanized, or human antibodies. Methods for 
producing chimeric antibodies are known in the art. See e.g., Morrison, (1985); Oi et al., (1986); 
Gillies et al. (1989); and US Patent 5,807,715, which disclosures are hereby incorporated by 
reference in their entireties. Antibodies can be humanized using a variety of techniques including 

20 CDR-grafting (EP 0 239 400; WO 91/09967; US Patent 5,530,101; and 5,585,089), veneering or 
resurfacing, (EP 0 592 106; EP 0 519 596; Padlan, 1991 ; Studnicka et al, 1994; Roguska et aL, 
1994), and chain shuffling (US Patent 5,565,332), which disclosures are hereby incorporated by 
reference in their entireties. Human antibodies can be made by a variety of methods known in the 
art including phage display methods described above. See also, US Patents 4,444,887, 4,716,1 11, 

25 5,545,806, and 5,814,318; WO 98/46645; WO 98/50433; WO 98/24893; WO 96/34096; WO 
96/33735; and WO 91/10741 (said references incorporated by reference in their entireties). 

Further included in the present invention are antibodies recombinantly fused or chemically 
conjugated (including both covalently and non-covalently conjugations) to a polypeptide of the 
present invention. The antibodies may be specific for antigens other than polypeptides of the 

30 present invention. For example, antibodies of the present invention may be recombinantly fused or 
conjugated to molecules useful as labels in detection assays and effector molecules such as 
heterologous polypeptides, drugs, or toxins. See, e.g., WO 92/08495; WO 91/14438; WO 89/12624; 
US Patent 5,314,995; and EP 0 396 387, which disclosures are hereby incorporated by reference in 
their entireties. Fused antibodies may also be used to target the polypeptides of the present 

35 invention to particular cell types, either in vitro or in vivo, by fusing or conjugating the polypeptides 
of the present invention to antibodies specific for particular cell surface receptors. Antibodies fused 
or conjugated to the polypeptides of the present invention may also be used in vitro immunoassays 
and purification methods using methods known in the art (See e.g., Harbor et al. supra; WO 
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93/21232; EP 0 439 095; Naramura, M. et al. 1994; US Patent 5,474,981 ; Gillies et al, 1992; Fell et 
al., 1991) (said references incorporated by reference in their entireties). 

The present invention further includes compositions comprising the polypeptides of the 
present invention fused or conjugated to antibody domains other than the variable regions. For 
5 example, the polypeptides of the present invention may be fused or conjugated to an antibody Fc 
region, or portion thereof. The antibody portion fused to a polypeptide of the present invention may 
comprise the hinge region, CHI domain, CH2 domain, and CH3 domain or any combination of 
whole domains or portions thereof. The polypeptides of the present invention may be fused or 
conjugated to the above antibody portions to increase the in vivo half-life of the polypeptides or for 

10 use in immunoassays using methods known in the art. The polypeptides may also be fused or 

conjugated to the above antibody portions to form multimers. For example, Fc portions fused to the 
polypeptides of the present invention can form dimers through disulfide bonding between the Fc 
portions. Higher multimeric forms can be made by fusing the polypeptides to portions of IgA and 
IgM. Methods for fusing or conjugating the polypeptides of the present invention to antibody 

1 5 portions are known in the art. See e.g., US Patents 5,336,603, 5,622,929, 5,359,046, 5,349,053, 
5,447,851, 5,1 12,946; EP 0 307 434, EP 0 367 166; WO 96/04388, WO 91/06570; Ashkenazi et al. 
(1991); Zheng et al. (1995); and Vil et al. (1992) (said references incorporated by reference in their 
entireties). 

Non-human animals or mammals, whether wild-type or transgenic, which express a 
20 different species of GENSET than the one to which antibody binding is desired, and animals which 
do not express GENSET (i.e. a GENSET knock out animal as described herein) are particularly 
useful for preparing antibodies. GENSET knock out animals will recognize all or most of the 
exposed regions of a GENSET protein as foreign antigens, and therefore produce antibodies with a 
wider array of GENSET epitopes. Moreover, smaller polypeptides with only 10 to 30 amino acids 
25 may be useful in obtaining specific binding to any one of the GENSET proteins. In addition, the 
humoral immune system of animals which produce a species of GENSET that resembles the 
antigenic sequence will preferentially recognize the differences between the animal's native 
GENSET species and the antigen sequence, and produce antibodies to these unique sites in the 
antigen sequence. Such a technique will be particularly useful in obtaining antibodies that 
30 specifically bind to any one of the GENSET proteins. 

The antibodies of the invention may be labeled by, e.g., any one of the radioactive, 
fluorescent or enzymatic labels known in the art. 

Uses ofpol ynucleotides 

35 Uses of polynucleotides as reagents 

The polynucleotides of the present invention, particularly those described in the 
"Oligonucleotide primers and probes" section, may be used as reagents in isolation procedures, 
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diagnostic assays, and forensic procedures. For example, sequences from the GENSET 
polynucleotides of the invention may be detectably labeled and used as probes to isolate other 
sequences capable of hybridizing to them. In addition, sequences from the GENSET 
polynucleotides of the invention may be used to design PCR primers to be used in isolation, 
5 diagnostic, or forensic procedures. 

In forensic analyses 

PCR primers may be used in forensic analyses, such as the DNA fingerprinting techniques 
described below. Such analyses may utilize detectable probes or primers based on the sequences of 
the polynucleotides of the invention. Consequently, the present invention encompasses methods of 
10 identification of an individual using the polynucleotides of the invention in forensic analyses, 
wherein said method includes the steps of: 

a) obtaining a biological sample containing nucleic acid material from an individual; 

b) obtaining an identification pattern for this individual using the polynucleotides of the 
invention, particularly using GENSET primers and probes; 

15 c) comparing said identification pattern with a reference identification pattern; and 

d) determining whether said identification pattern is identical to said reference identification 

pattern. 

In one embodiment of this method, the identification pattern consists in sequences of 
amplicons obtained using GENSET primers as explained in the sections entitled "Forensic 
20 Matching by DNA Sequencing" and "Positive Identification by DNA Sequencing". 

In another embodiment, the identification pattern consists in unique band or dot patterns 
obtained using any method described in the sections entitled "Southern Blot Forensic 
Identification", "Dot Blot Identification Procedure" and "Alternative "Fingerprint" Identification 
Technique". 

25 Table I provides sets of related cDNAs of the invention, e.g. sequences that represent allelic 

variants of a single sequence. Such variants are especially useful for the herein-described forensic 
analyses, and are also useful as polymorphic markers to examine, e.g. associations between the 
herein-discussed GENSET genes and various diseases or conditions. 

Forensic Matching by DNA Sequencing 

30 In one exemplary method, DNA samples are isolated from forensic specimens of, for 

example, hair, semen, blood or skin cells by conventional methods. A panel of PCR primers 
designed from different polynucleotides of the invention using any technique known to those skilled 
in the art including those described herein, is then utilized to amplify DNA of approximately 100- 
200 bases in length from the forensic specimen. Corresponding sequences are obtained from a test 

35 subject. Each of these identification DNAs is then sequenced using standard techniques, and a 
simple database comparison determines the differences, if any, between the sequences from the 
subject and those from the sample. Statistically significant differences between the suspect's DNA 
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sequences and those from the sample conclusively prove a lack of identity. This lack of identity can 
be proven, for example, with only one sequence. Identity, on the other hand, should be 
demonstrated with a large number of sequences, all matching. Preferably, a minimum of 50 
statistically identical sequences of 100 bases in length are used to prove identity between the 
5 suspect and the sample. 

Positive Identification by DNA Sequencing 

The "Forensic Matching by DNA Sequencing" technique described herein may also be used 
on a larger scale to provide a unique fingerprint-type identification of any individual. In this 
technique, primers are prepared from a large number of polynucleotides of the invention. 

10 Preferably, 20 to 50 different primers are used. These primers are used to obtain a corresponding 
number of PCR-generated DNA segments from the individual in question. Each of these DNA 
segments is sequenced. The database of sequences generated through this procedure uniquely 
identifies the individual from whom the sequences were obtained. The same panel of primers may 
then be used at any later time to absolutely correlate tissue or other biological specimen with that 

15 individual. 

Southern Blot Forensic Identification 

The "Positive Identification by DNA Sequencing" procedure described herein is repeated to 
obtain a panel of at least 10 amplified sequences from an individual and a specimen. Preferably, the 
panel contains at least 50 amplified sequences. More preferably, the panel contains 100 amplified 

20 sequences. In some embodiments, the panel contains 200 amplified sequences. This PCR- 
generated DNA is then digested with one or a combination of, preferably, four base specific 
restriction enzymes. Such enzymes are commercially available and known to those of skill in the 
art. After digestion, the resultant gene fragments are size separated in multiple duplicate wells on 
an agarose gel and transferred to nitrocellulose using Southern blotting techniques well known to 

25 those with skill in the art. For a review of Southern blotting see Davis et al. (1 986), which 
disclosure is hereby incorporated by reference in its entirety. 

A panel of probes based on the sequences of the polynucleotides of the invention, or 
fragments thereof of at least 10 bases, are radioactively or colorimetrically labeled using methods 
known in the art, such as nick translation or end labeling, and hybridized to the Southern blot using 

30 techniques known in the art. Preferably, the probe comprises at least 12, 1 5, or 17 consecutive 

nucleotides from the polynucleotide of the invention. More preferably, the probe comprises at least 
20-30 consecutive nucleotides from the polynucleotide of the invention. In some embodiments, the 
probe comprises more than 30 nucleotides from the polynucleotide of the invention. In other 
embodiments, the probe comprises at least 40, at least 50, at least 75, at least 100, at least 1 50, or at 

35 least 200 consecutive nucleotides from the polynucleotide of the invention. 

Preferably, at least 5 to 10 of these labeled probes are used, and more preferably at least 
about 20 or 30 are used to provide a unique pattern. The resultant bands appearing from the 
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hybridization of a large sample of polynucleotide of the invention will be a unique identifier. Since 
the restriction enzyme cleavage will be different for every individual, the band pattern on the 
Southern blot will also be unique. Increasing the number of cDNA probes will provide a 
statistically higher level of confidence in the identification since there will be an increased number 
5 of sets of bands used for identification. 

Dot Blot Identification Procedure 

Another technique for identifying individuals using the polynucleotide sequences disclosed 
herein utilizes a dot blot hybridization technique. 

Genomic DNA is isolated from nuclei of subject to be identified. Oligonucleotide probes of 

10 approximately 30 bp in length are synthesized that correspond to at least 10, preferably 50 
sequences from the polynucleotide of the invention. The probes are used to hybridize to the 
genomic DNA through conditions known to those in the art. The oligonucleotides are end labeled 
with P 32 using polynucleotide kinase (Pharmacia). Dot Blots are created by spotting the genomic 
DNA onto nitrocellulose or the like using a vacuum dot blot manifold (BioRad, Richmond 

1 5 California). The nitrocellulose filter containing the genomic sequences is baked or U V linked to the 
filter, prehybridized and hybridized with labeled probe using techniques known in the art (Davis et 
ah 1986). The 32 P labeled DNA fragments are sequentially hybridized with successively stringent 
conditions to detect minimal differences between the 30 bp sequence and the DNA. 
Tetramethylammonium chloride is useful for identifying clones containing small numbers of 

20 nucleotide mismatches (Wood et aL, 1985). A unique pattern of dots distinguishes one individual 
from another individual. 

Alternative "Fingerprint" Identification Technique 

In a representative alternative fingerprinting procedure, the probes are derived from 
cDNAs. Preferably, a plurality of probes having sequences from different genes are used as 

25 follows. Polynucleotides containing at least 10 consecutive bases from these sequences can be used 
as probes. Preferably, the probe comprises at least 12, 15, or 17 consecutive nucleotides from the 
polynucleotide of the invention. More preferably, the probe comprises at least 20-30 consecutive 
nucleotides from the polynucleotide of the invention. In some embodiments, the probe comprises 
more than 30 nucleotides from the polynucleotide of the invention. In other embodiments, the 

30 probe comprises at least 40, at least 50, at least 75, at least 100, at least 1 50, or at least 200 
consecutive nucleotides from the polynucleotide of the invention. 

Oligonucleotides, generally 20-mers, are prepared from a large number, e.g. 50, 100, or 
200, of polynucleotides of the invention using commercially available oligonucleotide services such 
as Genset (Paris, France). Cell samples from the test subject are processed for DNA using 

35 techniques well known to those with skill in the art. The nucleic acid is digested with restriction 
enzymes such as EcoRI and Xbal. Following digestion, samples are applied to wells for 
electrophoresis. The procedure, as known in the art, may be modified to accommodate 
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polyacrylamide electrophoresis, however in this example, samples containing 5 ug of DNA are 
loaded into wells and separated on 0.8% agarose gels. The gels are transferred onto nitrocellulose 
using standard Southern blotting techniques. 

10 ng of each of the oligonucleotides are pooled and end-labeled with P 32 . The 
5 nitrocellulose is prehybridized with blocking solution and hybridized with the labeled probes. 

Following hybridization and washing, the nitrocellulose filter is exposed to X-Omat AR X-ray film. 
The resulting hybridization pattern will be unique for each individual. 

It is additionally contemplated within this example that the number of probe sequences used 
can be varied for additional accuracy or clarity. 

10 To find corresponding genomic DNA sequences 

The GENSET cDNAs of the invention may also be used to clone sequences located 
upstream of the cDNAs of the invention on the corresponding genomic DNA. Such upstream 
sequences may be capable of regulating gene expression, including promoter sequences, enhancer 
sequences, and other upstream sequences which influence transcription or translation levels. Once 
1 5 identified and cloned, these upstream regulatory sequences may be used in expression vectors 
designed to direct the expression of an inserted gene in a desired spatial, temporal, developmental, 
or quantitative fashion. 

Use of cDNAs or Fragments thereof to Clone Upstream Sequences from Genomic DNA 

Sequences derived from polynucleotides of the inventions may be used to isolate the 

20 promoters of the corresponding genes using chromosome walking techniques. In one chromosome 
walking technique, which utilizes the Genome Walker™ kit available from Clontech, five complete 
genomic DNA samples are each digested with a different restriction enzyme which has a 6 base 
recognition site and leaves a blunt end. Following digestion, oligonucleotide adapters are ligated to 
each end of the resulting genomic DNA fragments. 

25 For each of the five genomic DNA libraries, a first PCR reaction is performed according to 

the manufacturer's instructions (which are incorporated herein by reference) using an outer adaptor 
primer provided in the kit and an outer gene specific primer. The gene specific primer should be 
selected to be specific for the polynucleotide of the invention of interest and should have a melting 
temperature, length, and location in the polynucleotide of the invention which is consistent with its 

30 use in PCR reactions. Each first PCR reaction contains 5ng of genomic DNA, 5 \i\ of 1 OX Tth 

reaction buffer, 0.2 mM of each dNTP, 0.2 uM each of outer adaptor primer and outer gene specific 
primer, 1.1 mM of Mg(OAc) 2 , and 1 uj of the Tth polymerase SOX mix in a total volume of 50 ul 
The reaction cycle for the first PCR reaction is as follows: 1 min at 94 degrees Celsius / 2 sec at 94 
degree Celsius, 3 min at 72 degrees Celsius (7 cycles) / 2 sec at 94 degrees Celsius, 3 min at 67 

35 degrees Celsius (32 cycles) / 5 min at 67 degrees Celsius. 

The product of the first PCR reaction is diluted and used as a template for a second PCR 
reaction according to the manufacturer's instructions using a pair of nested primers which are 
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located internally on the amplicon resulting from the first PCR reaction. For example, 5 \il of the 
reaction product of the first PCR reaction mixture may be diluted 180 times. Reactions are made in 
a 50 fil volume having a composition identical to that of the first PCR reaction except the nested 
primers are used. The first nested primer is specific for the adaptor, and is provided with the 
5 GenomeWalker™ kit. The second nested primer is specific for the particular polynucleotide of the 
invention for which the promoter is to be cloned and should have a melting temperature, length, and 
location in the polynucleotide of the invention which is consistent with its use in PCR reactions. 
The reaction parameters of the second PCR reaction are as follows: 1 min at 94 degrees Celsius / 2 
sec at 94 degrees Celsius, 3 min at 72 degrees Celsius (6 cycles) / 2 sec at 94 degrees Celsius, 3 min 

10 at 67 degrees Celsius (25 cycles) / 5 min at 67 degrees Celsius 

The product of the second PCR reaction is purified, cloned, and sequenced using standard 
techniques. Alternatively, two or more human genomic DNA libraries can be constructed by using 
two or more restriction enzymes. The digested genomic DNA is cloned into vectors which can be 
converted into single stranded, circular, or linear DNA. A biotinylated oligonucleotide comprising 

15 at least 15 nucleotides from the polynucleotide of the invention sequence is hybridized to the single 
stranded DNA. Hybrids between the biotinylated oligonucleotide and the single stranded DNA 
containing the polynucleotide of the invention sequence are isolated as described herein. 
Thereafter, the single stranded DNA containing the polynucleotide of the invention sequence is 
released from the beads and converted into double stranded DNA using a primer specific for the 

20 polynucleotide of the invention sequence or a primer corresponding to a sequence included in the 
cloning vector. The resulting double stranded DNA is transformed into bacteria. DNAs containing 
the GENSET polynucleotide sequences are identified by colony PCR or colony hybridization. 

Identification of Promoters in Cloned Upstream Sequences 

Once the upstream genomic sequences have been cloned and sequenced as described above, 
25 prospective promoters and transcription start sites within the upstream sequences may be identified 
by comparing the sequences upstream of the polynucleotides of the inventions with databases 
containing known transcription start sites, transcription factor binding sites, or promoter sequences. 

In addition, promoters in the upstream sequences may be identified using promoter reporter 
vectors as follows. The expression of the reporter gene will be detected when placed under the 
30 control of regulatory active polynucleotide fragments or variants of the GENSET promoter region 
located upstream of the first exon of the GENSET gene. Suitable promoter reporter vectors, into 
which the GENSET promoter sequences may be cloned include pSEAP-Basic, pSEAP-Enhancer, 
p(3gal-Basic, ppgal-Enhancer, or pEGFP-1 Promoter Reporter vectors available from Clontech, or 
pGL2-basic or pGL3 -basic promoterless luciferase reporter gene vector from Promega. Briefly, 
35 each of these promoter reporter vectors include multiple cloning sites positioned upstream of a 

reporter gene encoding a readily assayable protein such as secreted alkaline phosphatase, luciferase, 
beta-galactosidase, or green fluorescent protein. The sequences upstream the GENSET coding 
region are inserted into the cloning sites upstream of the reporter gene in both orientations and 
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introduced into an appropriate host cell. The level of reporter protein is assayed and compared to 
the level obtained from a vector which lacks an insert in the cloning site. The presence of an 
elevated expression level in the vector containing the insert with respect to the control vector 
indicates the presence of a promoter in the insert. If necessary, the upstream sequences can be 
5 cloned into vectors which contain an enhancer for increasing transcription levels from weak 

promoter sequences. A significant level of expression above that observed with the vector lacking 
an insert indicates that a promoter sequence is present in the inserted upstream sequence. 

Promoter sequence within the upstream genomic DNA may be further defined by site 
directed mutagenesis, linker scanning analysis, or other techniques familiar to those skilled in the 

10 art. For example, the boundaries of promoters may be further investigated by constructing nested 5 5 
and/or 3' deletions in the upstream DNA using conventional techniques such as Exonuclease III or 
appropriate restriction endonuclease digestion. The resulting deletion fragments can be inserted 
into the promoter reporter vector to determine whether the deletion has increased, reduced or 
illuminated promoter activity, such as described, for example, by Coles et al. (1998), the disclosure 

1 5 of which is incorporated herein by reference in its entirety. In this way, the boundaries of the 

promoters may be defined. If desired, potential individual regulatory sites within the promoter may 
be identified using site directed mutagenesis or linker scanning to obliterate potential transcription 
factor binding sites within the promoter individually or in combination. The effects of these 
mutations on transcription levels may be determined by inserting the mutations into cloning sites in 

20 promoter reporter vectors. This type of assay is well known to those skilled in the art and is 

described in WO 97/17359, US Patent No. 5,374,544; EP 582 796; US Patent No. 5,698,389; US 
5,643,746; US Patent No. 5,502,176; and US Patent 5,266,488; the disclosures of which are 
incorporated by reference herein in their entirety. 

The strength and the specificity of the promoter of each GENSET gene can be assessed 

25 through the expression levels of a detectable polynucleotide operably linked to the GENSET 
promoter in different types of cells and tissues. The detectable polynucleotide may be either a 
polynucleotide that specifically hybridizes with a predefined oligonucleotide probe, or a 
polynucleotide encoding a detectable protein, including a GENSET polypeptide or a fragment or a 
variant thereof. This type of assay is well known to those skilled in the art and is described in US 

30 Patent No. 5,502, 1 76; and US Patent No. 5,266,488; the disclosures of which are incorporated by 
reference herein in their entirety. Some of the methods are discussed in more detail elsewhere in 
the application. 

The promoters and other regulatory sequences located upstream of the polynucleotides of 
the inventions may be used to design expression vectors capable of directing the expression of an 
35 inserted gene in a desired spatial, temporal, developmental, or quantitative manner. A promoter 
capable of directing the desired spatial, temporal, developmental, and quantitative patterns may be 
selected using the results of the expression analysis described herein. For example, if a promoter 
which confers a high level of expression in muscle is desired, the promoter sequence upstream of a 
polynucleotide of the invention derived from an mRNA which is expressed at a high level in muscle 
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may be used in the expression vector. Such vectors are described in more detail elsewhere in the 
application. 

Preferably, the desired promoter is placed near multiple restriction sites to facilitate the 
cloning of the desired insert downstream of the promoter, such that the promoter is able to drive 
5 expression of the inserted gene. The promoter may be inserted in conventional nucleic acid 
backbones designed for extrachromosomal replication, integration into the host chromosomes or 
transient expression. Suitable backbones for the present expression vectors include retroviral 
backbones, backbones from eukaryotic episomes such as SV40 or Bovine Papilloma Virus, 
backbones from bacterial episomes, or artificial chromosomes. 
10 Preferably, the expression vectors also include a polyA signal downstream of the multiple 

restriction sites for directing the polyadenylation of mRNA transcribed from the gene inserted into 
the expression vector. 

To find similar sequences 

Polynucleotides of the invention may be used to isolate and/or purify nucleic acids similar 
1 5 thereto using any methods well known to those skilled in the art including the techniques based on 
hybridization or on amplification described in this section. These methods may be used to obtain the 
genomic DNAs which encode the mRNAs from which the GENSET cDNAs are derived, mRNAs 
corresponding to GENSET cDNAs, or nucleic acids which are homologous to GENSET cDNAs or 
fragments thereof, such as variants, species homologues or orthologs. Thus, a plurality of cDNAs 
20 similar to GENSET polynucleotides may be provided as cDNA libraries for subsequent evaluation 
of the encoded proteins or used in diagnostic assays as described herein. cDNAs prepared by any 
method described therein may be subsequently engineered to obtain nucleic acids which include 
desired fragments of the cDNA using conventional techniques such as subcloning, PCR, or in vitro 
oligonucleotide synthesis. For example, nucleic acids which include only the coding sequences 
25 may be obtained using techniques known to those skilled in the art. Similarly, nucleic acids 
containing any other desired fragment of the coding sequences for the encoded protein may be 
obtained. 

Indeed, cDNAs of the present invention or fragments thereof may be used to isolate nucleic 
acids similar to cDNAs from a cDNA library or a genomic DNA library. Such cDNA libraries or 
30 genomic DNA libraries may be obtained from a commercial source or made using techniques 

familiar to those skilled in the art such as those described in PCT publication WO 00/37491, which 
disclosure is hereby incorporated by reference in its entirety. Examples of methods for obtaining 
nucleic acids similar to GENSET polynucleotides are described below. 

Hybridization-based methods 

35 Techniques for identifying cDNA clones in a cDNA library which hybridize to a given 

probe sequence are disclosed in Sambrook et aL, (1989) and in Hames and Higgins (1985), the 
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disclosures of which are incorporated herein by reference in their entireties. The same techniques 

may be used to isolate genomic DNAs. 

Briefly, cDNA or genomic DNA clones which hybridize to the detectable probe are 

identified and isolated for further manipulation as follows. Any polynucleotide fragment of the 
5 invention may be used as a probe, in particular those defined in the "Oligonucleotide primers and 

probes" section. A probe comprising at least 10 consecutive nucleotides from a GENSET cDNA or 

fragment thereof is labeled with a detectable label such as a radioisotope or a fluorescent molecule. 

Preferably, the probe comprises at least 12, 15, or 17 consecutive nucleotides from the cDNA or 

fragment thereof. More preferably, the probe comprises 20 to 30 consecutive nucleotides from the 
10 cDNA or fragment thereof. In some embodiments, the probe comprises more than 30 nucleotides 

from the cDN A or fragment thereof. 

Techniques for labeling the probe are well known and include phosphorylation with 

polynucleotide kinase, nick translation, in vitro transcription, and non radioactive techniques. The 

cDNAs or genomic DNAs in the library are transferred to a nitrocellulose or nylon filter and 
1 5 denatured. After blocking of non specific sites, the filter is incubated with the labeled probe for an 

amount of time sufficient to allow binding of the probe to cDNAs or genomic DNAs containing a 

sequence capable of hybridizing thereto. 

By varying the stringency of the hybridization conditions used to identify cDNAs or 

genomic DNAs which hybridize to the detectable probe, cDNAs or genomic DNAs having different 
20 levels of identity to the probe can be identified and isolated as described below. 

Stringent conditions 

"Stringent hybridization conditions" are defined as conditions in which only nucleic acids 
having a high level of identity to the probe are able to hybridize to said probe. These conditions may 
25 be calculated as follows: 

For probes between 14 and 70 nucleotides in length the melting temperature (Tm) is 
calculated using the formula: Tm=81 .5+16.6(log (Na+))+0.41 (fraction G+C)-(600/N) where N is 
the length of the probe. 

If the hybridization is carried out in a solution containing formamide, the melting 
30 temperature may be calculated using the equation: Tm=81 .5+16.6(log (Na+))+0.41 (fraction G+C)- 
(0.63% formamide)-(600/N) where N is the length of the probe. 

Pre hybridization may be carried out in 6X SSC, 5X Denhardt's reagent, 0.5% SDS, 100 fig 
denatured fragmented salmon sperm DNA or 6X SSC, 5X Denhardt's reagent, 0.5% SDS, 100 u.g 
denatured fragmented salmon sperm DNA, 50% formamide. The formulas for SSC and Denhardt's 
35 solutions are listed in Sambrook et al, 9 1986. 

Hybridization is conducted by adding the detectable probe to the prehybridization solutions 
listed above. Where the probe comprises double stranded DNA, it is denatured before addition to 
the hybridization solution. The filter is contacted with the hybridization solution for a sufficient 
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period of time to allow the probe to hybridize to nucleic acids containing sequences complementary 
thereto or homologous thereto. For probes over 200 nucleotides in length, the hybridization may be 
carried out at 1 5-25°C below the Tm. For shorter probes, such as oligonucleotide probes, the 
hybridization may be conducted at 15-25°C below the Tm. Preferably, for hybridizations in 6X 
5 SSC, the hybridization is conducted at approximately 68°C. Preferably, for hybridizations in 50% 
formamide containing solutions, the hybridization is conducted at approximately 42°C. 

Following hybridization, the filter is washed in 2X SSC, 0.1% SDS at room temperature for 
15 minutes. The filter is then washed with 0.1X SSC, 0.5% SDS at room temperature for 30 
minutes to 1 hour. Thereafter, the solution is washed at the hybridization temperature in 0.1 X SSC, 
10 0.5% SDS. A final wash is conducted in 0.1X SSC at room temperature. 

Nucleic acids which have hybridized to the probe are identified by autoradiography or other 
conventional techniques. 

Low and moderate conditions 

Changes in the stringency of hybridization and signal detection are primarily accomplished 

15 through the manipulation of formamide concentration (lower percentages of formamide result in 
lowered stringency); salt conditions, or temperature. The above procedure may thus be modified to 
identify nucleic acids having decreasing levels of identity to the probe sequence. For example, the 
hybridization temperature may be decreased in increments of 5°C from 68°C to 42°C in a 
hybridization buffer having a sodium concentration of approximately 1M. Following hybridization, 

20 the filter may be washed with 2X SSC, 0.5% SDS at the temperature of hybridization. These 
conditions are considered to be "moderate" conditions above 50°C and "low" conditions below 
50°C. Alternatively, the hybridization may be carried out in buffers, such as 6X SSC, containing 
formamide at a temperature of 42°C. In this case, the concentration of formamide in the 
hybridization buffer may be reduced in 5% increments from 50% to 0% to identify clones having 

25 decreasing levels of identity to the probe. Following hybridization, the filter may be washed with 
6X SSC, 0.5% SDS at 50°C. These conditions are considered to be "moderate" conditions above 
25% formamide and "low" conditions below 25% formamide. cDNAs or genomic DNAs which 
have hybridized to the probe are identified by autoradiography or other conventional techniques. 
Note that variations in the above conditions may be accomplished through the inclusion 

30 and/or substitution of alternate blocking reagents used to suppress background in hybridization 
experiments. Typical blocking reagents include Denhardt's reagent, BLOTTO, heparin, denatured 
salmon sperm DNA, and commercially available proprietary formulations. The inclusion of specific 
blocking reagents may require modification of the hybridization conditions described above, due to 
problems with compatibility. 

35 Consequently, the present invention encompasses methods of isolating nucleic acids similar 

to the polynucleotides of the invention, comprising the steps of: 

a) contacting a collection of cDNA or genomic DNA molecules with a detectable probe 
comprising at least 12, 15, 18, 20, 23, 25, 28, 30, 35, 40 or 50 consecutive nucleotides of a sequence 
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selected from the group consisting of the sequences of SEQ ID NOs: 1-169, 339-455, 561-784, the 
sequences of clones inserts of the deposited clone pool and sequences complementary thereto under 
stringent, moderate or low conditions which permit said probe to hybridize to at least a cDNA or 
genomic DNA molecule in said collection; 
5 b) identifying said cDNA or genomic DNA molecule which hybridizes to said detectable 

probe; and 

c) isolating said cDNA or genomic DNA molecule which hybridized to said probe. 
PCR-based methods 

In addition to the above described methods, other protocols are available to obtain 
10 homologous cDNAs using GENSET cDNA of the present invention or fragment thereof as outlined 
in the following paragraphs. 

cDNAs may be prepared by obtaining mRNA from the tissue, cell, or organism of interest 
using mRNA preparation procedures utilizing polyA selection procedures or other techniques 
known to those skilled in the art. A first primer capable of hybridizing to the polyA tail of the 
1 5 mRNA is hybridized to the mRNA and a reverse transcription reaction is performed to generate a 
first cDNA strand. 

The term "capable of hybridizing to the polyA tail of said mRNA" refers to and embraces 
all primers containing stretches of thymidine residues, so-called oligo(dT) primers, that hybridize to 
the 3' end of eukaryotic poly(A)+ mRNAs to prime the synthesis of a first cDNA strand. 

20 Techniques for generating said oligo (dT) primers and hybridizing them to mRNA to subsequently 
prime the reverse transcription of said hybridized mRNA to generate a first cDNA strand are well 
known to those skilled in the art and are described in Current Protocols in Molecular Biology, John 
Wiley and Sons, Inc. 1997 and Sambrook et a!., 1989. Preferably, said oligo (dT) primers are 
present in a large excess in order to allow the hybridization of all mRNA 3'ends to at least one oligo 

25 (dT) molecule. The priming and reverse transcription steps are preferably performed between 37°C 
and 55°C depending on the type of reverse transcriptase used. Preferred oligo(dT) primers for 
priming reverse transcription of mRNAs are oligonucleotides containing a stretch of thymidine 
residues of sufficient length to hybridize specifically to the polyA tail of mRNAs, preferably of 12 
to 18 thymidine residues in length. More preferably, such oligo(T) primers comprise an additional 

30 sequence upstream of the poly(dT) stretch in order to allow the addition of a given sequence to the 
5 'end of all first cDNA strands which may then be used to facilitate subsequent manipulation of the 
cDNA. Preferably, this added sequence is 8 to 60 residues in length. For instance, the addition of a 
restriction site in 5' of cDNAs facilitates subcloning of the obtained cDNA. Alternatively, such an 
added 5' end may also be used to design primers of PCR to specifically amplify cDNA clones of 

35 interest. 

The first cDNA strand is then hybridized to a second primer. Any polynucleotide fragment 
of the invention may be used, and in particular those described in the "Oligonucleotide primers and 
probes" section. This second primer contains at least 10 consecutive nucleotides of a polynucleotide 



73 



WO 02/083898 PCT/IBO 1/009 14 

of the invention. Preferably, the primer comprises at least 10, 12, 15, 17, 18, 20, 23, 25, or 28 
consecutive nucleotides of a polynucleotide of the invention. In some embodiments, the primer 
comprises more than 30 nucleotides of a polynucleotide of the invention. If it is desired to obtain 
cDNAs containing the full protein coding sequence, including the authentic translation initiation 
5 site, the second primer used contains sequences located upstream of the translation initiation site. 
The second primer is extended to generate a second cDNA strand complementary to the first cDNA 
strand. Alternatively, RT-PCR may be performed as described above using primers from both ends 
of the cDNA to be obtained. 

The double stranded cDNAs made using the methods described above are isolated and 
10 cloned. The cDNAs may be cloned into vectors such as plasmids or viral vectors capable of 

replicating in an appropriate host cell. For example, the host cell may be a bacterial, mammalian, 
avian, or insect cell. 

Techniques for isolating mRNA, reverse transcribing a primer hybridized to mRNA to 
generate a first cDNA strand, extending a primer to make a second cDNA strand complementary to 
1 5 the first cDNA strand, isolating the double stranded cDNA and cloning the double stranded cDN A 
are well known to those skilled in the art and are described in Current Protocols in Molecular 
Biology, John Wiley & Sons, Inc. 1997 and Sambrook et al y 1989. 

Consequently, the present invention encompasses methods of making cDNAs. In a first 
embodiment, the method of making a cDNA comprises the steps of 
20 a) contacting a collection of mRNA molecules from human cells with a primer comprising 

at least 12, 15, 18, 20, 23, 25, 28, 30, 35, 40, or 50 consecutive nucleotides of a sequence selected 
from the group consisting of the sequences complementary to SEQ ID NOs:l-169, 339-455, 561- 
784 and sequences complementary to a clone insert of the deposited clone pool; 
b) hybridizing said primer to an mRNA in said collection; 
25 c) reverse transcribing said hybridized primer to make a first cDNA strand from said 

mRNA; 

d) making a second cDNA strand complementary to said first cDNA strand; and 

e) isolating the resulting cDNA comprising said first cDN A strand and said second cDNA 

strand. 

30 Another embodiment of the present invention is a purified cDNA obtainable by the method 

of the preceding paragraph. In one aspect of this embodiment, the cDNA encodes at least a portion 
of a human polypeptide. 

In a second embodiment, the method of making a cDNA comprises the steps of 

a) contacting a collection of mRNA molecules from human cells with a first primer capable 
35 of hybridizing to the polyA tail of said mRNA; 

b) hybridizing said first primer to said polyA tail; 

c) reverse transcribing said mRNA to make a first cDNA strand; 

d) making a second cDNA strand complementary to said first cDNA strand using at least 
one primer comprising at least 12, 15, 18, 20, 23, 25, 28, 30, 35, 40, or 50 consecutive nucleotides of 
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a sequence selected from the group consisting of SEQ ID NOs: 1-169, 339-455, 561-784 and 
sequences of clone inserts of the deposited clone pool; and 

e) isolating the resulting cDNA comprising said first cDNA strand and said second cDNA 

strand. 

5 In another aspect of this method the second cDNA strand is made by 

a) contacting said first cDNA strand with a second primer comprising at least 12, 15, 18, 20, 
23, 25, 28, 30, 35, 40, or 50 consecutive nucleotides of a sequence selected from the group consisting 
of SEQ ID NOs: 1-169, 339-455, 561-784 and sequences of clone inserts of the deposited clone 
pool, and a third primer which sequence is fully included within the sequence of said first primer; 
10 b) performing a first polymerase chain reaction with said second and third primers to 

generate a first PCR product; 

c) contacting said first PCR product with a fourth primer, comprising at least 12, 15, 18, 20, 
23, 25, 28, 30, 35, 40, or 50 consecutive nucleotides of said sequence selected from the group 
consisting of SEQ ID NOs: 1-1 69, 339-455, 561-784 and sequences of clone inserts of the deposited 

15 clone pool, and a fifth primer, which sequence is fully included within the sequence of said third 
primer, wherein said fourth and fifth hybridize to sequences within said first PCR product; and 

d) performing a second polymerase chain reaction, thereby generating a second PCR 

product. 

Alternatively, the second cDNA strand may be made by contacting said first cDNA strand 
20 with a second primer comprising at least 12, 15, 18, 20, 23, 25, 28, 30, 35, 40, or 50 consecutive 

nucleotides of a sequence selected from the group consisting of SEQ ID NOs:l-169, 339-455, 561- 
784 and sequences of clone inserts of the deposited clone pool, and a third primer which sequence is 
fully included within the sequence of said first primer and performing a polymerase chain reaction 
with said second and third primers to generate said second cDNA strand. 
25 Alternatively, the second cDNA strand may be made by: 

a) contacting said first cDNA strand with a second primer comprising at least 12, 15, 18, 20, 
23, 25, 28, 30, 35, 40, or 50 consecutive nucleotides of a sequence selected from the group consisting 
of SEQ ID NOs: 1-169, 339-455, 561-784 and sequences of clone inserts of the deposited clone 
pool; 

30 b) hybridizing said second primer to said first strand cDNA; and 

c) extending said hybridized second primer to generate said second cDNA strand. 

Another embodiment of the present invention is a purified cDNA obtainable by a method of 

making a cDNA of the invention. In one aspect of this embodiment, said cDNA encodes at least a 

portion of a human polypeptide. 

3 5 Other protocols 

Alternatively, other procedures may be used for obtaining homologous cDNAs. In one 
approach, cDNAs are prepared from mRNA and cloned into double stranded phagemids as follows. 
The cDNA library in the double stranded phagemids is then rendered single stranded by treatment 
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with an endonuclease, such as the Gene II product of the phage Fl and an exonuclease (Chang et 
al, 1993, which disclosure is hereby incorporated by reference in its entirety). A biotinylated 
oligonucleotide comprising the sequence of a fragment of a known GENSET cDNA, genomic DNA 
or fragment thereof is hybridized to the single stranded phagemids. Preferably, the fragment 
5 comprises at least 10, 12, 15, 17, 18, 20, 23, 25, or 28 consecutive nucleotides of a sequence 

selected from the group consisting of the sequences of SEQ ID NOs: 1-169, 339-455, 561-784 and 
sequences of clone inserts of the deposited clone pool. 

Hybrids between the biotinylated oligonucleotide and phagemids are isolated by incubating 
the hybrids with streptavidin coated paramagnetic beads and retrieving the beads with a magnet 

10 (Fry et al, 1992, which disclosure is hereby incorporated by reference in its entirety). Thereafter, 
the resulting phagemids are released from the beads and converted into double stranded DNA using 
a primer specific for the GENSET cDNA or fragment used to design the biotinylated 
oligonucleotide. Alternatively, protocols such as the Gene Trapper kit (Gibco BRL), which 
disclosure is which disclosure is hereby incorporated by reference in its entirety, may be used. The 

15 resulting double stranded DNA is transformed into bacteria. Homologous cDNAs to the GENSET 
cDNA or fragment thereof sequence are identified by colony PCR or colony hybridization. 

As a chromosome marker 

Chromosomal localization of the cDNA of the present invention were determined using 
information from public and proprietary databases. Table II lists the putative chromosomal location 

20 of the polynucleotides of the present invention. Column one lists the sequence identification number 
with the corresponding chromosomal location listed in column two. Thus, the present invention 
also relates to methods and compositions using the chromosomal location of the polynucleotides of 
the invention to construct a human high resolution map or to identify a given chromosome in a 
sample using any techniques known to those skilled in the art including those disclosed below. 

25 GENSET polynucleotides may also be mapped to their chromosomal locations using any 

methods or techniques known to those skilled in the art including radiation hybrid (RH) mapping, 
PCR-based mapping and Fluorescence in situ hybridization (FISH) mapping described below. 

Radiation hybrid mapping 

Radiation hybrid (RH) mapping is a somatic cell genetic approach that can be used for high 
30 resolution mapping of the human genome. In this approach, cell lines containing one or more 

human chromosomes are lethally irradiated, breaking each chromosome into fragments whose size 
depends on the radiation dose. These fragments are rescued by fusion with cultured rodent cells, 
yielding subclones containing different fragments of the human genome. This technique is 
described by Benham et al. (1989) and Cox et al., (1990), which disclosures are hereby 
35 incorporated by reference in their entireties. The random and independent nature of the subclones 
permits efficient mapping of any human genome marker. Human DNA isolated from a panel of 80- 
100 cell lines provides a mapping reagent for ordering GENSET cDNAs or genomic DNAs. In this 
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approach, the frequency of breakage between markers is used to measure distance, allowing 
construction of fine resolution maps as has been done using conventional ESTs (Schuler et aL 9 
1996), which disclosure is hereby incorporated by reference in its entirety. 

RH mapping has been used to generate a high-resolution whole genome radiation hybrid 
5 map of human chromosome 17q22-q25.3 across the genes for growth hormone (GH) and thymidine 
kinase (TK) (Foster et aL, 1996), the region surrounding the Gorlin syndrome gene (Obermayr et 
aL, 1996), 60 loci covering the entire short arm of chromosome 12 (Raeymaekers et al, 1995), the 
region of human chromosome 22 containing the neurofibromatosis type 2 locus (Frazer et ah, 1992) 
and 13 loci on the long arm of chromosome 5 (Warrington et al., 1991), which disclosures are 
10 hereby incorporated by reference in their entireties. 

Mapping of cDNAs to Human Chromosomes using PCR techniques 

GENSET cDNAs and genomic DNAs may be assigned to human chromosomes using PCR 
based methodologies. In such approaches, oligonucleotide primer pairs are designed from the 
cDNA sequence to minimize the chance of amplifying through an intron. Preferably, the 
15 oligonucleotide primers are 1 8-23 bp in length and are designed for PCR amplification. The 

creation of PCR primers from known sequences is well known to those with skill in the art. For a 
review of PCR technology see Erlich (1992), which disclosure is hereby incorporated by reference 
in its entirety. 

The primers are used in polymerase chain reactions (PCR) to amplify templates from total 

20 human genomic DNA. PCR conditions are as follows: 60 ng of genomic DNA is used as a template 
for PCR with 80 ng of each oligonucleotide primer, 0.6 unit of Taq polymerase, and 1 uCu of a 32 P- 
labeled deoxycytidine triphosphate. The PCR is performed in a microplate thermocycler (Techne) 
under the following conditions: 30 cycles of 94 degrees Celsius, 1.4 min; 55 degrees Celsius, 2 min; 
and 72 degrees Celsius, 2 min; with a final extension at 72 degrees Celsius for 10 min. The 

25 amplified products are analyzed on a 6% polyacrylamide sequencing gel and visualized by 

autoradiography. If the length of the resulting PCR product is identical to the distance between the 
ends of the primer sequences in the cDNA from which the primers are derived, then the PCR 
reaction is repeated with DNA templates from two panels of human-rodent somatic cell hybrids, 
BIOS PCRable DNA (BIOS Corporation) and NIGMS Human-Rodent Somatic Cell Hybrid 

30 Mapping Panel Number 1 (NIGMS, Camden, NJ). 

PCR is used to screen a series of somatic cell hybrid cell lines containing defined sets of 
human chromosomes for the presence of a given cDNA or genomic DNA. DNA is isolated from 
the somatic hybrids and used as starting templates for PCR reactions using the primer pairs from the 
GENSET cDNAs or genomic DNAs. Only those somatic cell hybrids with chromosomes 

35 containing the human gene corresponding to the GENSET cDNA or genomic DNA will yield an 
amplified fragment. The GENSET cDNAs or genomic DNAs are assigned to a chromosome by 
analysis of the segregation pattern of PCR products from the somatic hybrid DNA templates. The 
single human chromosome present in all cell hybrids that give rise to an amplified fragment is the 
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chromosome containing that GENSET cDNA or genomic DNA. For a review of techniques and 
analysis of results from somatic cell gene mapping experiments, see Ledbetter et ah, (1990), which 
disclosure is hereby incorporated by reference in its entirety. 

Mapping of cDNAs to Chromosomes Using Fluorescence in situ Hybridization 

5 Fluorescence in situ hybridization (FISH) allows the GENSET cDNA or genomic DNA to 

be mapped to a particular location on a given chromosome. The chromosomes to be used for 
fluorescence in situ hybridization techniques may be obtained from a variety of sources including 
cell cultures, tissues, or whole blood. 

In a preferred embodiment, chromosomal localization of a GENSET cDNA or genomic 

10 DNA is obtained by FISH as described by Cherif et ah (1990), which disclosure is hereby 
incorporated by reference in its entirety. Metaphase chromosomes are prepared from 
phytohemagglutinin (PHA)-stimulated blood cell donors. PHA-stimulated lymphocytes from 
healthy males are cultured for 72 h in RPMI-1640 medium. For synchronization, methotrexate (10 
uM) is added for 17 h, followed by addition of 5-bromodeoxyuridine (5-BudR, 0.1 mM) for 6 h. 

15 Colcemid (1 ug/ml) is added for the last 15 min before harvesting the cells. Cells are collected, 
washed in RPMI, incubated with a hypotonic solution of KC1 (75 mM) at 37 degrees Celsius for 15 
min and fixed in three changes of methanol :acetic acid (3:1). The cell suspension is dropped onto a 
glass slide and air dried. The GENSET cDNA or genomic DNA is labeled with biotin-16 dUTP by 
nick translation according to the manufacturer's instructions (Bethesda Research Laboratories, 

20 Bethesda, MD), purified using a Sephadex G-50 column (Pharmacia, Upssala, Sweden) and 

precipitated. Just prior to hybridization, the DNA pellet is dissolved in hybridization buffer (50% 
formamide, 2 X SSC, 10% dextran sulfate, 1 mg/ml sonicated salmon sperm DNA, pH 7) and the 
probe is denatured at 70 degrees Celsius for 5-10 min. 

Slides kept at -20 degrees Celsius are treated for 1 h at 37 degrees Celsius with RNase A 

25 (100 ug/ml), rinsed three times in 2 X SSC and dehydrated in an ethanol series. Chromosome 
preparations are denatured in 70% formamide, 2 X SSC for 2 min at 70 degrees Celsius, then 
dehydrated at 4 degrees Celsius. The slides are treated with proteinase K (10 ug/100 ml in 20 mM 
Tris-HCI, 2 mM CaCI 2 ) at 37 degrees Celsius for 8 min and dehydrated. The hybridization mixture 
containing the probe is placed on the slide, covered with a coverslip, sealed with rubber cement and 

30 incubated overnight in a humid chamber at 37 degrees Celsius. After hybridization and post- 
hybridization washes, the biotinylated probe is detected by avidin-FITC and amplified with 
additional layers of biotinylated goat anti-avidin and avidin-FITC. For chromosomal localization, 
fluorescent R-bands are obtained as previously described (Cherif et ah, 1990). The slides are 
observed under a LEICA fluorescence microscope (DMRXA). Chromosomes are counterstained 

35 with propidium iodide and the fluorescent signal of the probe appears as two symmetrical yellow- 
green spots on both chromatids of the fluorescent R-band chromosome (red). Thus, a particular 
GENSET cDNA or genomic DNA may be localized to a particular cytogenetic R-band on a given 
chromosome. 
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Use of cDNAs to Construct or Expand Chromosome Maps 

Once the GENSET cDNAs or genomic DNAs have been assigned to particular 
chromosomes using any technique known to those skilled in the art those skilled in the art, 
5 particularly those described herein, they may be utilized to construct a high resolution map of the 
chromosomes on which they are located or to identify the chromosomes in a sample. 

Chromosome mapping involves assigning a given unique sequence to a particular 
chromosome as described above. Once the unique sequence has been mapped to a given 
chromosome, it is ordered relative to other unique sequences located on the same chromosome. 

10 One approach to chromosome mapping utilizes a series of yeast artificial chromosomes (YACs) 
bearing several thousand long inserts derived from the chromosomes of the organism from which 
the GENSET cDNAs or genomic DNAs are obtained. This approach is described in Nagaraja et al. 
(1997), which disclosure is hereby incorporated by reference in its entirety. Briefly, in this 
approach each chromosome is broken into overlapping pieces which are inserted into the YAC 

1 5 vector. The YAC inserts are screened using PCR or other methods to determine whether they 
include the GENSET cDNA or genomic DNA whose position is to be determined. Once an insert 
has been found which includes the GENSET cDNA or genomic DNA, the insert can be analyzed by 
PCR or other methods to determine whether the insert also contains other sequences known to be on 
the chromosome or in the region from which the GENSET cDNA or genomic DNA was derived. 

20 This process can be repeated for each insert in the YAC library to determine the location of each of 
the GENSET cDNA or genomic DNA relative to one another and to other known chromosomal 
markers. In this way, a high resolution map of the distribution of numerous unique markers along 
each of the organisms chromosomes may be obtained. 

Identification of genes associated with hereditary diseases or drug response 

25 This example illustrates an approach useful for the association of GENSET cDNAs or 

genomic DNAs with particular phenotypic characteristics. In this example, a particular GENSET 
cDNA or genomic DNA is used as a test probe to associate that GENSET cDNA or genomic DNA 
with a particular phenotypic characteristic. 

GENSET cDNAs or genomic DNAs are mapped to a particular location on a human 

30 chromosome using techniques such as those described herein or other techniques known in the art. 
A search of Mendelian Inheritance in Man (V. McKusick, Mendelian Inheritance in Man; available 
on line through Johns Hopkins University Welch Medical Library) reveals the region of the human 
chromosome which contains the GENSET cDNA or genomic DNA to be a very gene rich region 
containing several known genes and several diseases or phenotypes for which genes have not been 

35 identified. The gene corresponding to this GENSET cDNA or genomic DNA thus becomes an 
immediate candidate for each of these genetic diseases. 
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Cells from patients with these diseases or phenotypes are isolated and expanded in culture. 
PCR primers from the GENSET cDNA or genomic DNA are used to screen genomic DNA, mRNA 
or cDNA obtained from the patients. GENSET cDNAs or genomic DNAs that are not amplified in 
the patients can be positively associated with a particular disease by further analysis. Alternatively, 
5 the PCR analysis may yield fragments of different lengths when the samples are derived from an 
individual having the phenotype associated with the disease than when the sample is derived from a 
healthy individual, indicating that the gene containing the cDNA may be responsible for the genetic 
disease. 

Uses of polynucleotides in recombinant vectors 

10 The present invention also relates to recombinant vectors including the isolated 

polynucleotides of the present invention, and to host cells recombinant for a polynucleotide of the 
invention, such as the above vectors, as well as to methods of making such vectors and host cells 
and for using them for production of GENSET polypeptides by recombinant techniques. 

Recombinant Vectors 

1 5 The term "vector" is used herein to designate either a circular or a linear DNA or RNA 

molecule, which is either double-stranded or single-stranded, and which comprise at least one 
polynucleotide of interest that is sought to be transferred in a cell host or in a unicellular or 
multicellular host organism. The present invention encompasses a family of recombinant vectors 
that comprise a regulatory polynucleotide and/or a coding polynucleotide derived from either the 

20 GENSET genomic sequence or the cDNA sequence. Generally, a recombinant vector of the 
invention may comprise any of the polynucleotides described herein, including regulatory 
sequences, coding sequences and polynucleotide constructs, as well as any GENSET primer or 
probe as defined herein. 

In a first preferred embodiment, a recombinant vector of the invention is used to amplify 

25 the inserted polynucleotide derived from a GENSET genomic sequence or a GENSET cDNA, for 
example any cDNA selected from the group consisting of sequences of SEQ ID NOs: 1-169, 339- 
455, 561-784, sequences of clone inserts of the deposited clone pool, variants and fragments thereof 
in a suitable cell host, this polynucleotide being amplified at every time that the recombinant vector 
replicates. 

30 A second preferred embodiment of the recombinant vectors according to the invention 

comprises expression vectors comprising either a regulatory polynucleotide or a coding nucleic acid 
of the invention, or both. Within certain embodiments, expression vectors are employed to express 
a GENSET polypeptide which can be then purified and, for example be used in ligand screening 
assays or as an immunogen in order to raise specific antibodies directed against the GENSET 

35 protein. In other embodiments, the expression vectors are used for constructing transgenic animals 
and also for gene therapy. Expression requires that appropriate signals are provided in the vectors, 
said signals including various regulatory elements, such as enhancers/promoters from both viral and 
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mammalian sources that drive expression of the genes of interest in host cells. Dominant drug 
selection markers for establishing permanent, stable cell clones expressing the products are 
generally included in the expression vectors of the invention, as they are elements that link 
expression of the drug selection markers to expression of the polypeptide. 
5 More particularly, the present invention relates to expression vectors which include nucleic 

acids encoding a GENSET protein, preferably a GENSET protein with an amino acid sequence 
selected from the group consisting of sequences of SEQ ID NOs: 1 70-338, 456-560, 785-91 8, 
sequences of polypeptides encoded by the clone inserts of the deposited clone pool, variants and 
fragments thereof. The polynucleotides of the present invention may be used to express an encoded 

10 protein in a host organism to produce a beneficial effect. In such procedures, the encoded protein 
may be transiently expressed in the host organism or stably expressed in the host organism. The 
encoded protein may have any of the activities described herein. The encoded protein may be a 
protein which the host organism lacks or, alternatively, the encoded protein may augment the 
existing levels of the protein in the host organism. 

1 5 Some of the elements which can be found in the vectors of the present invention are 

described in further detail in the following sections. 

General features of the expression vectors of the invention 

A recombinant vector according to the invention comprises, but is not limited to, a YAC 
(Yeast Artificial Chromosome), a BAC (Bacterial Artificial Chromosome), a phage, a phagemid, a 
20 cosmid, a plasmid or even a linear DNA molecule which may comprise a chromosomal, non- 
chromosomal, semi-synthetic and synthetic DNA. Such a recombinant vector can comprise a 
transcriptional unit comprising an assembly of: 

(1) a genetic element or elements having a regulatory role in gene expression, for example 
promoters or enhancers. Enhancers are cis-acting elements of DNA, usually from about 10 to 300 

25 bp in length that act on the promoter to increase the transcription. 

(2) a structural or coding sequence which is transcribed into mRNA and eventually 
translated into a polypeptide, said structural or coding sequence being operably linked to the 
regulatory elements described in (1); and 

(3) appropriate transcription initiation and termination sequences. Structural units intended 
30 for use in yeast or eukaryotic expression systems preferably include a leader sequence enabling 

extracellular secretion of translated protein by a host cell. Alternatively, when a recombinant 
protein is expressed without a leader or transport sequence, it may include a N-terminal residue. 
This residue may or may not be subsequently cleaved from the expressed recombinant protein to 
provide a final product. 

35 Generally, recombinant expression vectors will include origins of replication, selectable 

markers permitting transformation of the host cell, and a promoter derived from a highly expressed 
gene to direct transcription of a downstream structural sequence. The heterologous structural 
sequence is assembled in appropriate phase with translation initiation and termination sequences, 



81 



WO 02/083898 PCT/IBO 1/009 14 

and preferably a leader sequence capable of directing secretion of the translated protein into the 
periplasmic space or the extracellular medium. In a specific embodiment wherein the vector is 
adapted for transfecting and expressing desired sequences in mammalian host cells, preferred 
vectors will comprise an origin of replication in the desired host, a suitable promoter and enhancer, 
5 and also any necessary ribosome binding sites, polyadenylation signals, splice donor and acceptor 
sites, transcriptional termination sequences, and 5 '-flanking non-transcribed sequences. DNA 
sequences derived from the SV40 viral genome, for example SV40 origin, early promoter, 
enhancer, splice and polyadenylation signals may be used to provide the required non-transcribed 
genetic elements. 

10 The in vivo expression of a GENSET polypeptide of the present invention may be useful in 

order to correct a genetic defect related to the expression of the native gene in a host organism, for 
the treatment or prevention of any disease or condition that can be treated or prevented by 
increasing the level of GENSET polypeptide expression, or to the production of a biologically 
inactive GENSET protein. Consequently, the present invention also comprises recombinant 

1 5 expression vectors mainly designed for the in vivo production of a GENSET polypeptide the present 
invention by the introduction of the appropriate genetic material in the organism or the patient to be 
treated. This genetic material may be introduced in vitro in a cell that has been previously extracted 
from the organism, the modified cell being subsequently reintroduced in the said organism, directly 
in vivo into the appropriate tissue. 

20 Regulatory Elements 

The suitable promoter regions used in the expression vectors according to the present 
invention are chosen taking into account the cell host in which the heterologous gene has to be 
expressed. The particular promoter employed to control the expression of a nucleic acid sequence 
of interest is not believed to be important, so long as it is capable of directing the expression of the 

25 nucleic acid in the targeted cell. Thus, where a human cell is targeted, it is preferable to position the 
nucleic acid coding region adjacent to and under the control of a promoter that is capable of being 
expressed in a human cell, such as, for example, a human or a viral promoter. 

A suitable promoter may be heterologous with respect to the nucleic acid for which it 
controls the expression or alternatively can be endogenous to the native polynucleotide containing 

30 the coding sequence to be expressed. Additionally, the promoter is generally heterologous with 
respect to the recombinant vector sequences within which the construct promoter/coding sequence 
has been inserted. 

Promoter regions can be selected from any desired gene using, for example, CAT 
(chloramphenicol transferase) vectors and more preferably pKK232-8 and pCM7 vectors. 
35 Preferred bacterial promoters are the LacI, LacZ, the T3 or T7 bacteriophage RNA 

polymerase promoters, the gpt, lambda PR, PL and trp promoters (EP 0036776), the polyhedrin 
promoter, or the plO protein promoter from baculovirus (Kit Novagen), (Smith et al., 1983; 
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O'Reilly et aL, 1992; which disclosures are hereby incorporated by reference in their entireties), the 
lambda PR promoter or also the trc promoter. 

Eukaryotic promoters include CMV immediate early, HSV thymidine kinase, early and late 
SV40, LTRs from retrovirus, and mouse metallothionein-L. Selection of a convenient vector and 
5 promoter is well within the level of ordinary skill in the art. The choice of a promoter is well within 
the ability of a person skilled in the field of genetic engineering. For example, one may refer to the 
book of Sambrook et al., (1989) or also to the procedures described by Fuller et at., (1996), which 
disclosures are hereby incorporated by reference in their entireties. 

Other regulatory elements 

10 Where a cDNA insert is employed, one will typically desire to include a polyadenylation 

signal to effect proper polyadenylation of the gene transcript. The nature of the polyadenylation 
signal is not believed to be crucial to the successful practice of the invention, and any such sequence 
may be employed such as human growth hormone and SV40 polyadenylation signals. Also 
contemplated as an element of the expression cassette is a terminator. These elements can serve to 

15 enhance message levels and to minimize read through from the cassette into other sequences. 

Selectable Markers 

Selectable markers confer an identifiable change to the cell permitting easy identification of 
cells containing the expression construct. The selectable marker genes for selection of transformed 
host cells are preferably dihydrofolate reductase or neomycin resistance for eukaryotic cell culture, 
20 TRP1 for X cerevisiae or tetracycline, rifampicin or ampicillin resistance in E. Coli, or levan 
saccharase for mycobacteria, this latter marker being a negative selection marker. 

Preferred Vectors 
Bacterial vectors 

As a representative but non-limiting example, useful expression vectors for bacterial use 
25 can comprise a selectable marker and a bacterial origin of replication derived from commercially 
available plasmids comprising genetic elements of pBR322 (ATCC 37017). Such commercial 
vectors include, for example, pKK223-3 (Pharmacia, Uppsala, Sweden), and pGEMl (Promega 
Biotec, Madison, WI, USA). 

Large numbers of other suitable vectors are known to those of skill in the art, and 
30 commercially available, such as the following bacterial vectors: pQE70, pQE60, pQE-9 (Qiagen), 
pbs, pDIO, phagescript, psiX174, pbluescript SK, pbsks, pNH8A, pNH16A, pNH18A, pNH46A 
(Stratagene); ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia); pWLNEO, pSV2CAT, 
p OG44, pXTl, pSG (Stratagene); pSVK3, pBPV, pMSG, pSVL (Pharmacia); pQE-30 
(QIAexpress). 
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The PI bacteriophage vector may contain large inserts ranging from about 80 to about 100 
kb. The construction of PI bacteriophage vectors such as pi 58 or pl58/neo8 are notably described 
by Sternberg (1992, 1994), which disclosure is hereby incorporated by reference in its entirety. 
5 Recombinant PI clones comprising GENSET nucleotide sequences may be designed for inserting 
large polynucleotides of more than 40 kb (See Linton et al., 1993), which disclosure is hereby 
incorporated by reference in its entirety. To generate PI DNA for transgenic experiments, a 
preferred protocol is the protocol described by McCormick et al. (1994), which disclosure is hereby 
incorporated by reference in its entirety. Briefly, E. coli (preferably strain NS3529) harboring the 

10 PI plasmid are grown overnight in a suitable broth medium containing 25 |ig/ml of kanamycin. 
The PI DNA is prepared from the E. coli by alkaline lysis using the Qiagen Plasmid Maxi kit 
(Qiagen, Chatsworth, CA, USA), according to the manufacturer's instructions. The PI DNA is 
purified from the bacterial lysate on two Qiagen-tip 500 columns, using the washing and elution 
buffers contained in the kit. A phenol/chloroform extraction is then performed before precipitating 

1 5 the DNA with 70% ethanol. After solubilizing the DNA in TE ( 1 0 mM Tris-HCl, pH 7.4, 1 mM 
EDTA), the concentration of the DNA is assessed by spectrophotometry. 

When the goal is to express a PI clone comprising GENSET polypeptide-encoding 
nucleotide sequences in a transgenic animal, typically in transgenic mice, it is desirable to remove 
vector sequences from the PI DNA fragment, for example by cleaving the PI DNA at rare-cutting 

20 sites within the PI polylinker (S/zI, Not\ or Sail). The PI insert is then purified from vector 

sequences on a pulsed-field agarose gel, using methods similar to those originally reported for the 
isolation of DNA from YACs (See e. g., Schedl et ah, 1993a; Peterson et al., 1993), which 
disclosures are hereby incorporated by reference in their entireties. At this stage, the resulting 
purified insert DNA can be concentrated, if necessary, on a Millipore Ultrafree-MC Filter Unit 

25 (Millipore, Bedford, MA, USA — 30,000 molecular weight limit) and then dialyzed against 

microinjection buffer (10 mM Tris-HCl, pH 7.4; 250 \iM EDTA) containing 100 mM NaCl, 30 uM 
spermine, 70 jiM spermidine on a microdyalisis membrane (type VS, 0.025 jxM from Millipore). 
The intactness of the purified PI DNA insert is assessed by electrophoresis on 1% agarose (Sea 
Kem GTG; FMC Bio-products) pulse-field gel and staining with ethidium bromide. 

30 Viral vectors 

In one specific embodiment, the vector is derived from an adenovirus. Preferred adenovirus 
vectors according to the invention are those described by Feldman and Steg (1996), or Ohno et al., 
(1994), which disclosures are hereby incorporated by reference in their entireties. Another 
preferred recombinant adenovirus according to this specific embodiment of the present invention is 
35 the human adenovirus type 2 or 5 (Ad 2 or Ad 5) or an adenovirus of animal origin (French patent 
application No. FR-93. 05954), which disclosure is hereby incorporated by reference in its entirety. 

Retrovirus vectors and adeno-associated virus vectors are generally understood to be the 
recombinant gene delivery systems of choice for the transfer of exogenous polynucleotides in vivo , 
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particularly to mammals, including humans. These vectors provide efficient delivery of genes into 
cells, and the transferred nucleic acids are stably integrated into the chromosomal DNA of the host. 
Particularly preferred retroviruses for the preparation or construction of retroviral in vitro or in vitro 
gene delivery vehicles of the present invention include retroviruses selected from the group 
5 consisting of Mink-Cell Focus Inducing Virus, Murine Sarcoma Virus, Reticuloendotheliosis virus 
and Rous Sarcoma virus. Particularly preferred Murine Leukemia Viruses include the 4070A and 
the 1504A viruses, Abelson (ATCC No VR-999), Friend (ATCC No VR-245), Gross (ATCC No 
VR-590), Rauscher (ATCC No VR-998) and Moloney Murine Leukemia Virus (ATCC No VR- 
190; PCT Application No WO 94/24298). Particularly preferred Rous Sarcoma Viruses include 

10 Bryan high titer (ATCC Nos VR-334, VR-657, VR-726, VR-659 and VR-728). Other preferred 
retroviral vectors are those described in Roth et al. (1996), PCT Application No WO 93/25234, 
PCT Application No WO 94/ 06920, Roux et al., (1989), Julan et al., (1992), and Neda et al., 
(1991), which disclosures are hereby incorporated by reference in their entireties. 

Yet another viral vector system that is contemplated by the invention comprises the adeno- 

1 5 associated virus (AAV). The adeno-associated virus is a naturally occurring defective virus that 
requires another virus, such as an adenovirus or a herpes virus, as a helper virus for efficient 
replication and a productive life cycle (Muzyczka et al., 1992), which disclosure is hereby 
incorporated by reference in its entirety. It is also one of the few viruses that may integrate its DNA 
into non-dividing cells, and exhibits a high frequency of stable integration (Flotte et al. 1992; 

20 Samulski et al., 1989; McLaughlin et al., 1989), which disclosures are hereby incorporated by 
reference in their entireties. One advantageous feature of AAV derives from its reduced efficacy 
for transducing primary cells relative to transformed cells. 

BAC vectors 

The bacterial artificial chromosome (BAC) cloning system (Shizuya et al., 1992), which 
25 disclosure is hereby incorporated by reference in its entirety, has been developed to stably maintain 
large fragments of genomic DNA (100-300 kb) in E. coli. A preferred BAC vector comprises a 
pBeloBACl 1 vector that has been described by Kim et al. (1996), which disclosure is hereby 
incorporated by reference in its entirety. BAC libraries are prepared with this vector using size- 
selected genomic DNA that has been partially digested using enzymes that permit ligation into 
30 either the Bam HI or Hindlll sites in the vector. Flanking these cloning sites are T7 and SP6 RNA 
polymerase transcription initiation sites that can be used to generate end probes by either RNA 
transcription or PCR methods. After the construction of a BAC library in E. coli, BAC DNA is 
purified from the host cell as a supercoiled circle. Converting these circular molecules into a linear 
form precedes both size determination and introduction of the BACs into recipient cells. The 
35 cloning site is flanked by two Not I sites, permitting cloned segments to be excised from the vector 
by Not I digestion. Alternatively, the DNA insert contained in the pBeloBACl 1 vector may be 
linearized by treatment of the BAC vector with the commercially available enzyme lambda 
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terminase that leads to the cleavage at the unique casN site, but this cleavage method results in a 
full length BAC clone containing both the insert DNA and the BAC sequences. 

Baculovirus 

Another specific suitable host vector system is the pVL 1392/1 393 baculovirus transfer 
5 vector (Pharmingen) that is used to transfect the SF9 cell line (ATCC No. CRL 171 1) which is 
derived from Spodoptera frugiperda. Other suitable vectors for the expression of the GENSET 
polypeptide of the present invention in a baculovirus expression system include those described by 
Chai etaL, (1993), Vlasak etaL, (1983), and Lenhard etaL, (1996), which disclosures are hereby 
incorporated by reference in their entireties. 

1 0 Delivery Of The Recombinant Vectors 

To effect expression of the polynucleotides and polynucleotide constructs of the invention, 
the constructs must be delivered into a cell. This delivery may be accomplished in vitro, as in 
laboratory procedures for transforming cell lines, or in vivo or ex vivo, as in the treatment of certain 
diseases states. One mechanism is viral infection where the expression construct is encapsulated in 

15 an infectious viral particle. 

Several non-viral methods for the transfer of polynucleotides into cultured mammalian cells 
are also contemplated by the present invention, and include, without being limited to, calcium 
phosphate precipitation (Graham et al. 9 1973; Chen et al., 1987); DEAE-dextran (Gopal, 1985); 
electroporation (Tur-Kaspa et al., 1986; Potter et al., 1984); direct microinjection (Harland et al., 

20 1985); DNA-loaded liposomes (Nicolau et aL, 1982; Fraley et al, 1979); and receptor-mediated 
transfection. (Wu and Wu, 1987, 1988), which disclosures are hereby incorporated by reference in 
their entireties. Some of these techniques may be successfully adapted for in vivo or ex vivo use. 

Once the expression polynucleotide has been delivered into the cell, it may be stably 
integrated into the genome of the recipient cell. This integration may be in the cognate location and 

25 orientation via homologous recombination (gene replacement) or it may be integrated in a random, 
non-specific location (gene augmentation). In yet further embodiments, the nucleic acid may be 
stably maintained in the cell as a separate, episomal segment of DNA. Such nucleic acid segments 
or "episomes" encode sequences sufficient to permit maintenance and replication independent of or 
in synchronization with the host cell cycle. 

30 One specific embodiment for a method for delivering a protein or peptide to the interior of a 

cell of a vertebrate in vivo comprises the step of introducing a preparation comprising a 
physiologically acceptable carrier and a naked polynucleotide operatively coding for the 
polypeptide of interest into the interstitial space of a tissue comprising the cell, whereby the naked 
polynucleotide is taken up into the interior of the cell and has a physiological effect. This is 

35 particularly applicable for transfer in vitro but it may be applied to in vivo as well. 

Compositions for use in vitro and in vivo comprising a "naked" polynucleotide are 
described in PCT application No. WO 90/1 1092 (Vical Inc.) and also in PCT application No. WO 
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95/1 1307 (Institut Pasteur, INSERM, Universite d'Ottawa) as well as in the articles of Tascon et al. 
(1996) and of Huygen et al., (1996), which disclosures are hereby incorporated by reference in their 
entireties. 

In still another embodiment of the invention, the transfer of a naked polynucleotide of the 
5 invention, including a polynucleotide construct of the invention, into cells may be accomplished 
with particle bombardment (biolistic), said particles being DNA-coated microprojectiles accelerated 
to a high velocity allowing them to pierce cell membranes and enter cells without killing them, such 
as described by Klein et al., (1987), which disclosure is hereby incorporated by reference in its 
entirety. 

10 In a further embodiment, the polynucleotide of the invention may be entrapped in a 

liposome (Ghosh and Bacchawat, 1991; Wong et al., 1980; Nicolau et al., 1987, which disclosures 
are hereby incorporated by reference in their entireties). 

In a specific embodiment, the invention provides a composition for the in vivo production 
of the GENSET polypeptides described herein. It comprises a naked polynucleotide operatively 

1 5 coding for this polypeptide, in solution in a physiologically acceptable carrier, and suitable for 
introduction into a tissue to cause cells of the tissue to express the said protein or polypeptide. 

The amount of vector to be injected to the desired host organism varies according to the site 
of injection. As an indicative dose, it will be injected between 0. 1 and 1 00 ng of the vector in an 
animal body, preferably a mammal body, for example a mouse body. 

20 In another embodiment of the vector according to the invention, it may be introduced in 

vitro in a host cell, preferably in a host cell previously harvested from the animal to be treated and 
more preferably a somatic cell such as a muscle cell. In a subsequent step, the cell that has been 
transformed with the vector coding for the desired GENSET polypeptide or the desired fragment 
thereof is reintroduced into the animal body in order to deliver the recombinant protein within the 

25 body either locally or systemically. 



Secretion vectors 

Some of the GENSET cDNAs or genomic DNAs of the invention may also be used to 
construct secretion vectors capable of directing the secretion of the proteins encoded by genes 
30 inserted in the vectors. Such secretion vectors may facilitate the purification or enrichment of the 
proteins encoded by genes inserted therein by reducing the number of background proteins from 
which the desired protein must be purified or enriched. Exemplary secretion vectors are described 
below. 

The secretion vectors of the present invention include a promoter capable of directing gene 
35 expression in the host cell, tissue, or organism of interest. Such promoters include the Rous 
Sarcoma Virus promoter, the SV40 promoter, the human cytomegalovirus promoter, and other 
promoters familiar to those skilled in the art. 
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A signal sequence from a polynucleotide of the invention, preferably a signal sequences 
selected from the group of signal sequences of SEQ ID NOs: 1-85, 339-400, 406-407, 413-415, 
561-594, and 634-651 and signal sequences of clone inserts of the deposited clone pool is operably 
linked to the promoter such that the mRNA transcribed from the promoter will direct the translation 
5 of the signal peptide. The host cell, tissue, or organism may be any cell, tissue, or organism which 
recognizes the signal peptide encoded by the signal sequence in the GENSET cDNA or genomic 
DNA. Suitable hosts include mammalian cells, tissues or organisms, avian cells, tissues, or 
organisms, insect cells, tissues or organisms, or yeast. 

In addition, the secretion vector contains cloning sites for inserting genes encoding the 
10 proteins which are to be secreted. The cloning sites facilitate the cloning of the insert gene in frame 
with the signal sequence such that a fusion protein in which the signal peptide is fused to the protein 
encoded by the inserted gene is expressed from the mRNA transcribed from the promoter. The 
signal peptide directs the extracellular secretion of the fusion protein. 

The secretion vector may be DNA or RNA and may integrate into the chromosome of the 
15 host, be stably maintained as an extrachromosomal replicon in the host, be an artificial 

chromosome, or be transiently present in the host. Preferably, the secretion vector is maintained in 
multiple copies in each host cell. As used herein, multiple copies means at least 2, 5, 10, 20, 25, 50 
or more than 50 copies per cell. In some embodiments, the multiple copies are maintained 
extrachromosomal ly. In other embodiments, the multiple copies result from amplification of a 
20 chromosomal sequence. 

Many nucleic acid backbones suitable for use as secretion vectors are known to those 
skilled in the art, including retroviral vectors, SV40 vectors, Bovine Papilloma Virus vectors, yeast 
integrating plasmids, yeast episomal plasmids, yeast artificial chromosomes, human artificial 
chromosomes, P element vectors, baculovirus vectors, or bacterial plasmids capable of being 
25 transiently introduced into the host. 

The secretion vector may also contain a polyA signal such that the polyA signal is located 
downstream of the gene inserted into the secretion vector. 

After the gene encoding the protein for which secretion is desired is inserted into the 
secretion vector, the secretion vector is introduced into the host cell, tissue, or organism using 
30 calcium phosphate precipitation, DEAE-Dextran, electroporation, liposome-mediated transfection, 
viral particles or as naked DNA. The protein encoded by the inserted gene is then purified or 
enriched from the supernatant using conventional techniques such as ammonium sulfate 
precipitation, immunoprecipitation, immunochrornatography, size exclusion chromatography, ion 
exchange chromatography, and hplc. Alternatively, the secreted protein may be in a sufficiently 
35 enriched or pure state in the supernatant or growth media of the host to permit it to be used for its 
intended purpose without further enrichment. 

The signal sequences may also be inserted into vectors designed for gene therapy. In such 
vectors, the signal sequence is operably linked to a promoter such that mRNA transcribed from the 
promoter encodes the signal peptide. A cloning site is located downstream of the signal sequence 
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such that a gene encoding a protein whose secretion is desired may readily be inserted into the 
vector and fused to the signal sequence. The vector is introduced into an appropriate host cell. The 
protein expressed from the promoter is secreted extracellularly, thereby producing a therapeutic 
effect. 

5 

Cell Hosts 

Another object of the invention comprises a host cell that has been transformed or 
transfected with one of the polynucleotides described herein, and in particular a polynucleotide 
either comprising a GENSET polypeptide-encoding polynucleotide regulatory sequence or the 

10 polynucleotide coding for a GENSET polypeptide. Also included are host cells that are transformed 
(prokaryotic cells) or that are transfected (eukaryotic cells) with a recombinant vector such as one of 
those described above. However, the cell hosts of the present invention can comprise any of the 
polynucleotides of the present invention. In a preferred embodiment, host cells contain a 
polynucleotide sequence comprising a sequence selected from the group consisting of sequences of 

1 5 SEQ ID NOs: 1-169, 339-455, 561-784, sequences of clone inserts of the deposited clone pool, 

variants and fragments thereof. Preferred host cells used as recipients for the expression vectors of 
the invention are the following: 

a) Prokaryotic host cells: Escherichia coli strains (I.E.DH5-a strain), Bacillus subtilis, 
Salmonella typhimurium, and strains from species like Pseudomonas, Streptomyces and 

2 0 Staphylococcus . 

b) Eukaryotic host cells: HeLa cells (ATCC No.CCL2; No.CCL2. 1 ; No.CCL2.2), Cv 1 
cells (ATCC No.CCL70), COS cells (ATCC No.CRL1650; No.CRL1651), Sf-9 cells (ATCC 
No.CRL171 1), C127 cells (ATCC No. CRL-1804), 3T3 (ATCC No. CRL-6361), CHO (ATCC No. 
CCL-61), human kidney 293. (ATCC No. 45504; No. CRL-1573) and BHK (ECACC No. 

25 84100501; No. 84111301). 

c) Other mammalian host cells. 

The present invention also encompasses primary, secondary, and immortalized 
homologously recombinant host cells of vertebrate origin, preferably mammalian origin and 

30 particularly human origin, that have been engineered to: a) insert exogenous (heterologous) 

polynucleotides into the endogenous chromosomal DNA of a targeted gene, b) delete endogenous 
chromosomal DNA, and/or c) replace endogenous chromosomal DNA with exogenous 
polynucleotides. Insertions, deletions, and/or replacements of polynucleotide sequences may be to 
the coding sequences of the targeted gene and/or to regulatory regions, such as promoter and 

35 enhancer sequences, operably associated with the targeted gene. 

In addition to encompassing host cells containing the vector constructs discussed herein, the 
invention also encompasses primary, secondary, and immortalized host cells of vertebrate origin, 
particularly mammalian origin, that have been engineered to delete or replace endogenous genetic 
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material (e.g., coding sequence), and/or to include genetic material (e.g., heterologous 
polynucleotide sequences) that is operably associated with the polynucleotides of the invention, and 
which activates, alters, and/or amplifies endogenous polynucleotides. For example, techniques 
known in the art may be used to operably associate heterologous control regions (e.g., promoter 
5 and/or enhancer) and endogenous polynucleotide sequences via homologous recombination, see, 
e.g., U.S. Patent No. 5,641,670, issued June 24, 1997; International Publication No. WO 96/2941 1, 
published September 26, 1996; International Publication No. WO 94/12650, published August 4, 
1994; Koller et al., (1989); and Zijlstra et al (1989) (the disclosures of each of which are 
incorporated by reference in their entireties). 

1 0 The present invention further relates to a method of making a homologously recombinant 

host cell in vitro or in vivo, wherein the expression of a targeted gene not normally expressed in the 
cell is altered. Preferably the alteration causes expression of the targeted gene under normal growth 
conditions or under conditions suitable for producing the polypeptide encoded by the targeted gene. 
The method comprises the steps of: (a) transfecting the cell in vitro or in vivo with a polynucleotide 

1 5 construct, said polynucleotide construct comprising; (i) a targeting sequence; (ii) a regulatory 
sequence and/or a coding sequence; and (iii) an unpaired splice donor site, if necessary, thereby 
producing a transfected cell; and (b) maintaining the transfected cell in vitro or in vivo under 
conditions appropriate for homologous recombination. 

The present invention further relates to a method of altering the expression of a targeted 

20 gene in a cell in vitro or in vivo wherein the gene is not normally expressed in the cell, comprising 
the steps of: (a) transfecting the cell in vitro or in vivo with a polynucleotide construct, said 
polynucleotide construct comprising: (i) a targeting sequence; (ii) a regulatory sequence and/or a 
coding sequence; and (iii) an unpaired splice donor site, if necessary, thereby producing a 
transfected cell; and (b) maintaining the transfected cell in vitro or in vivo under conditions 

25 appropriate for homologous recombination, thereby producing a homologously recombinant cell; 
and (c) maintaining the homologously recombinant cell in vitro or in vivo under conditions 
appropriate for expression of the gene. 

The present invention further relates to a method of making a polypeptide of the present 
invention by altering the expression of a targeted endogenous gene in a cell in vitro or in vivo 

30 wherein the gene is not normally expressed in the cell, comprising the steps of: a) transfecting the 
cell in vitro with a polynucleotide construct, said polynucleotide construct comprising: (i) a 
targeting sequence; (ii) a regulatory sequence and/or a coding sequence; and (iii) an unpaired splice 
donor site, if necessary, thereby producing a transfected cell; (b) maintaining the transfected cell in 
vitro or in vivo under conditions appropriate for homologous recombination, thereby producing a 

35 homologously recombinant cell; and c) maintaining the homologously recombinant cell in vitro or 
in vivo under conditions appropriate for expression of the gene thereby making the polypeptide. 

The present invention further relates to a polynucleotide construct which alters the 
expression of a targeted gene in a cell type in which the gene is not normally expressed. This 
occurs when the polynucleotide construct is inserted into the chromosomal DNA of the target cell, 
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wherein said polynucleotide construct comprises: a) a targeting sequence; b) a regulatory sequence 
and/or coding sequence; and c) an unpaired splice-donor site, if necessary. Further included are a 
polynucleotide construct, as described above, wherein said polynucleotide construct further 
comprises a polynucleotide which encodes a polypeptide and is in-frame with the targeted 
5 endogenous gene after homologous recombination with chromosomal DNA. 

The compositions may be produced, and methods performed, by techniques known in the 
art, such as those described in U.S. Patent NOs: 6,054,288; 6,048,729; 6,048,724; 6,048,524; 
5,994,127; 5,968,502; 5,965,125; 5,869,239; 5,817,789; 5,783,385; 5,733,761; 5,641,670; 
5,580,734 ; International Publication NOs:W096/2941 1, WO 94/12650; and scientific articles 
10 described by Koller et aL, (1994). (The disclosures of each of which are incorporated by reference 
in their entireties). 

GENSET gene expression in mammalian cells, preferably human cells, may be rendered 
defective, or alternatively may be altered by replacing endogenous GENSET polypeptide-encoding 
genes in the genome of an animal cell by a GENSET polypeptide-encoding polynucleotide 

1 5 according to the invention. These genetic alterations may be generated by homologous 
recombination using previously described specific polynucleotide constructs. 

Mammal zygotes, such as murine zygotes may be used as cell hosts. For example, murine 
zygotes may undergo microinjection with a purified DNA molecule of interest, for example a 
purified DNA molecule that has previously been adjusted to a concentration ranging from 1 ng/ml - 

20 for BAC inserts- to 3 ng/ul -for PI bacteriophage inserts- in 10 mM Tris-HCl, pH 7.4, 250 uM 
EDTA containing 100 mM NaCl, 30 \xM spermine, and70 \iM spermidine. When the DNA to be 
microinjected has a large size, polyamines and high salt concentrations can be used in order to avoid 
mechanical breakage of this DNA, as described by Schedl etal (1993b), which disclosure is hereby 
incorporated by reference in its entirety. 

25 Any one of the polynucleotides of the invention, including the polynucleotide constructs 

described herein, may be introduced in an embryonic stem (ES) cell line, preferably a mouse ES 
cell line. ES cell lines are derived from pluripotent, uncommitted cells of the inner cell mass of pre- 
implantation blastocysts. Preferred ES cell lines are the following: ES-E14TG2a (ATCC No.CRL- 
1 821), ES-D3 (ATCC No.CRL1934 and No. CRL-1 1632), YS001 (ATCC No. CRL-1 1776), 36.5 

30 (ATCC No. CRL-1 1116). ES cells are maintained in an uncommitted state by culture in the 
presence of growth- inhibited feeder cells which provide the appropriate signals to preserve this 
embryonic phenotype and serve as a matrix for ES cell adherence. Preferred feeder cells are 
primary embryonic fibroblasts that are established from tissue of day 13- day 14 embryos of 
virtually any mouse strain, that are maintained in culture, such as described by Abbondanzo et al. 

35 (1993) and are growth-inhibited by irradiation, such as described by Robertson (1987), or by the 
presence of an inhibitory concentration of LIF, such as described by Pease and Williams (1990), 
which disclosures are hereby incorporated by reference in their entireties. 

The constructs in the host cells can be used in a conventional manner to produce the gene 
product encoded by the recombinant sequence. 
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Following transformation of a suitable host and growth of the host to an appropriate cell 
density, the selected promoter is induced by appropriate means, such as temperature shift or 
chemical induction, and cells are cultivated for an additional period. Cells are typically harvested 
by centrifugation, disrupted by physical or chemical means, and the resulting crude extract retained 
5 for further purification. Microbial cells employed in the expression of proteins can be disrupted by 
any convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or use of 
cell lysing agents. Such methods are well known by the skilled artisan. 

Transgenic Animals 

The terms "transgenic animals" or "host animals" are used herein to designate animals that 

10 have their genome genetically and artificially manipulated so as to include one of the nucleic acids 
according to the invention. Preferred animals are non-human mammals and include those 
belonging to a genus selected from Mus (e.g. mice), Rattus (e.g. rats) and Oryctogalus (e.g. rabbits) 
which have their genome artificially and genetically altered by the insertion of a nucleic acid 
according to the invention. In one embodiment, the invention encompasses non-human host 

15 mammals and animals comprising a recombinant vector of the invention or a GENSET gene 
disrupted by homologous recombination with a knock out vector. 

The transgenic animals of the invention all include within a plurality of their cells a cloned 
recombinant or synthetic DNA sequence, more specifically one of the purified or isolated nucleic 
acids comprising a GENSET polypeptide coding sequence, a GENSET polynucleotide regulatory 

20 sequence, a polynucleotide construct, or a DNA sequence encoding an antisense polynucleotide 
such as described in the present specification. 

Generally, a transgenic animal according the present invention comprises any of the 
polynucleotides, the recombinant vectors and the cell hosts described in the present invention. In a 
first preferred embodiment, these transgenic animals may be good experimental models in order to 

25 study the diverse pathologies related to the dysregulation of the expression of a given GENSET 

gene, in particular the transgenic animals containing within their genome one or several copies of an 
inserted polynucleotide encoding a native GENSET polypeptide, or alternatively a mutant GENSET 
polypeptide. 

In a second preferred embodiment, these transgenic animals may express a desired 
30 polypeptide of interest under the control of the regulatory polynucleotides of the GENSET gene, 

leading to high yields in the synthesis of this protein of interest, and eventually to tissue specific 

expression of the protein of interest. 

The design of the transgenic animals of the invention may be made according to the 

conventional techniques well known from the one skilled in the art. For more details regarding the 
35 production of transgenic animals, and specifically transgenic mice, it may be referred to US Patents 

Nos 4,873,191, issued Oct. 10, 1989; 5,464,764 issued Nov 7, 1995; and 5,789,215, issued Aug 4, 

1998; these documents being herein incorporated by reference to disclose methods producing 

transgenic mice. 
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Transgenic animals of the present invention are produced by the application of procedures 
which result in an animal with a genome that has incorporated exogenous genetic material. The 
procedure involves obtaining the genetic material which encodes either a GENSET polypeptide 
coding sequence, a GENSET polynucleotide regulatory sequence, or a DNA sequence encoding a 
5 GENSET polynucleotide antisense sequence, or a portion thereof, such as described in the present 
specification. A recombinant polynucleotide of the invention is inserted into an embryonic or ES 
stem cell line. The insertion is preferably made using electroporation, such as described by Thomas 
et al. (1987), which disclosure is hereby incorporated by reference in its entirety. The cells 
subjected to electroporation are screened (e.g. by selection via selectable markers, by PCR or by 
10 Southern blot analysis) to find positive cells which have integrated the exogenous recombinant 
polynucleotide into their genome, preferably via an homologous recombination event. An 
illustrative positive-negative selection procedure that may be used according to the invention is 
described by Mansour et al. (1988), which disclosure is hereby incorporated by reference in its 
entirety. 

15 The positive cells are then isolated, cloned and injected into 3.5 days old blastocysts from 

mice, such as described by Bradley (1987), which disclosure is hereby incorporated by reference in 
its entirety. The blastocysts are then inserted into a female host animal and allowed to grow to 
term. Alternatively, the positive ES cells are brought into contact with embryos at the 2.5 days old 
8-16 cell stage (morulae) such as described by Wood et al. (1993), or by Nagy et al. (1993), which 

20 disclosures are hereby incorporated by reference in their entireties, the ES cells being internalized to 
colonize extensively the blastocyst including the cells which will give rise to the germ line. 

The offspring of the female host are tested to determine which animals are transgenic e.g. 
include the inserted exogenous DNA sequence and which ones are wild type. 

Thus, the present invention also concerns a transgenic animal containing a nucleic acid, a 

25 recombinant expression vector or a recombinant host cell according to the invention. 

In another embodiment, transgenic animals are produced by microinjecting polynucleotides 
ares microinjected into a fertilized oocyte. Typically, fertilized oocytes are microinjected using 
standard techniques, and then cultured in vitro until a "pre-implantation embryo" is obtained. Such 
pre-implantation embryos preferably contain approximately 16 to 150 cells. Methods for culturing 

30 fertilized oocytes to the pre-implantation stage are described, e.g., by Gordon et al. ((1984) Methods 
in Enzymology, 101,414); Hogan et al. ((1986) in Manipulating the mouse embryo. A Laboratory 
Manual. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y) (for the mouse embryo); 
Hammer et al. ((1985) Nature, 315, 680) (for rabbit and porcine embryos); Gandolfi et al. ((1987) J. 
Reprod. Fert. 81, 23-28); Rexroad et al. ((1988) J. Anim. Sci. 66, 947-953) (for ovine embryos); and 

35 Eyestone et al. ((1989) J. Reprod. Fert. 85, 715-720); Camous et al. ((1984) J. Reprod. Fert. 72, 
779-785); and Heyman et al. ((1987) Theriogenology 27, 5968) (for bovine embryos); the 
disclosures of each of which are incorporated herein in their entireties. Pre-implantation embryos 
are then transferred to an appropriate female by standard methods to permit the birth of a transgenic 
or chimeric animal, depending upon the stage of development when the transgene is introduced. 
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As the frequency of transgene incorporation is often low, the detection of transgene 
integration in pre-implantation embryos is often desirable using any of the herein-described 
methods. Any of a number of methods can be used to detect the presence of a transgene in a pre- 
implantation embryo. For example, one or more cells may be removed from the pre-implantation 
5 embryo, and the presence or absence of the transgene in the removed cell or cells can be detected 
using any standard method e.g. PCR. Alternatively, the presence of a transgene can be detected in 
utero or post partum using standard methods. 

In a particularly preferred embodiment of the present invention, transgenic mammals are 
generated that secrete recombinant GENSET polypeptides in their milk. As the mammary gland is 

10 a highly efficient protein-producing organ, such methods can be used to produce protein 

concentrations in the gram per liter range, and often significantly more. Preferably, expression in 
the mammary gland is accomplished by operably linking the polynucleotide encoding the GENSET 
polypeptide to a mammary gland specific promoter and, optionally, other regulatory elements. 
Suitable promoters and other elements include, but are not limited to, those derived from 

15 mammalian short and long WAP, alpha, beta, and kappa, casein, alpha and beta lactoglobulin, beta- 
CN 5' genes, as well as the the mouse mammary tumor virus (MMTV) promoter. Such promoters 
and other elements may be derived from any mammal, including, but not limited to, cows, goats, 
sheep, pigs, mice, rabbits, and guinea pigs. Promoter and other regulatory sequences, vectors, and 
other relevant teachings are provided, e.g., by Clark (1998) J Mammary Gland Biol Neoplasia 

20 3:337-50; Jost et al. (1999) Nat. Biotechnol 17:160-4; U.S. Patent Nos. 5,994,616; 6,140,552; 
6,013,857; Sohn et al. (1999) DNA Cell Biol. 18:845-52; Kim et al. (1999) J. Biochem. (Japan) 
126:320-5; Soulier et al. (1999) Euro. J. Biochem. 260:533-9; Zhang et al. (1997) Chin. J. Biotech. 
13:271-6; Rijnkels et al. (1998) Transgen. Res. 7:5-14; Korhonen et al. (1997) Euro. J. Biochem. 
245:482-9; Uusi-Oukari et al. (1997) Transgen. Res. 6:75-84; Hitchin et al. (1996) Prot. Expr. Purif. 

25 7:247-52; Platenburg et al. (1994) Transgen. Res. 3:99-108; Heng-Cherl et al. (1993) Animal 

Biotech. 4:89-107; and Christa et al. (2000) Euro. J. Biochem. 267:1665-71; the entire disclosures 
of each of which is herein incorporated by reference. 

In another embodiment, the polypeptides of the invention can be produced in milk by 
introducing polynucleotides encoding the polypeptides into somatic cells of the mammary gland in 

30 vivo, e.g. mammary secreting epithelial cells. For example, plasmid DNA can be infused through 
the nipple canal, e.g. in association with DEAE-dextran (see, e.g., Hens et al. (2000) Biochim. 
Biophys. Acta 1523:161-171), in association with a ligand that can lead to receptor-mediated 
endocytosis of the construct (see, e.g., Sobolev et al. (1998) 273:7928-33), or in a viral vector such 
as a retroviral vector, e.g. the Gibbon ape leukemia virus (see, e.g., Archer et al. (1994) PNAS 

35 91 :6840-6844). In any of these embodiments, the polynucleotide may be operably linked to a 
mammary gland specific promoter, as described above, or, alternatively, any strongly expressing 
promoter such as CMV or MoMLV LTR. 

The suitability of any vector, promoter, regulatory element, etc. for use in the present 
invention can be assessed beforehand by transfecting cells such as mammary epithelial cells, e.g. 
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MacT cells (bovine mammary epithelial cells) or GME cells (goat mammary epithelial cells), in 
vitro and assessing the efficiency of transfection and expression of the transgene in the cells. 

For in vivo administration, the polynucleotides can be administered in any suitable 
formulation, at any of a range of concentrations (e.g. 1-500 )tig/ml, preferably 50-100 jig/ml), at any 
5 volume (e.g. 1-100 ml, preferably 1 to 20 ml), and can be administered any number of times (e.g. 1, 
2, 3, 5, or 10 times), at any frequency (e.g. every 1 , 2, 3, 5, 10, or any number of days). Suitable 
concentrations, frequencies, modes of administration, etc. will depend upon the particular 
polynucleotide, vector, animal, etc., and can readily be determined by one of skill in the art. 

In a preferred embodiment, a retroviral vector such as as Gibbon ape leukemia viral vector 

10 is used, as described in Archer et al. ((1994) PNAS 91:6840-6844). As retroviral infection typically 
requires cell division, cell division in the mammary glands can be stimulated in conjunction with 
the administration of the vector, e.g. using a factor such as estrodiol benzoate, progesterone, 
reserpine, or dexamethasone. Further, retroviral and other methods of infection can be facilitated 
using accessory compounds such as polybrene. 

1 5 In any of the herein-described methods for obtaining GENSET polypeptides from milk, the 

quantity of milk obtained, and thus the quantity of GENSET polypeptides produced, can be 
enhanced using any standard method of lacation induction, e.g. using hexestrol, estrogen, and/or 
progesterone. 

The polynucleotides used in such embodiments can either encode a full-length GENSET 
20 protein or a GENSET fragment. Typically, the encoded polypeptide will include a signal sequence 
to ensure the secretion of the protein into the milk. 

Recombinant Cell Lines Derived From The Transgenic Animals Of The Invention: 

A further object of the invention comprises recombinant host cells obtained from a 
25 transgenic animal described herein. In one embodiment the invention encompasses cells derived 
from non-human host mammals and animals comprising a recombinant vector of the invention or a 
GENSET gene disrupted by homologous recombination with a knock out vector. 

Recombinant cell lines may be established in vitro from cells obtained from any tissue of a 
transgenic animal according to the invention, for example by transfection of primary cell cultures 
30 with vectors expressing owc-genes such as SV40 large T antigen, as described by Chou (1989), and 
Shay et al. (1991), which disclosures are hereby incorporated by reference in their entireties. 

Uses of pol ypeptides of the invention 

The polypeptides and polynucleotides of the present invention can be used in any of a large 
35 number of ways, including numerous in vitro and in vivo uses. Specific uses for many of the 
herein-described polypeptides and polynucleotides are described in detail below. 
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Protein of SEP IDNO:255 (internal designation 500762786 255-24-5-0-A2-R 104) 

The cDNA of clone 500762786_255-24-5-0-A2-R_104 (SEQ ID NO:86) encodes the 
human EDR4 protein 

5 LFPAPAPPPAPAFAPPPKVPSPERSAPRVPLPSPQPSYPFRPAASGGTPPPACLPPAQPCQGSP 
AMNLFRFLGDLSHLLAIILLLLKIWKSRSCAAHPQLPLSFCLSVCLSVSLSLSXSLSLSFSVSK 
KKKK (SEQ ID NO:255). It will be appreciated that all characteristics and uses of the 
polynucleotides of SEQ ID NO:86 and polypeptides of SEQ ID NO:355, described throughout the 
present application also pertain to the human cDNA of clone 500762786 255-24-5-0-A2-R104 
10 and polypeptides encoded thereby. Polypeptide fragments having a biological activity described 
herein and polynucleotides encoding the same are included in the present invention. Related 
polynucleotide and polypeptide sequences included in the present invention are SEQ ID NOs:406 
and 520. 

The normal functioning of the eukaryotic cell requires that all newly synthesized proteins 

15 be correctly folded, modified, and delivered to specific inter and extracellular sites. Newly 
synthesized membrane and secretory proteins enter a cellular sorting and distribution network 
during or immediately after synthesis (cotranslationally or posttranslationally) and are routed to 
specific locations inside and outside of the cell. The initial compartment in this process is the 
endoplasmic reticulum (ER) where proteins undergo modifications such as glycosylation, disulfide 

20 bond formation, and assembly into oligomers. The proteins are then transported through an 
additional series of membrane-bound compartments which include the various cisternae of the 
Golgi complex, where further carbohydrate modifications occur. Transport between compartments 
occurs by means of vesicles that bud and fuse in a specific manner; once within the secretory 
pathway, proteins do not have to cross a membrane to reach the cell surface. 

25 The complexity of this system has advantages for the cell because it allows proteins to fold 

and mature in closed compartments that contain the appropriate enzyme catalysts. It is, however, 
dependent on sorting mechanisms that position the enzymes correctly and maintain them in place. 

The first organelle in this system, the ER, contains multiple enzymes involved in protein 
structure modifications. Among these are BiP (binding protein) which directs the correct folding of 

30 proteins and, PDI (protein disulfide isomerase) and a homologue of the 90 kDa heat-shock protein, 
both of which catalyze the formation and rearrangement of disulfide bonds (Gething, M. J. and 
Sambrook, J. (1992) Nature 355:33-45). These abundant soluble proteins must be retained in the ER 
and must be distinguished from the newly synthesized secretory proteins which are rapidly 
transported to the Golgi apparatus. The signal for retention in the ER in mammalian cells consists of 

35 the tetrapeptide sequence, KDEL, located at the carboxy terminus of proteins. This sequence was 
first identified when the sequences of rat BiP and PDI were compared and it was subsequently 
found at the carboxy terminus of other luminal ER proteins from a number of species (Munro, S. 
(1986) Cell 46:291-300; Pelham, H. R. (1989) Ann. Rev. Cell. Biol. 5:1-23). Proteins containing 
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this sequence leave the ER but are quickly retrieved from the early Golgi compartment and returned 
to the ER, while proteins without this signal continue through the distribution pathway. 

Two endoplasmic retrieval receptors were first identified in S. cerevesiae; two human 
endoplasmic retrieval receptors were subsequently isolated by the use of degenerate PCR primers 
5 based on the S. cerevesiae sequences (Hardwick, K. G. (1990) EMBO J. 9:623-630; Semenza, J. C. 
(1990) Cell 61:1349-1357; Lewis, M. J. and Pelham, H. R. (1990) Nature 348:162-163; Lewis, M. 
J. and Pelham, H. R. (1992) J. Mol. Biol. 226:913-916). Comparisons of these sequences shows that 
they consist of a conserved 7-transmembrane domain structure with only short loops in the cell 
cytoplasm and the ER lumen. Studies with these endoplasmic retrieval receptors show that ligand 

10 binding controls the movement of the receptor; when expressed in COS cells, the human receptor is 
normally concentrated in the Golgi, but moves to the ER when bound to a ligand such as KDEL- 
tagged hen lysozyme (Lewis, M. J. and Pelham, H. R. (1992) Cell 68:353-364). 

The ER retrieval function of these molecules serves to maintain the pool of enzymes in the 
ER that are necessary to perform protein structure modifications, retains newly synthesized proteins 

1 5 in the ER until they have been correctly modified, and regulates the structure of the Golgi 

apparatus. Saccharomyces cerevisiae cells that lack an ER retrieval receptor (Erd2) have a defective 
Golgi apparatus and fail to grow. Analysis of yeast Erd2 mutants suggests that their growth requires 
both the retention of multiple proteins in the ER and the selective removal of specific proteins from 
the Golgi (Townsley, F. M. (1994) J. Cell Biol. 127:21-28). Overexpression of a human ER 

20 retrieval receptor in COS cells results in hyperactive retrograde traffic from the Golgi to the ER 
leading to a loss of the Golgi structure and the breakdown of the secretory pathway (Hsu V. W. 
(1992) Cell 69:625-635). 

Disruptions in the cellular secretory pathway have been implicated in several human 
diseases. In familial hypercholesterolemia the low density lipoprotein receptors remain in the ER, 

25 rather than moving to the cell surface (Pathak, R. K. (1988) J. Cell Biol. 106:1831-1841). A form of 
congenital hypothyroidism is produced by a deficiency of thyroglobulin, the thyroid prohormone. In 
this disease the thyroglobulin is incorrectly folded and is therefore retained in the ER (Kim, P. S. 
(1996) J.Cell Biol. 133:517-527). Mutant forms of proteolipid protein (PLP) have been examined as 
they play a role in generating dysmyelinating or hypomyelinating diseases. In this case, the 

30 mutations that result in disease are mutations that arrest transport of PLP in the ER and the early 
Golgi; the subsequent accumulation of PLP in the ER results in rapid oligodendrocyte death (Gow, 
A. (1994) J. Neurosci. Res. 37:574-583). 

The human ER retrieval receptor function is necessary for processing and presentation of 
specific antigens to T cells. Many antigens must be processed intracellularly before they can be 

35 presented, in association with major histocompatability complex (MHC) molecules at the cell 
surface, for recognition by the antigen-specific receptor of T cells. Disruption of the ER retrieval 
receptor function with an antibiotic, Brefeldin A, abolishes the ability of a cell to present these 
specific antigen complexes to T cells. These antigenic proteins must be retained in the ER for 
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cleavage to smaller peptides which can then bind to MHC molecules and be released for 
presentation at the cell surface. (Kakiuchi, T. (1991) J. Immunol. 147:3289-3295). 

The discovery of polynucleotides encoding a novel human KDEL receptor, and the 
molecules themselves, provides the means to further investigate the regulation of the cellular 
5 protein secretory pathway. Discovery of molecules related to a novel human KDEL receptor 
satisfies a need in the art by providing a means or a tool for the study of this pathway and the 
diseases that involve the dysfunction of this pathway. 

In an embodiment of the present invention, ERD4 polypeptides of the present invention are 
used to purify KDEL containing proteins and other homologous proteins with similar signals for ER 

10 retention such as "HDEL", "DDEL", "ADEL", "SDEL", "RDEL", "KEEL", "QEDL", "HIEL", 

"HTEL" and "KQDL". This may be carried out by covalently or non-covalently attached the EDR4 
polypeptides of the present invention to a column or other solid support using techniques well 
known in the art (e.g., affinity chromatography, panning, etc.). Once bound to the ERD4 
polypeptide, the complex is washed to remove contaminants. The target protein is released using 

1 5 increasing salt concentrations either in a gradient or step type purification. The bound target protein 
may also be released from the ERD4 polypeptide by a single step up in salt concentration. 

In another embodiment of the present invention, the EDR4 polypeptides of the present 
invention are used to detect KDEL containing proteins and other homologous sequences as 
described above by methods comprising the steps of contacting KDEL or other homologous 

20 sequences with an EDR4 polypeptide under conditions that allow binding to said sequence, and 
detecting the presence of bound EDR4. The presence of bound EDR4 can be detected using 
methods known in the art, such as by labeling EDR4 directly or indirectly. Bound EDR4 can be 
detected, for example, by using an antibody that specifically binds to EDR4 or another EDR4 - 
binding compound that is detectable directly or indirectly. 

25 Preferred ERD4 polypeptides for binding KDEL containing proteins and other homologous 

sequences described above comprise the amino acid sequence -KIWK- or - 
MNLFRFLGDLSHLLAIILLLLKIWKSRSCA-. 

The present invention is further directed to a transformant comprising the following 
expression units in a co-expressible state: an expression unit containing a gene coding for an ERD4 

30 polypeptide which is capable of binding to a protein localizing in the endoplasmic reticulum and 
having a signal for staying therein; an expression unit containing a gene coding for said protein 
localizing in endoplasmic reticulum; and an expression unit containing a foreign gene coding for a 
polypeptide which is a subject of function of said protein localizing in endoplasmic reticulum, and 
to a transformant comprising, in a co-expressible state, a fusion gene which is composed of a DNA 

35 fragment coding for a human serum albumin prepro-sequence and a foreign gene coding for a useful 
polypeptide. The present invention is also directed to a process for producing said polypeptide by 
co-expressing said genes in said transformant such that the polypeptide is predominantly secreted 
out of the transformant cell. Consequently, the invention has an advantage of improving the 
productivity of said polypeptide. 
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More particularly, the invention relates to: A transformed yeast cell comprising the 
following expression units integrated on a yeast chromosome in a co-expressible state: a first 
expression unit containing a gene coding for a receptor for an endoplasmic reticulum retention 
signal, wherein the receptor is the receptor protein ERD4 or a fragment thereof which is capable of 
5 binding to a retention signal selected from the group consisting of "KDEL", "HDEL", "DDEL", 
"ADEL", "SDEL", "RDEL", "KEEL", "QEDL", "HIEL", "HTEL" and "KQDL". and a second 
expression unit containing a gene encoding a protein disulfide isomerase, wherein said isomerase 
comprises an endoplasmic reticulum retention signal, or a gene encoding a fusion protein 
comprising the amino acid sequence of said isomerase and a human serum albumin prepro- 
10 sequence. These methods can be carried out using methods known in the art or described in U.S. 
Patent 5,578,466, incorporated herein by reference in its entirety. 

Proteins of SEP 1DNQ:193 and 194 (internal designation 585770 2 15-1 6-5-0-E8-F and 
123996 140-002-5-0-B4-F) 

15 The cDNA of clones 585770_215-16-5-0-E8-F (SEQ ID NO:24) and 123996J 40-002-5-0- 

B4-F (SEQ ID NO:25) encode the human Smooth Muscle and Pain Effector (SMPE) proteins: 
MRGATRVSIMLLLVTVSDCAVITGACERDVQCGAGTCCAISLWLRGLRMCTPLGRXGEEC 
HPGSHKIPFFRKRKHHTCPCLPNLLCSRFPDGRYRCSMDLKNINF (SEQ ID NO: 193) and 
MRGATRVSIMLLLVTVSDCAVITGACERDVQCGAGTCCAISLWLRGLRMCTPLGREGEEC 

20 HPG SHKI P FFRKRKHHTCPCLPNLLC SRFPDGR YRC SMDLKN INF (SEQ ID NO: 1 94), 

respectively. It will be appreciated that all characteristics and uses of the polynucleotides of SEQ 
ID NOs:24 and 25 and polypeptides of SEQ ID NO: 193 and 194, described throughout the present 
application also pertain to the human cDNA of clones 585770 215-16-5-0-E8-F and 123996_140- 
002-5-0-B4-F, and the polypeptides encoded thereby. Polypeptide fragments having a biological 

25 activity described herein and polynucleotides encoding the same are also included in the present 
invention. Related polynucleotide and polypeptide sequences included in the present invention are 
SEQ ID NOs:360 and 447. 

SMPE contracts longitudinal ileal muscle and distal colon, and relaxes the proximal colon. 
SMPE binds with a high affinity to both ileum and brain membranes. Therefore, included as 

30 embodiments of the present invention is a method of causing gastrointestinal smooth muscle cells to 
contract, in vitro or in vivo, comprising the steps of contacting said cells with a contracting effective 
amount of an SMPE polypeptide. Preferrably, the gastrointestinal smooth muscle cells are those of 
the longitudinal ileal or distal colon. A further embodiments of the present invention is a method of 
causing gastrointestinal smooth muscle cells to relax comprising the steps of contacting said cells 

35 with a relaxing effective amount of an SMPE polypeptide. Preferrably, the gastrointestinal smooth 
muscle cells are proximal colon cells. SMPE can also be used in the same manner to contract 
uterine cells. Therefore, included in the present invention is a method of causing uterine smooth 
muscle cells to contract comprising contacting said cells with a contracting effective amount of an 
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SMPE polypeptide. Further included in the present invention is a method of causing smooth muscle 
cells (e.g., bladder, vascular) to contract comprising contacting said cells with a contracting 
effective amount of an SMPE polypeptide. Further included in the present invention is a method of 
inhibiting angiogenesis comprising contacting vascular endothelial cells with an angiogenesis 
5 inhibiting effective amount of an SMPE polypeptide. The SMPE anti-angiogenic affect can be 
measured using assays known in the art. For example, the anti-angiogenic effect in vivo can be 
assayed by using the 10-day-old embryo chick chorioallantoic membrane model. 

SMPE binds with a high affinity to both ileum and brain membranes. Thefore, as a further 
embodiment of the present invention is a method of binding an SMPE polypeptide to ileum or brain 

10 membranes. The method can be further used as a method of detecting ileum or brain membranes 
comprising the steps of contacting ileum or brain membranes with an SMPE polypeptide under 
conditions that allow binding to said membranes, and detecting the presence of SMPE. The 
presence of SMPE can be detected using methods known in the art, such as by labeling SMPE 
directly or indirectly. Bound SMPE can be detected, for example, by using an antibody that 

1 5 specifically binds to SMPE or another SMPE-binding compound that is detectable directly or 
indirectly. 

SMPE is also expressed in spermatocytes. Therefore, a further embodiment of the present 
invention is a method of detecting testes or spermatocytes by detecting an SMPE polypeptide or 
nucleic acid. An SMPE polypeptide can be detected using anti-SMPE antibodies or other SMPE- 

20 binding compounds. SMPE polynucleotides, such as mRNA, can be detected using methods known 
in the art such as PCR (RT-PCR), hybridization (Northern blot analysis), etc. 

SMPE elicits hyperalgesia when it contacts the GNS, e.g., the brain. Therefore, the present 
invention includes a method of causing hyperalgesia comprising contacting the CNS with a 
hyperalgesia effecting amount of an SMPE polypeptide. SMPE can be delivered to the CNS using 

25 methods well known in the art including those described in PCT application WO9906060, 

incorporated herein by reference in its entirety. Using the methods of WO9906060, the TGF-alpha 
or other polypeptide that binds the epidermal growth factor (EGF) receptor, is substituted with an 
SMPE polypeptide of the present invention. 

Further included in the present invention are methods of inhibiting the above SMPE 

30 activities using an inhibitor of SMPE. A preferred inhibitor of SMPE is an anti-SMPE antibody. 
Thus, an embodiment of the present invention is a method of inhibiting smooth muscle contraction 
(bladder, gastrointestional cells, uterine) or pain comprising the step of contacting said cells with an 
effective contractive or pain inhibiting amount of an anti-SMPE antibody or other SMPE inhibitor. 
The invention further relates to a method of screening for test compounds that bind and/or 

35 inhibit an SMPE activity above comprising the steps of contacting an SMPE polypeptide with said 
test compound and detecting or measuring whether said test compound binds said SMPE 
polypeptide. Alternatively, the method comprises the steps of contacting an SMPE polypeptide 
with a binding target (e.g., smooth muscle cells or brain cells) of said SMPE polypeptide in the 
presence of a test compound, and detecting or measuring the binding of the SMPE polypeptide to 
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said binding target, wherein a difference in the amount of said binding in the presence of said test 
compound relative to the amount of binding in the absence of the test compound indicates that the 
test compound modulates, preferably inhibits, the binding of said polypeptide to said binding target. 
The method may alternatively comprise the steps of contacting an SMPE polypeptide with a 
5 binding target in the presence of a test compound, wherein the binding of said SMPE polypeptide 
with said binding target elicits or causes a biological activity (e.g., activities described above) which 
is detected or measured, and further wherein a difference in the level of said biological activity in 
the presence of the test compound relative to the amount of biological activity in the absence of the 
test compound indicates that the test compound modulates, preferably inhibits or activates, the 
1 0 biological activity of said SMPE polypeptide. 

Preferred SMPE polypeptides for use in the methods described herein include the amino 
acid sequences — 

AVITGACERDVQCGAGTCCAISLWLRGLRMCTPLGREGEECHPGSHKIPFFRKRKHH- or - 
TGACERDVQCGAGTCCAISLWLRGLRMCT- of SEQ ID NO: 193 or 194. 

15 

Protein of SEP IDNO:305 (internal designation 500691428 255-2-5-0-D4-R 104) 

The human cDNA of clone 500691428_255-2-5-0-D4-R_104 (SEQ ID NO:136) encodes 
the human VESICLE-ASSOCIATED MEMBRANE PROTEIN 10 or VAMP-10 protein: 
MSATAATAPPAAPAGEGGPPAPPPNLTSNRRLQQTQAQVDEVVDIMRWVDKVLERDQKL 

20 SELDDRADALQAGPSQFETSAAKLKRKYWWKNLKMMIILGVICAIILIIIIVYFST (SEQ ID 
NO:305). It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NO: 136 and polypeptides of SEQ ID NO:305 described throughout the present application also 
pertain to the human cDNA of clone 50069 1428_255-2-5-0-D4-R_l 04 and the polypeptides 
encoded thereby. Polypeptide fragments having a biological activity described herein and 

25 polynucleotides encoding the same are also included in the present invention. Related 

polynucleotide and polypeptide sequences included in the present invention are SEQ ID NOs:432 
and 546. 

VAMP-10 is an integral membrane protein involved in the movement of vesicles from the 
plasmalemma of one cell, across the synapse, to the plasma membrane of the receptive neuron. This 

30 regulated vesicle trafficking pathway and the endocytotic process may be blocked by the highly 
specific action of clostridial, tetanus toxin (TeTx) and botulinum toxin (BoNT) and other 
metalloendoprotease neurotoxins which prevents neurotransmitter release by cleaving VAMPs. 
VAMP-10 is important in membrane trafficking. It participates in axon extension via exocytosis 
during development, in the release of neurotransmitters and modulatory peptides, and in 

35 endocytosis. The tightly-regulated synaptic vesicle cycle at the nerve terminal consists of the 
formation of synaptic vesicles, the docking of vesicles comprising VAMP-10 to the presynaptic 
plasma membrane, the fusion of these membranes and consequent neurotransmitter release, 
endocytosis of the empty vesicles and the regeneration of fresh vesicles. Endocytotic vesicular 
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transport includes such intracellular events as the fusions and fissions of the nuclear membrane, 
endoplasmic reticulum, Golgi apparatus, and various inclusion bodies such as peroxisomes or 
lysosomes. 

VAMP- 10, like other VAMPs, has a three domain organization. The domains include a 
5 variable proline-rich, N-terminal sequence, a highly conserved central hydrophilic core of amino 
acids, and a hydrophobic sequence of amino acids presumed to be the membrane anchor. 

In one aspect, the invention includes a VAMP- 10 polypeptide composition for use in 
delivering a second composition, preferably nucleic acids, polypeptides, or small molecules such as 
therapeutic drugs, to target biological cells either in vitro or in vivo. The composition comprises a 

10 VAMP-10 polypeptide as a first molecule and a second molecule. The second molecule may, if 
desirable, be covalently or non-covalently attached or fused to the VAMP-10 polypeptide. The 
VAMP-10 polypeptide composition may further comprise artificial lipids to facilitate delivery of 
the second molecule by lipisomes or lipid vesicles. Methods for using VAMP-10 polypeptides in 
these methods are known in the art and include U.S. Patents 6,074,844, 6,203,794 and 6,099,857, 

1 5 incorporated by reference in their entireties. In a preferred embodiment, VAMP-1 0 polypeptides 
are used to faciliate delivery of a second composition, e.g., lipisome mediated DNA transfection, to 
cells in culture, preferably neuronal cells, and further preferably to the presynaptic membrane. 

VAMP-10 polypeptides are also useful in methods of inhibiting the release of 
neurotransmitters by preventing the docking and/or fusing of a presynaptic vesicle to the 

20 presynaptic membrane. These polypeptides may be referred to as excitation-secretion uncoupling 
peptides (ESUPs). Fragments of VAMP-10 having this blocking activity can be identified using 
methods known in the art (See e.g., U.S. Patents 6,090,63 1 and 6,169,074 incorporated by reference 
in their entireties). ESUPs of the present invention comprise synthetic and purified VAMP-10 
peptide fragments which correspond in primary structure to peptides which serve as binding 

25 domains for the assembly of a ternary protein complex ("docking complex") which is critical to 
neuronal vesicle docking with the cellular plasma membrane prior to neurotransmitter secretion. 
Preferably, the primary sequence of the ESUPs of the invention also includes amino acids which are 
identical in sequence to the VAMP-10 peptide products of BoTx and TeTx proteolytic cleavage in 
neuronal cells, or fragments thereof ("proteolytic products"). For optimal activity, ESUPs of the 

30 invention have a minimum length of about 20 amino acids and a maximal length of about 28 amino 
acids, although they may be larger or smaller. Preferably, the ESUPs correspond in primary 
structure to binding domains in the docking complex, most preferably the region of such binding 
domains that are involved in the formation of a coiled-coil structure in the native docking complex 
proteins. ESUPs may also be used as pharmaceutical carriers as part of fusion proteins to deliver 

35 substances of interest into neural cells in a targeted manner. Preferred VAMP-10, or ESUP, 
polypeptides for use in inhibiting the release of neurotransmitters include those comprising - 
NRRLQQTQAQVDEVVDIMRVNVDKVLERDQKLSELDDRADALQAGPSQFETSAAKLKRK 
- of SEQ ID NO:305. More preferred ESUP polypeptides comprise an amino acid sequence portion 
of SEQ IDNO:305 selected from the group consisting of: RVNVDKVLERDQKLSELDD; 
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KVLERDQKLSELDDRA; VNVDKVLERDQKLSELDDRA; 

DIMRVNVDKVLERDQKLSELDDRADAL; DEVVDIMRVNVD; QAQVDEVVDIMRVNVD; 
LQQTQAQVDEVVDIMRVNVD; QQTQAQVDEVVD; NRRLQQTQAQVDEVVD; and 
NLTSNRRLQQTQAQVDEVVD. 
5 The ESUPs above may be used to inhibit or treat pain according to U.S. Patent 6,1 1 3,91 5 or 

5,989,545 (incorporated by reference herein in their entireties) by substituting the polypeptides of 
the present invention for BoTx type A. 

Because VAMP-10 is a component of vesicles, antibodies to VAMP- 10 are useful in the 
detection of vesicles, preferably neuronal vesicles transporting neurotransmitters. VAMP-10 can be 

10 used during purification of vesicles as a marker for vesicles or vesicles can be detected using 

antibodies to VAMP-10 in assays such as immunohistochemistry. Following exocytosis of vesicles, 
a portion of the VAMP-10 inserted in the vesicle appears on the surface of the axon, thus making 
VAMP-10 useful for the detection and monitoring of exocytosis of synaptic vesicles. 

Detection of VAMP-10 expression (mRNA or protein) levels or mutated forms of VAMP- 

15 10 is further useful in the determination or diagnosing of whether someone is at risk of developing 
or has a neurological disorder, such as mood disorders selected from depression, bipolar disorder, 
schizophrenia, etc.), wherein a decreased level in expression of VAMP-10, mRNA or protein, as 
compared to an individual without a neurological disorder indicates the individual has the disorder 
or is as risk of having the disorder in the future. 

20 The present invention further includes a novel assay system for toxins, such as clostridial, 

tetanus toxin (TeTx) and botulinum toxin (BoNT), using novel reagents. Preferably, methods of 
U.S. Patent 6,043,042, incorporated by reference in its entirety, are used to perform the assay, 
wherein a VAMP-10 polypeptide is the substrate cleaved by the test compound. More specifically, 
the assay comprises the steps of: 

25 The invention relates to an assay for botulinum toxin or tetanus toxin comprising the steps 

of: 

(a) combining a test compound with a substrate and with antibody, wherein the substrate 
has a cleavage site for the toxin and when cleaved by toxin forms a product, and wherein the 
antibody binds to the product but not to the substrate; and wherein the substrate is a VAMP- 1 0 

30 polypeptide; and 

(b) testing for the presence of antibody bound to the product, which product is attached to a 
solid phase assay component. 

Preferably, in the practice of this invention, the VAMP-10 polypeptide is cleaved by the 
toxin to generate new peptides having N- and C-terminal ends. In addition, the peptide substrate is 
35 attached to a solid phase component of the assay. 

The assay according to the invention may utilize assay components (a) and (b): 
(a) a peptide linked to a solid-phase, the peptide being cleavable by the toxin to generate a 
cleavage product, 
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(b) an antibody that binds to the cleavage product but not to the uncleaved polypeptide or 
an antibody that binds a cleavage product that is either the N-terminal or C-terminal portion of the 
VAMP- 10, and the assay may comprise the steps of: 

(i) combining a test compound that may contain or consist of the toxin with the solid-phase 
5 peptide to form an assay mixture, 

(ii) subsequently or simultaneously combining the assay mixture with the antibody, and 

(iii) subsequently or simultaneously determining whether there has been formed any 
conjugate between the antibody and the cleavage product. 

Preferably, the step (i) of the assay is carried out in the presence of a zinc compound and a 
1 0 VAMP- 1 0 polypeptide. 

In this embodiment, the assay comprises: 

(i) combining the test compound with a solid phase comprising a VAMP- 10 polypeptide, 

(ii) washing the test compound from the solid phase, 

(iii) combining the solid phase with an antibody adapted for binding selectively with 
1 5 peptide cleaved by toxin, and 

(iv) detecting a conjugate of the antibody with cleaved peptide. 
In another embodiment, the assay comprises: 

(1) adding a test solution to an assay plate comprising immobilized peptide, the peptide 
being a VAMP- 10 polypeptide; 

20 (ii) incubating the assay plate, 

(iii) washing the plate with a buffer, 

(iv) adding to the plate an antibody solution, said solution comprising an antibody adapted 
selectively to bind to a peptide selected from the group consisting of (I) the 50 C-terminal amino 
acid residues SEQ ID NO:305, the 30 C-terminal amino acid residues SEQ ID NO:305, and the 20 

25 C-terminal amino acid residues SEQ ID NO:305 (any other VAMP-10 polypeptide of the present 
invention may also be selected). 

(2) a peptide the N-terminal end of which is selected from the group consisting of: (1) the 50 
N-terminal amino acid residues SEQ ID NO:305, the 30 N-terminal amino acid residues SEQ ID 
NO:305, and the 25 N-terminal amino acid residues SEQ ID NO:305 (any other VAMP-10 

30 polypeptide of the present invention may also be selected). 

(v) incubating the assay plate, 

(vi) washing the plate with a buffer, and 

(vii) measuring the presence of antibody on the assay plate. 

In this embodiment, the antibody may be linked to an enzyme and the presence of antibody 
35 on the plate is measured by adding an enzyme substrate and measuring the conversion of the 
substrate into detectable product. The detectable product may be colored and measured by 
absorbance at a selected wavelength. 

In the practice of the invention, the inactive toxin present in the test compound may be 
converted to active toxin. This may be accomplished by adding a protease to the test compound. 
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The antibody-peptide conjugate may be detected using a further antibody specific to the 
first antibody and linked to an enzyme. 



5 Proteins of SEP ID NO: 171 (internal designation Clone ID:589115) and related protein of SEQ ID 
NO:457. 

The polynucleotides of SEQ ID NO:2 and SEQ ID NO:340 and polypeptides of SEQ ID 
NO: 171 and 457 encode a C-terminal variant of Apolipoprotein Al, herein referred to as ApoAI- 
CTV. An embodiment of the invention includes compositions of SEQ ID NO:2, 340, 171, and 457 
10 which encode for this novel variant of the apolipoprotein family of lipid transporting proteins. 
Specifically, ApoAI-CTV is a component of high density lipoprotein which functions to remove 
cholesterol from circulation and thus providing protection against the development of 
atherosclerosis, coronary atherosclerotic lesions and subsequent microvascular and cardiovascular 
disease. 

1 5 Preferred polynucleotides of the invention are compositions of the novel portion of the 

cDNA from bases 465 to 521 of SEQ ID NO:2 including the nucleic acids comprising the 
sequences — 

GCAGCTTTCTTAACTATCCTAACAAGCCTTGGACCAAATGGAAATAAAGCTTTTTGA-, - 
GAAGGCAGCTTTCTTAACTATCCTAACAAGCCTTGGACCAAATGGAAATAAAGCTTTTT 
20 GA-, or- 

AGCTCTACCGCCAGAAGGCAGCTTTCTTAACTATCCTAACAAGCCTTGGACCAAATGG 
AAATAAAGCTTTTTGATGAAAAAA- of SEQ ID NO:2 and 340. 

Preferred polypeptides of the invention are compositions of the novel C-terminal portion 
comprising the amino acid sequence -AAFLTILTSLGPNGNKAF, - 
25 MELYRQKAAFLTILTSLGPNGNKAF, or -QKKWQEEMELYRQKAAFLTILTSLGPNGNKAF 
of SEQ ID NO: 171 and SEQ ID NO:457. 

Further preferred polypeptides of the invention include the compositions comprising the 
apolipoprotein domain 

KAAVLTLAVLFLTGSQARHFWQQDEPPQSPWDRVKDLATVYVDVLKDSGRDYVS 
30 QFEGSALGKQLNLKLLDNWDSVTSAFSNLREQLGPVTQEXWDNLEKETEGLRQEMSKDLE 
EVKAKVQPYLDDFQKKWQEEMELYRQKAAFLTILTSLGPNGNKA of SEQ ID NO: 1 7 1 and 
457, or the amino acid residue positions -17 to +141 of SEQ IDNO:171. 

An embodiment of the invention includes a method for treatment of atherosclerosis or 
cardiovascular diseases, comprising administering to an individual a therapeutically effective 
35 amount of apoAI-CTV or variants or mixtures thereof to lower total plasma cholesterol at least 5% 
of pretreatment levels. 

Further utility of the polypeptides of the present invention may be further confirmed by 
methods of production and use of other apolipoproteins by those skilled in the art or as described by 
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Ageland et al in US Patent 5990081, which disclosure is hereby incorporated by reference in its 
entirety. 

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NO:2 and SEQ ID NO:340 and polypeptides of SEQ ID NO: 1 7 1 and 457 described throughout the 
5 present application also pertain to the human cDNA of clone 5891 15. 

Proteins of SEP ID NO:302 (internal designation Clone ID: 1000853793) and related protein of 
SEP ID NO:543. 

MRLFLSLPVLVVVLSIVLEGPAPAQGTPDVSSALDKLKEFGNTLEDKARELISRIKQ 
10 SELSAKMREWFSETFQKVKDKLKIDS 

The polynucleotides of SEQ ID NO: 133 and SEQ ID NO:429 encode human apolipoprotein 
CI (ApoCI) polypeptide of SEQ ID NO:302 and SEQ ID NO:543, respectively. The ApoCI of the 
invention differs by 1 amino acid comprising the amino acid sequence FQKVKDKLKI, where 
aspartate (D at position 77 of SEQ ID NO:302) replaces a glutamate (E) of the ApoCI of GENPEP 
15 accession X00570, AF050154, and M20902. ApoCI is a member of the apolipoprotein family of 
lipid binding and transporting proteins specifically functioning to transport cholesterol esters. 

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NO: 133 and SEQ ID NO:429 and polypeptides of SEQ ID NO:302 and SEQ ID NO:543, described 
throughout the present application also pertain to the human cDNA of clone 1000853793, and the 
20 polypeptides encoded thereby. 

Proteins of SEP ID NO:295 (internal designation Clone ID:642948\ SEP ID NO:296 (internal 
designation Clone ID:638743V and SEP ID NP:539. 

MEASALTSSAVTSVAKVVRVASGSAVVLPLARIATVVIGGVVAMAAVPMVLSAMGFTAA 
25 GIASSSIAAKMMSAAAIANGGGVASGSLVATLQSLGATGLSGLTKFILGSIGSAIAAVIARFY 

The polynucleotides of SEQ ID NO: 126, 127, and 425 and the polypeptides of SEQ ID 
NO:295, 296 and 539 encode human transmembrane, alpha-interferon-inducible polypeptides, 
aINFIP-1, aINFIP-1, and aINFIP-3, respectively. Preferred polynucleotides and polypeptides of the 
invention comprise the nucleic acid sequences of SEQ ID NO: 126, 127, and 425 and amino acid 

30 sequences of SEQ ID NO:295, 296 and 539. 

Preferred polypeptides of SEQ ID NO:295 and SEQ ID NO:539 for use in the methods 
described herein include the amino acid sequences comprising - 
VLSAMGFTAAGIASSSIAAKMMSAAAIANGGGVASG-, - 
SSIAAKMMSAAAIANGGGVASGSLVATLQSLGAT-, or - 

35 VIGGVVAMAAVPMVLSAMGFTAAGIASSSIAAKMMSAAAIANGGGVASGSLVATLQSLG 
ATGLSGLTK-. 
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Preferred polypeptides of SEQ ID NO:296 for use in the methods described herein include 
the amino acid sequence -AAAIANXGGVASGSLVATLQSLGATGLSGLTKF- or - 
LSAMGFTAAGIASSSIAAKMMSAAAIANXGGVASGSLVATLQSLGATGLS-. 

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
5 NOs: 1 26, 1 27, and 425 and the polypeptides of SEQ ID NOs:295, 296 and 539, described 

throughout the present application also pertain to the human cDNA of Clone ID:642948 and Clone 
ID:638743, and the polypeptides encoded thereby. 

Sites of glycine myristylation within a polypeptide function to modulate the activity and 
compartmentalization of the protein (Resh, M. D. Biochim Biophys Acta 1451:1-16 (1999)). 

10 Preferred polypeptides of the invention include fragments comprising the sites of N- 

myristylation. Preferred amino acids of said sites within SEQ ID NO:295 and SEQ ID NO:539 
include GGVVAM (positions 39-44), GIASSS (positions 60-65), GGGVAS (positions 79-84), 
GSLVAT (positions 85-90), and GSIGSX (positions 108-1 13). Further preferred are amino acids 
within 6 residues preceding or 6 residues following said amino acid sequences. Further preferred 

15 amino acids include sequences comprising the sites of N-myristylation in the polypeptides of SEQ 
ID NO:296. Preferred amino acids of said sites within SEQ ID NO:296 include 
IATVVIGGVVAMAAVPMV, MGFTAAGIASSSIAAKMM, AAIANXGGVASGSLVATL, 
NXGGVASGSLVATLQSLGA, and LTKFILGSIGSAIAAVIAR. 

Interferons (IFNs) are a part of the group of intercellular messenger proteins known as 

20 cytokines and are part of the body's natural defense to viruses and tumors. Type I IFNs (alpha and 
beta interferons) are produced in a variety of cells types and their biosynthesis is stimulated by 
viruses and other pathogens, and by various cytokines and growth factors. Both a- and y-IFNs are 
immunomodulators and anti-inflammatory agents, activating macrophages, T-cells and natural killer 
cells (reviewed in Jonasch and Haluska, Oncologist 6(1):34 (2001)). As part of the body's natural 

25 defense to viruses and tumors, INFs affect the function of the immune system and have direct action 
on pathogens and tumor cells. IFNs mediate these multiple effects by inducing the synthesis of 
cellular proteins, including the polypeptides of the present invention, aINFIP-1, aINFIP-1, and 
aINFIP-3. 

Antiviral activity of the aINFIP polypeptides are assayed according to conventional 
30 methods (Tovey et al, Proc. Soc. Exp. Biol, and Med., 1974 146: 809-815). Preferred polypeptides 
of SEQ ID NO:295, 296 and 539 and fragments thereof include those which possess antiviral 
function, where preferred antiviral activity is against herpes simplex virus and hepatitis virus C, 
alone or in combination with known antiviral treatments such as interferon alpha. 

The antitumor activity the aINFIP polypeptides of the invention can be demonstrated by 
35 similar methods using tumor cell lines rather than treatment of cells with virus as used to test 

antiviral activity. Tumor cell lines examples include MCF-7 (human breast cancer derived), NOS-1 
(human oral primary squamous cell carcinoma derived), and MedB-1 (human primary mediastinal 
large B-cell lymphoma derived). 
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Further utility of the polypeptides of the present invention may be further confirmed by 
methods of interferon inducible proteins in the inhibition of viral functions such as cell penetration, 
uncoating, RNA and protein synthesis, assembly and release described in Hardman et al., 
Pharmacological Basis of Therapeutics, McGraw-Hill, New York NY pp 121 1-1214, 25 (1996), 
5 disclosure of which is hereby incorporated by reference in its entirety. 

Another embodiment of the present invention relates to the use of aINFIP polypeptides or 
fragments thereof to treat and/or prevent the ill-effect of bacterial infection. In a preferred 
embodiment, the protein of the invention may be used to counteract the effects of the bacterial 
endotoxin lipopolysaccharide (LPS). The methods for using such compositions is described in 

10 Dziegielewska and Andersen, Biol. Neonate, 74:372-5 (1998), the disclosure of which is 
incorporated herein by reference in its entirety. 

Furthermore, the aINFIP polypeptides or fragments thereof may be used to identify specific 
molecules with which it binds such as agonists, antagonists or inhibitors. Another embodiment of 
the present invention relates to methods of using the aINFIP polypeptides or fragments thereof to 

1 5 identify and/or quantify cytokines of the interferon family as well as other cytokines such as IL10 
and tumor antigens, which may interact with the aINFIP polypeptides of the invention. 

The aINFIP polypeptides of the invention or fragments thereof are included in 
pharmaceutical preparations for treatment, prevention or alleviation of cancers. In another 
embodiment of the present invention, the aINFIP polypeptides of the invention or fragments thereof 

20 are used included in pharmaceutical preparations for treatment, prevention or alleviation of viral or 
bacterial infections. In another embodiment of the present invention, the aINFIP polypeptides of 
the invention or fragments thereof are used to inhibit and/or modulate the effect of cytokines and 
related molecule such as 11-2, TNF alpha, CTLA4, CD28, and others, by preventing the binding of 
the endogenous cytokine to their natural receptors, thereby blocking cell proliferation or inhibitory 

25 signals generated by the ligand-receptor binding event. 

In another embodiment of the present invention, the aINFIP polypeptides of the invention 
or fragments thereof are useful to correct defects in in vivo models of disease such as autoimmune, 
inflammation and tumor models, by injecting the protein either intra peritoneally intravenously, 
subcutaneously or directly in the diseased tissue. 

30 The polynucleotides of SEQ ID NO: 126, 127, and 425 or fragments thereof is useful in 

diagnostic assays for aINFIP-1, aINFlP-2, or aINFIP-3 gene expression in in vitro models or in 
conditions associated with expression of the aINFIP polypeptides of the invention. The diagnostic 
assay is useful to distinguish between absence, presence, and excess expression of the gene and to 
monitor regulation of levels of the gene of the invention during therapeutic intervention. The DNA 

35 may also be incorporated into effective eukaryotic expression vectors and directly targeted to a 
specific tissue, organ, or cell population for use in gene therapy to treat the above mentioned 
conditions, including tumors and/or to correct disease- or genetic-induced defects in any of the 
above mentioned proteins including the protein of the invention. 



108 



WO 02/083898 PCT/IBO 1/009 14 

Protein of SEP ID NO: 170 (internal designation Clone ID:502084) and related protein of SEP ID 

NO:456. 



The polynucleotides of SEQ ID NO:339 and polypeptides of SEQ ID NO:456 encode 
neutrophil stimulating protein 2, previously described in WO 9006321 (GENPEP accession 
5 AO 13 19) as a novel factor having neutrophil-stimulating activity. The polynucleotide of SEQ ID 
NO:l encodes a novel polypeptide variant, neutrophil stimulating protein 2v, comprising the amino 
acid sequence of SEQ ID NO: 170 in which an aspartate (D) residue is located at position +16 of 
SEQ ID NO: 170 rather than a glutamate (E). Preferred compositions of the invention include the 
polypeptides of SEQ ID NO: 1 70. Further preferred amino acids of SEQ ID NO: 1 70 comprise the 

1 0 sequence LAKGKDESLDS, QXKRNLAKGKDESLDSDLYAE, or 

SSTKGQXKRNLAKGKDESLDSDLYAELRCMCIKTTSGIHPKNIQSLEVIGKGTHCNQVEVIA 
TLKDGRKICLDPDAPRIKKIVQKKLAGDESAD. It will be appreciated that all characteristics 
and uses of the polynucleotides of SEQ ID NO:l and SEQ ID NO:339 and polypeptides of SEQ ID 
NO: 1 70 and SEQ ID NO:456, described throughout the present application also pertain to the 

1 5 human cDNA of clone 502084, and the polypeptides encoded thereby. 

A preferred embodiment of the invention includes use of the novel neutrophil stimulating 
protein 2v of SEQ ID NO: 1 70 in a method to stimulate wound healing by contacting the wound area 
with effective amount of polypeptide of SEQ ID NO: 1 70 or further use as described in US Patent 
5,804,176, which disclosure is hereby incorporated by reference in its entirety. A further preferred 

20 includes use of neutrophil stimulating protein 2v in the enhancement of angiogenesis for 
revascularization after injury such following myocardial infarction, wherein site of injury is 
contacted with effective amount of polypeptide of SEQ ID NO: 1 70 or use as further described in 
US Patent 5,871,723, which disclosure is hereby incorporated by reference in its entirety. 
Antibodies against neutrophil stimulating protein 2v, by preventing or blocking the deposition of 

25 connective tissue matrix, are useful in the treatment of fibrotic disorders by contacting the 

polypeptides of SEQ ID NO: 1 70 with fibrotic tissue, such as in scleroderma, liver cirrhosis, and 
myelofibrosis. 

An embodiment of the invention includes fragments of SEQ ID NO: 1 70 which comprise 
domains which impart function to this cytokine. Preferred fragments include the amino acid 

30 sequence comprising the IL8 domain, 

DSDLYAELRCMCIKTTSGIHPKNIQSLEVIGKGTHCNQVEVIATLKDGRKICLDPDAPRIKKI 
VQKKL. Further preferred amino acids include the small cytokines (intercrine/chemokine) C-x-C 
subfamily signature of the amino acid sequence comprising 
CMCIKTTSGIHPKNIQSLEVIGKGTHCNQVEVIATLKDGRKICLD. 

35 Further preferred polypeptides include portions comprising sites of Protein Kinase C 

phosphorylation including amino acid residues 2 to 4, residues 13 to 15, residues 36 to 38 and 
residues 97 to 99 of SEQ ID NO: or amino acids sequence comprising SLR, SAR, STK, and TLK. 
Further preferred polypeptides include portions of the amino acid sequence comprising sites of 
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Casein kinase II phosphorylation including amino acid residues 97 to 100 or the amino acid 
sequence comprising TLKD. 

Further preferred polypeptides include portions of the amino acid sequence comprising sites 
of N-myristylation or the amino acid residues comprising GTHCNQ. 
5 Further preferred polypeptides include the small cytokines (intercrine/chemokine) C-x-C 

subfamily signature of the amino acid sequence comprising 
CMCIKTTSGIHPKNIQSLEVIGKGTHCNQVEVIATLKDGRKICLD. 

Proteins of SEP ID NO: 227 (internal designation Clone ID: 166601) and related protein of SEP ID 
10 NP: 502. 

Polynucleotides of SEQ ID NP:58 and SEQ ID NP:385 encode the polypeptides of SEQ 
ID NP:227 and SEQ ID NP:502, respectively, with amino acid sequence 

MAAAAVPSLLLSLPPHQGLTFSNKIQPFGAQGVLHPEPGLRDWLLPTCSRQLRVALPEKGS 

EGSLCQTQLPATPCFLPSNTVRT. It will be appreciated that all characteristics and uses of the 
15 polynucleotides of SEQ ID NPs:58 and 385 and polypeptides of SEQ ID NP:227 and 502, 

described throughout the present application also pertain to the human cDNA of clone 166601, and 

the polypeptides encoded thereby 

The polynucleotides of SEQ ID NP:58 and 385 and polypeptides of SEQ ID NG:227 and 

502 encode a transcriptional regulatory protein. Preferred polynucleotides of the invention include 
20 the nucleic acid sequences comprising Clone 166601, the polynucleotides comprising SEQ ID 

NP:58 and the polynucleotides comprising SEQ ID NG:385. Preferred polypeptides of the 

invention include the amino acid sequences derived from the nucleic acid sequence comprising 

Clone 1 66601 , the polypeptides comprising the amino acid sequences of SEQ ID NP:227 and the 

polypeptides comprising the amino acid sequences of SEQ ID NP:502. 
25 In an embodiment of the invention, preferred polypeptides include the portion comprising 

the site of protein kinase C phosphorylation or the amino acid sequences comprising SNK or TVR 

of SEQ IDNP:227 and 502. 

In another embodiment, preferred polypeptides of the invention include the portion of the 

amino acid sequence comprising sites of myristylation or the amino acids comprising the sequence 
30 GLTFSN or GSEGSL of SEQ ID NG:227 or 502. 

Proteins of SEP ID NP:268 (internal designation Clone ID:21 1056) and related protein of SEP ID 
NP:530. 

The polynucleotides of SEQ ID NP:99 and SEQ ID NP:4 1 6 and polypeptides of SEQ ID 
35 NG:268 and SEQ ID NG:530, respectively, encode a novel human tryptophan hydroxylase, 
including the amino acid sequence hereafter referred to as nhTPH. Tryptophan is taken up by 
active transport into the neurons where it is hydroxylated to 5-hydroxytryptophan (5HTP). The 
latter is then decarboxylated to serotonin, a neurotransmitter involved in central nervous disorders, 
especially mood disorders, sleep disorders, and eating disorders. Activity of the polypeptide of the 



110 



WO 02/083898 PCT/IBO 1/009 14 

invention increases production of serotonin levels and increase the metabolism of tryptophan. Thus 
polypeptides of the invention are useful in the in vitro production of the serotonin and metabolism 
of tryptophan. As example, an expression vector containing the polynucleotides of SEQ ID NO:99 
or SEQ ID NO:416 can be introduced into a cell line by methods known in the art such as by 
5 calcium precipitation; tryptophan can be supplied in the media; and serotonin produced by the cells 
can be extracted by known methods. 

The invention further relates to a method of screening for test compounds that bind hnTOH 
comprising the steps of contacting a hnTOH polypeptide with said test compound and detecting or 
measuring whether said test compound binds said hnTOH polypeptide. The invention further 

10 relates to a method of screening for test compounds that activate hnTOH comprising the steps of 
contacting a hnTOH polypeptide with said test compound and detecting or measuring whether said 
test compound activates said hnTOH polypeptide, for example by measuring serotonin production 
or tryptophan depletion. 

Another embodiment includes physiologically acceptable compositions of test compounds 

15 found to increase serotonin production, referred to as activators, in a screen. Further embodiments 
include methods to use activators that have been identified in a screen or previously known in the 
art in the preparation of physiological acceptable formulations for use in in vivo. Further preferred 
are methods to use activators in a physiologically acceptable formulation in the treatment of CNS 
disorders in which tryptophan and serotonin levels are aberrant, particularly depression, anxiety 

20 disorder, bipolar disorder, and eating disorders. 

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NO:99 and SEQ IDNO:416 and polypeptides of SEQ ID NO:268 and SEQ IDNO:530, described 
throughout the present application also pertain to the human cDNA of clone 21 1056, and the 
polypeptides encoded thereby. 

25 

Proteins of SEQ ID NO: 190 (internal designation Clone 10:147648) and related protein of SEP ID 
NO:474. 

The polynucleotides of SEQ ID NO:21 and SEQ ID NO:357 and polypeptides of SEQ ID 
NO: 190 and SEQ ID NO:474 encode a novel DNA binding polypeptide containing a leucine zipper 

30 pattern multimerization domain, thereafter referred to as LZP, also known as bZIP transcription 

factor basic domain signature (Hai et al., Genes Dev. 3:2083(1989)). An embodiment of the present 
invention includes the polynucleotides, polypeptides and fragments thereof comprising the 
sequences of SEQ ID NO:2 1 , 357, 1 90, and 474 of the invention. Preferred polypeptides of the 
present invention are directed to the amino acid sequences which comprise the leucine zipper 

35 domain selected from the following amino acids of SEQ ID NO: 1 90 and 474 including 
LAAGAVTLGIGFFALASALWFL; PKGFFNYLTYFLAAGAVTLGIG; or 
FFALASALWFLICKRREIFQNS. It will be appreciated that all characteristics and uses of the 
polynucleotides of SEQ ID NO:21 and SEQ ID NO:357 and polypeptides of SEQ ID NO: 190 and 
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SEQ ID NO:474, described throughout the present application also pertain to the human cDNA of 
clone 147648, and the polypeptides encoded thereby. 

Leucine-zippers permit dimerization of various cytoplasmic hormone receptors and 
enzymes (Forman, et al., Mol Endocrinol^ 3, 1610-1626 (1989)). Leucine zippers are also a 
5 common feature of transcription factors, where they permit homo- or heterodimerization resulting in 
tight binding to DNA strands (for reviews, see Abel, et al., Nature 341, 24-25 (1989); Jones, et al., 
Cell 61, 9-1 1 (1990); Lamb, et al., Trends in Biochemical Sciences 16, 417-422 (1991)). Therefore, 
preferred polypeptides of the present invention are useful tools in several areas of biotechnology, 
especially in protein engineering, where their ability to mediate homo-dimerization or hetero- 

10 dimerization has found several applications, including but not limited to immunochemistry, 
antibody generation, preparation of soluble oligomeric proteins, complementation assasys. The 
utility of the present invention may be further confirmed by methods described, for example, by 
Bosslet et al (US patent 5,643,731) in which use of a pair of leucine zippers for in vitro diagnosis, in 
particular for the immunochemical detection and determination of an analyte in a biological liquid ; 

1 5 by Tso et al (US patent 5,932,448) in which use of leucine zippers for producing bispecific antibody 
heterodimers ; by Conrad et al (US patent 5,965,712) , Ciardelli et al (US patent 5,837,816) , and 
Spriggs et al (WO9410308) in which methods of preparing soluble oligomeric proteins using 
leucine zippers have been described; and by Pelletier et al (WO9834120) in which methods to use 
leucine zipper forming sequences in protein fragment complementation assays to detect 

20 biomolecular interactions has been described, all examples which disclosures are hereby 
incorporated by reference in their entireties. 

The multimerization activity of the polypeptides of the present invention containing leucine 
zipper domains may be assayed using any of the assays known to those skilled in the art including 
circular dichroism spectrum and thermal melting analyses as described in US patent 5,942,433, 

25 which disclosures are hereby incorporated by reference in their entirety. Alternatively, the leucine 
zipper motif in LZP could be used by those skilled in art as a "bait protein" in a well established 
yeast double hybridization system to identify its interacting protein partners in vivo from cDNA 
library derived from different tissues or cell types of a given organism. Alternatively, LZP or part 
thereof could be used by those skilled in art in mammalian cell transfection experiments. When 

30 fused to a suitable peptide tag such as [His] 6 tag in a protein expression vector and introduced into 
culture cells, this expressed fusion protein can be immunoprecipitated with its potential interacting 
proteins by using anti-tag peptide antibody. This method could be chosen either to identify the 
associated partner or to confirm the results obtained by other methods such as those just mentioned. 
In a preferred embodiment, the invention relates to compositions and methods of using the 

35 LZP polynuceotides and polypeptides of SEQ ID NO: and SEQ ID NO: or fragment thereof for 
preparing soluble multimeric proteins, which consist in multimers of fusion proteins containing a 
leucine zipper fused to a protein of interest, using any technique known to those skilled in the art 
including those described in international patent W094 10308, which disclosure is hereby 
incorporated by reference in its entirety. In another preferred embodiment, LZP or derivative 
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thereof is used to produce bispecific antibody heterodimers as described in US patent 5,932,448, 
which disclosure is hereby incorporated by reference in its entirety. Briefly, leucine zippers capable 
of forming heterodimers are respectively linked to epitope binding components with different 
specificities. Bispecific antibodies are formed by pairwise association of the leucine zippers, 
5 forming an heterodimer which links two distinct epitope binding components. In still another 
preferred embodiment, LZP or part thereof or derivative thereof is used for detection and 
determination of an analyte in a biological liquid as described in US patent 5,643,73 1 , which 
disclosure is hereby incorporated by reference in its entirety. Briefly, a first leucine zipper is 
immobilized on a solid support and the second leucine zipper is coupled to a specific binding 

10 partner for an analyte in a biological fluid. The two peptides are then brought into contact thereby 
immobilizing the binding partner on the solid phase. The biological sample is then contacted with 
the immobilized binding partner and the amount of analyte in the sample bound to the binding 
partner determined. In still another preferred embodiment, the LZP or part thereof may be used to 
synthesize novel nucleic acid binding proteins which are able to multimerize with proteins of 

15 interest, for example to inhibit and/or control cellular growth using any genetic engineering 

technique known to those skilled in the art including the ones described in the US patent 5,942,433, 
which disclosure is hereby incorporated by reference in its entirety. 

In another embodiment, the invention relates to compositions and methods using the LZP or 
part thereof or derivative thereof in protein fragment complementation assays to detect 

20 biomolecular interactions in vivo and in vitro as described in international patent WO9834120, 
which disclosures is hereby incorporated by reference in its entirety. Such assays may be used to 
study the equilibrium and kinetic aspects of molecular interactions including protein-protein, 
protein-nucleic acid, protein-carbohydrate and protein-small molecule interactions, for screening 
cDNA libraries for binding to a target protein with unknown proteins or libraries of small organic 

25 molecules for biological activity. 

Still, another object of the present invention relates to the use of the LZP or part thereof for 
identifying new leucine zipper domains using any techniques for detecting protein-protein 
interaction known to those skilled in the art. Among the traditional methods which may be 
employed are co-immunoprecipitation, crosslinking and co-purification through gradients or 

30 chromatographic columns of cell lysates. Once isolated as a protein interacting with the LZP, such 
an intracellular protein can be identified (e.g. its amino acid sequence determined) and can, in turn, 
be used, in conjunction with standard techniques, to identify other proteins with which it interacts. 
The amino acid sequence thus obtained may be used as a guide for the generation of oligonucleotide 
mixtures that can be used to screen for gene sequences encoding such intracellular proteins. 

35 Screening may be accomplished, for example, by standard hybridization or PCR techniques. 

Techniques for the generation of oligonucleotide mixtures and the screening are well-known. (See, 
e.g., Ausubel et ah> eds., Current Protocols in Molecular Biology, J.Wiley and Sons (New York, 
NY 1993) and PR Protocols: A Guide to Methods and Applications, 1990, Innis, M. et al., eds. 
Academic Press, Inc., New York ). 
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Alternatively, methods may be employed which result in the simultaneous identification of 
genes which encode the intracellular proteins that can dimerize with the LZP or part thereof using 
any technique known to those skilled in the art. These methods include, for example, probing cDNA 
expression libraries, in a manner similar to the well known technique of antibody probing of 
5 lambda.gtl 1 libraries, using as a probe a labeled version of the LZP or part thereof, or fusion 

protein, e.g., the LZP or part thereof fused to a marker (e.g., an enzyme, fluor, luminescent protein, 
or dye), or an Ig-Fc domain (for technical details on screening of cDNA expression libraries, see 
Ausubel et al, supra). Alternatively, another method for the detection of protein interaction in vivo, 
the two-hybrid system, may be used. 

10 

Proteins of SEP ID NO:3 1 8 (internal designation Clone ID: 1 24608) and related protein of SEP ID 
NO:556. 

The polynucleotides of SEQ ID NP: 149 and SEQ ID NP:442 and polypeptides of SEQ ID 
NP:3 1 8 and SEQ ID NP:556 encode an RNA-binding protein, hgRBP, which functions in RNA 
1 5 processing and protein expression. The preferred composition of SEQ ID NO:3 1 8 and 556 include 
MERPDKAALNALQPPEFRNESSLASTLKTLLFFTALMITVPIGLYFTTKSYIFEGALGMSNR 
DSYFYAAIVAVVAVHVVLALFVYVAWNEGSRQWREGKQD. 

Further preferred polypeptides include those of SEQ ID NG:3 1 8 or SEQ ID NP:556 
comprising an N-myristoylation site or the amino acid sequence at positions 43-48 or comprising 
20 the amino acid sequence GLYFTT which targets the protein to the membrane of the endoplasmic 
reticulum for function of hgRBP in translation of cellular mRNA into protein. 

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NP: 149 and SEQ ID NP:442 and polypeptides of SEQ ID NP:3 1 8 and SEQ ID NP:556, described 
throughout the present application also pertain to the human cDNA 124608of clone , and the 
25 polypeptides encoded thereby. 

Protein of SEQ ID NP:337 (internal designation Clone ID: 1 13448) 

The polynucleotides of SEQ IDNP:168 and related SEQ IDNP:454 and polypeptides of 
SEQ ID NP:337 encode a novel human RNA-binding protein involved in RNA processing and 
30 protein expression which is related to Clone ID: 1 83902 and Clone ID:635993. 

It will be appreciated that all characteristics and uses of the The polynucleotides of SEQ ID 
NP:168 and related SEQ ID NP:454 and polypeptides of SEQ ID NP:337, described throughout 
the present application also pertain to the human cDNA of clone 1 13448, and the polypeptides 
encoded thereby. 

35 

Protein of SEP ID NP:328 (internal designation Clone ID: 1 83902) 

Polynucleotides of SEQ ID NP:159 and related SEQ ID NP:450 and polypeptides of SEQ 
ID NP:328 encode a novel human RNA-binding protein involved in RNA processing and protein 
expression which is related to Clone ID: 1 13448 and Clone ID:635993. 
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It will be appreciated that all characteristics and uses of the The polynucleotides of SEQ ID 
NO: 159 and related SEQ ID NO:450 and polypeptides of SEQ ID NO:328, described throughout 
the present application also pertain to the human cDNA of clone 1 83902, and the polypeptides 
encoded thereby. 

5 

Protein of SEP ID NO:329 (internal designation Clone ID:635993) 

Polynucleotides of SEQ ID NO: 1 60 and related SEQ ID NO:45 1 and polypeptides of SEQ 
ID NO:329 encode a novel human RNA-binding protein involved in RNA processing and protein 
expression which is related to Clone ID: 1 83902 and Clone ID: 1 13448. It will be appreciated that 

10 all characteristics and uses of the The polynucleotides of SEQ ID NO: 160 and related SEQ ID 
NO:45 1 and polypeptides of SEQ ID NO:329, described throughout the present application also 
pertain to the human cDNA of clone 635993, and the polypeptides encoded thereby. 

Many eukaryotic proteins that bind single-stranded RNA contain one or more copies of a 
putative RNA-binding domain of about 90 amino acids. This is known as the eukaryotic RNA- 

15 binding region, RNP-1 signature or RNA recognition motif (RRM) (Bandziulis et al. Genes Dev. 
3:431 (1989); Swanson et al. Trends Biochem. Sci. 13: 86-91 (1988)). RRMs are found in a 
variety of RNA binding proteins, including heterogeneous nuclear ribonucleoproteins (hnRNPs), 
proteins implicated in regulation of alternative splicing, and protein components of small nuclear 
ribonucleoproteins (snRNPs). The polypeptides of SEQ ID NO:337, 328, and 329 encode novel 

20 human RNA binding protein, hereafter referred to as ghRBP which contains one copy of an RRM. 
Further characteristic of a protein which binds to nucleic acids, ghRBP contains a zinc finger motif 
comprising the amino acid sequence. Preferred polynucleotides of the invention include 
polynucleotides comprising the nucleic acids of SEQ ID NO: 159, 160, 168, 450, 451, and 454. 
Preferred polypeptides of the invention are polypeptides comprising the amino acids of SEQ ID 

25 NO:337, 328 and 329. It will be appreciated that all characteristics and uses of the polynucleotides 
of SEQ ID NO: 1 59, 1 60, 1 68, 450, 45 1 ,and 454 described throughout the present application also 
pertain to the human cDNA of Clone ID:183902, Clone ID:635993 and Clone ID:1 13448. 

Preferred amino acids of the invention are residues which comprise the RNA-binding 
domain or portion thereof. Preferred amino acid sequences are selected from the following set of 

30 sequences including 

AFVRRXPWTAASSQLKEHFAQFGHVRRCILPFDKETGFHRGLGWVQFSSEEGLRNALQQE 
NHIIDGVKVQV; 

SINQPVAFVRRJKPWTAASSQLKEHFAQFGHVRRCILPFDKETGFHRGLGWQFSSEEGLRN 
ALQQENHIID; 

35 PWTAASSQLKEHFAQFGHVRRCILPFDKETGFHRGLGWVQFSSEEGLRNALQQENHIIDGV 
KVQVHTRRP. Further preferred are polypeptides of the invention include any fragment of SEQ 
ID NO: which binds to RNA. 

An embodiment of the invention relates to methods of using the polypeptides of the 
invention to bind to RNA molecules in vitro by techniques that are known in the art. Preferred use 
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of the polypeptides of the invention includes extraction of RNA from biological samples, chemical 
reagents, cell homogenates and tissue homogenates. Further utility of the polypeptides of the 
present invention or part thereof may be further confirmed by binding methods described in 
Trifillis, et ai., RNA 5(8): 1071-82 (1999) and U.S. Patent 6,107,029, which disclosures are hereby 
5 incorporated by reference in their entireties. 

Proteins of SEQ ID NO:248 (internal designation Clone ID: 199782), SEP ID NO:249 (internal 
designation Clone ID:821212V SEP ID NO:250 (internal designation Clone ID:202863) and related 
protein of SEP ID NP:5 1 8. 

10 The polynucleotides of SEQ ID NPs:79, 80, 81 and 401 and polypeptides of SEQ ID 

NPs:248, 249, 250 and 518 encode human RNA-associated polypeptides which act as splicing 
factors. It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NPs:79, 80, 81 and 401 and polypeptides of SEQ ID NOs:248, 249, 250 and 518, described 
throughout the present application also pertain to the human cDNA of clones 199782, 821212, and 

1 5 202863, and the polypeptides encoded thereby. 

The translation of genetic information into protein depends on RNA and the first step in this 
process is the transcription of DNA into RNA while retaining all the genetic information encoded in 
DNA. The RNA transcript undergoes various processing steps which include splicing and 
polyadenylation. The mature RNA transcript is translated into protein by the ribosomal machinery. 

20 Nascent RNA transcripts are spliced in the nucleus by the spliceosomal complex which catalyzes 
the removal of introns and the rejoining of exons. At least 40 splicing factors have been identified 
and interaction of these factors are important in the conformational changes needed for the 
enzymatic removal of introns and religation of the exons. Both protein and RNA components are 
involved in the spliceosome assembly and the splicing reaction. There are 2 distinct catalytic steps 

25 involved in the RNA splicing reaction with distinct proteins and RNA species. Alternative splicing 
factors include developmentally regulated proteins that play key roles in developmental processes 
such as pattern formation and sex determination, respectively (Hodgkin, J. et al. (1994) 
Development 120:3681-3689). Alternate splicing is also involved in the tissue specific expression 
of isoforms of proteins, including structural proteins and enzymes. 

30 An embodiment of the present invention relates to compositions of the polynucleotides of 

SEQ IDNP:79, 80, 81 and 401 and polypeptides of SEQ IDNP:248, 249, 250 and 518. Preferred 
amino acids of the invention comprise the zinc finger region or fragment thereof and are selected 
from the following sequences of amino acids from SEQ ID NP:248, 249, 250 and 5 1 8 including 
GACENCGAMTHKKKDCFE; NSIITKYRKGACENCGAM; or THKKKDCFERPRRVGAKF. 

35 The polypeptides of SEQ ID NG:248, 249, 250 and 51 8 are involved in the splicesome 

complex and have function in the processing of RNA processing. Alternatively, the polypeptides of 
the present invention are involved in RNA processing and thus involved in protein expression. A 
preferred embodiment of the invention relates to a method of using the polynucleotides of 
polynucleotides of SEQ ID NP:79, 80, 81 and 401 in vitro. A preferred method of use relates to 
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introduction of said polypeptides or fragments thereof into cells by techniques known in the art such 
as transfection or microinjection. Further preferred are methods to use the polynucleotides of the 
invention to alter protein expression in the given cell. Alternately, polypeptides of the present 
invention can be used in combination with reagents known in the art to alter protein expression in 
5 cell free expression systems, mammalian expression systems, insect expression systems, or bacterial 
expression systems. Furthermore, methods to use the polynucleotides or polypeptides the present 
invention to increase or decrease protein expression is preferred. The utility of the polypeptides of 
the invention or part thereof may be further confirmed using methods described in US Patent 
6020164 and Chua and Reed, Gene Devel 13:841-850 (1999), which disclosures are hereby 

1 0 incorporated by reference in their entireties. 

In another embodiment, methods to screen for inhibitors and activators of the polypeptides 
of the invention are preferred. In another embodiment, molecules or compounds which are 
identified in such a screen are further preferred. Further preferred are compounds which activate or 
inhibit activity of the polypeptides of the current invention. Activity of the the polypeptides of the 

1 5 invention is modified by phosphorylation at cAMP and cGMP dependent phosphorylation sites 
(including 3-6:55-58; 107-1 10) and casein kinase II phosphorylation sites (including 33-36:58- 
61 : 126-129). Preferred activators include but are not limited to compounds which promote 
accumulation of intracellular cAMP and cGMP. Further preferred activators include those 
compounds which activate casein kinase II. Preferred inhibitors are those compounds which inhibit 

20 intracellular cAMP and cGMP accumulation, or those compounds which promote cAMP and cGMP 
degradation. Further preferred inhibitors include compounds which promote the deactivation of 
casein kinase II. Furthermore, inhibitors and activators of the polypeptides of the present invention 
include compounds known in the art as well as compounds to be identified by the method of 
screening. Furthermore, compounds that inhibit or activate the activity of the polypeptides of the 

25 present invention by means other than phosphorylation or dephosphorylation are also preferred. 

In another embodiment, a method for the use of the polynucleotides or polypeptides of the 
present invention in the treatment, prevention, attenuation or diagnosis of disorders of RNA 
processing or protein processing are preferred. Such disorders are selected from a group which 
includes but is not limited to cancers such as adenocarcinoma, leukemia, sarcoma, teratocarcinoma, 

30 and any disorder associated with cell growth and differentiation, embryogenesis, and 

morphogenesis involving any tissue, organ, or system, e.g., the brain, adrenal gland, or reproductive 
system. 

Proteins of SEP ID NO:256 (internal designation Clone ID:822794V SEP ID NO:257 (internal 
35 designation Clone ID:337572) and related protein of SEP ID NG:521 . 

The polynucleotides of SEQ IDNP:87, 88 and 407 and polypeptides of SEQ ID NP:256, 
257 and 521 encode human nuclear polypeptides which interact with transcription factors of the 
Signal Transducers and Activators of Transcription (STAT) family of proteins involved in the 
regulation of cell division. It will be appreciated that all characteristics and uses of the 
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polynucleotides of SEQ ID NO: 87, 88 and 407 and polypeptides of SEQ ID NO:256, 257 and 521, 
described throughout the present application also pertain to the human cDNA of clones 822794 and 
337572, and the polypeptides encoded thereby. 

STATs are pleiotropic transcription factors which mediate cytokine-stimulated gene 
5 expression in multiple cell populations (Levy, Cytokine Growth Factor Rev., 8:81 (1997)). All 
STAT proteins contain a DNA binding domain, a Src homology 2 (SH2) domain, and a 
transactivation domain necessary for transcriptional activation of target gene expression. Janus 
kinases (JAK), including JAK1, JAK2, Tyk, and JAK3, are cytoplasmic protein tyrosine kinases 
(PTKs) which play pivotal roles in initiation of cytokine-triggered signaling events by activating the 

10 cytoplasmic latent forms of STAT proteins via tyrosine phosphorylation on a specific tyrosine 
residue near the SH2 domain (Ihle et a!., Trends Genet., 1 1 : 69 (1995); Darnell, Science 
277(5332):1630 (1997); Johnston et al., Nature, 370: 1513 (1994)). Tyrosine phosphorylated STAT 
proteins dimerize through specific reciprocal SH2-phosphotyrosine interactions and translocate 
from the cytoplasm to the nucleus where they stimulate the transcription of specific target genes by 

1 5 binding to response elements in their promoters (Leonard, Nature Medicine, 2: 968 (1996); Zhong 
et al., PNAS USA, 91:4806 (1994) Darnell, Science, 277:1630 (1997)). 

In an embodiment of the present invention, compositions of the polynucleotides and 
polypeptides or fragments thereof SEQ ID NO:87, 88 and 407 and SEQ ID NO:256, 257 and 521, 
respectively are included. Further preferred are polypeptides of the present invention which interact 

20 with activated STAT 3, but may also interact with STAT1, STAT2 or other STAT homologues. 
Preferred polypeptides of the invention act to inhibit or decrease the activity of STATs. Further 
preferred amino acids of SEQ ID NO:256, 257 and 521 include the SAP domain 
VSSFRVSELQVLLGFAGRNKSGRKHDLLMRALHLL. Activation of cytokine receptors by 
their cognate ligands activate JAKs which in turn, activate STATS. Therefore cytokines and other 

25 hormones which signal through cytokine-like receptors may be modulated by polypeptides or 
polynucleotides of the present invention. Cytokines and other hormones which can thus be 
modulated by the present invention include but are not limited to interferons, interleukins, prolactin, 
and growth hormone. The utility of the polypeptides of the present invention or part thereof may be 
further confirmed using the methods described in WIPO Publication W09928465 which disclosure 

30 is hereby incorporated by reference in its entirety. 

The polypeptides of this invention can be used in a method of inhibiting the activity of 
STAT proteins in a cell in vitro, the method comprising introducing a nucleic acid into the cell, 
wherein the nucleic acid comprises a nucleotide sequence encoding the amino acid sequence of 
SEQ ID NO:256, 257 and 521 or the amino acid sequence of SEQ ID NO:256, 257 and 521 with 

35 one or more conservative amino acid alterations, and wherein the nucleic acid expresses the amino 
acid sequence in an amount and for a time sufficient for the amino acid sequence to specifically 
bind to STAT proteins and to decrease STAT activity, thereby decreasing STAT activity in the cell. 

Suitable compositions of polypeptides or polynucleotides of the present invention are useful 
as a method of treatment of pathologies such as diseases, syndromes, or other undesirable 
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conditions resulting from defects in cell cycle progression. Such cell cycle defects may result from 
defects in the regulation of activated STAT or an upstream factor such as activated JNKs or 
activated cytokine receptors. Alternatively, polypeptides or polynucleotides of the present 
invention may be used in a method of treating pathologies resulting from defects in cell cycle 
5 progression due to defects in a step "downstream" of STAT regulation of cell cycle progression. In 
preferred embodiments, agonists of polypeptides or polynucleotides of the present invention are 
useful in the treatment of pathologies such as but not limited to hyperproliferative diseases such as 
cancer (e.g., leukemia, lymphoma, breast cancer, colon cancer, prostate cancer, Wilms' tumor), 
coronary artery disease, pulmonary vascular obstructive disease, either primary or as a feature of 

10 Eisenmenger's syndrome, and other disorders of abnormal cellular proliferation. Cells to be treated 
include but are not limited to hyperproliferative cells, cancer cells, vascular smooth muscle cells, 
endothelial cells, and gametes. 

In some embodiments of the invention, antagonists of the polypeptides or polynucleotides 
of the present invention are used to stimulate, promote, or facilitate progression through the cell 

1 5 cycle, such as in the cellular regeneration of terminally differentiated cardiac myocytes or tissues, 
e.g., striated muscle myocytes. For example, this could allow restoration of damaged myocardium 
after cardiac injury, myocardial infarction, myocarditis, cardiomyopathy, trauma, as a consequence 
of cardiac surgery, etc., or repletion of striated muscle exhausted by muscular dystrophy. 

In further embodiments, expression of the polypeptides encoded by the nucleic acids is 

20 expected to prevent, ameliorate, or lessen the cell cycle defect of the host cell, or to restore normal 
cell cycle progression of the host cell. Whether provided via nucleic acid or polypeptides delivered 
directly to cells, the therapeutic formulations of the invention can also be used as adjuncts to other 
forms of therapy, including but not limited to chemotherapy, and radiation therapy. 

25 Protein of SEP ID NO:330 (internal designation Clone ID:398703) and related protein of SEP ID 
NP:330. 

The polynucleotides of SEQ ID NP:161 and SEQ ID NP:452 and polypeptides of SEQ ID 
NP:330 encode a novel human deubiquitinating enzyme (GNP:AF0 17306). Deubiquitinating 
enzymes serve a number of functions (Hochstrasser Cur Ppin Cell Biol 4:1024 (1992); Rose, In: 

30 Ubiquitin , Plenum Press, New York (1988)). First, ubiquitin must be cleaved from a set of 

biosyntheticprecursors, which occur either as a series of ubiquitin monomers in head-to-tail linkage 
or as fusions to certain ribosomal proteins (Finley & Chau, Annu Rev Cell Biol 7, 25-69 (1991)). 
Secondly, ubiquitin must be recycled from intracellular conjugates, both to maintain adequate pools 
of free ubiquitin and, in principle at least, to reverse the modification of inappropriately targeted 

35 proteins. Finally, deubiquitinating reactions may be integral to the degradation of ubiquitinated 
proteins by the 26S proteasome, a complex ATP-dependent enzyme whose exact composition and 
range of activities remain poorly characterized (Hershko & Ciechanover, Annu Rev Biochem 61, 
761-807 (1992); Hadari et ah, J Biol Chem 267, 719-727 (1992); Murakami et al., Nature 360, 597- 
9 (1992); Rechsteiner, J. Biol. Chem. 268, 6065-6068 (1993)). 
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An embodiment of the invention includes preferred polypeptides with ubiquitin-specific 
protease activity with a novel N-terminus of Clone ID:398703 comprising the amino acid sequence 
MCTTSLPCPIIMEPWGLATTKAAYVLFYQRRDDEFYKTPSLSSSGSSDGGTRPSSSQQGFGD 
DEACSMDTN encoded by 
5 ATGTGTACGACCTCATTGCCGTGTCCAATCATTATGGAGCCATGGGGGTTGGCCACTAC 
TAAAGCAGCTTATGTGCTATTTTACCAACGTCGAGATGATGAATTTTATAAGACACCTT 
CACTTAGCAGTTCTGGTTCCTCTGATGGAGGGACACGACCAAGCAGCTCTCAGCAGGG 
CTTTGGGGATGATGAGGCTTGC AGCATGGAC ACCAACTAA of SEQ ID NO: 1 6 1 . 
The preferred polypeptides of the invention are those which prevent or reverse 
10 ubiquitination of cellular proteins in vitro or in vivo. Further preferred are polypeptides of the 
invention which prevent or reverse ubiquitination of extracellular proteins in vitro or in vivo. 

The polynucleotides of SEQ ID NO: 161 and SEQ ID NO:452 encode polypeptides of SEQ 
ID NO:330 which contain protein domains or motifs including but not limited to a Protein kinase C 
phosphorylation site comprising the amino acid fragment SAR, and an N-myristylation site 
1 5 comprising amino acid fragment GLNMSE. Further preferred amino acids of SEQ ID NO : 330 
include the Ubiquitin carboxyl-terminal hydrolases family 2 signature or amino acid sequence 
YDLIAVSNHYGAMGVGHY. 

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NO: 161 and SEQ ID NO:452 and polypeptides of SEQ ID NO:330, described throughout the 
20 present application also pertain to the human cDNA of clone 398703, and the polypeptides encoded 
thereby. 

The utility of the polynucleotides and polypeptides of the present invention or part thereof 
may be further confirmed using methods which assess activity or function of deubiquitinating 
enzymes described in United States Patents 5391490 and 5565352 which disclosures are hereby 
25 incorportated by reference in their entireties. 

Proteins of SEP ID NO:277 (internal designation Clone ID:653966) and related protein of SEP ID 
NO:535. 

The polynucleotides of SEQ ID NO: 1 08 and SEQ ID NO:42 1 and polypeptides of SEQ ID 
30 NO:277 and SEQ ID NO:535 encode human liver fatty acid binding protein (L-FABP) comprising 
the amino acid sequence of SEQ ID NO:277 and 535. The amino acid sequence of SEQ ID NO:277 
and 535 are the same as human L-FABP (Genbank accession GNP:M10617; Lowe et al., JBC 
260:3413-17 (1985)) and homologous to human FABP (Genbank accession GNP:M 10050). The 
polypeptides of the present invention belong to the FABP/P2/CRBP/CRABP family of transporters 
35 and functionally binds to free fatty acids and derivatives thereof. L-FABP is normally expressed in 
the cytoplasm of hepatocytes, but preferred embodiments include use of the polypeptides of the 
present invention as extracellular polypeptides. Further preferred embodiments include use of the 
polypeptides of the present invention as serum or plasma polypeptides. Further preferred 
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embodiments use polypeptides of the invention in vitro. Still further preferred embodiments 
include use of the polypeptides of the present invention in vivo. 

Preferred amino acids of the invention include the lipocalin domain, from 2 to 127 or 
polypeptides comprising the amino acid sequence 
5 SFSGKYQLQSQENFEAFMKAIGLPEELIQKGKDIKGVSEIVQNGKHFKFTITAGSKVIQNEFT 
VGEECELETMTGEKVKTVVQLEGDNKLVTTFKNIKSVTELNGDIITNTMTLGDIVFKmSK^ 
I. The polypeptides of SEQ ID NO:277 and SEQ ID NO:535 contain a cytosolic fatty-acid binding 
protein signature comprising the amino acid sequence GKYQLQSQENFEAFMKAI which 
functions in the polypeptides ability to bind small hydrophobic molecules, such as lipids, steroid 
10 hormones, and retinoids. Preferred amino acids of SEQ ID NO:277 and SEQ ID NO:535 include 
GKYQLQSQENFEAFMKAI, MSFSGKYQLQSQENFEAF, and LQSQENFEAFMKAIGLPE. 

Phosphorylation status modulates the activity of L-FABP. Preferred polypeptides of the 
invention include the amino acids sequence comprising the sites of cAMP- and cGMP-dependent 
protein kinase phosphorylation including residues of SEQ ID NO:277 and 535 comprising the 
15 sequence KRIS. 

Further preferred polypeptides of the invention include the amino acid sequence comprising 
the sites of Protein kinase C phosphorylation including residues at positions 4 to 6, 94 to 96, and 
124 to 126 of SEQ IDNO:277 and 535. Still further preferred polypeptides of SEQ IDNO:277 and 
535 include the amino acid sequence comprising SGK, TFK, and SKR. 

20 Further preferred are polypeptides of SEQ ID NO: 277 and 535 include the amino acid 

sequence comprising a Casein kinase II phosphorylation sites. Preferred amino acids of SEQ ID 
NO: 277 and 535 include positions 64 to 67, 100 to 103, and 1 14 to 117. Further preferred amino 
acids comprise the sequences TVGE, SVTE, and TLGD. 

A preferred polypeptide of SEQ ID NO: 277 and 535 is one in which the amino acid 

25 asparagine (Asn) is located at residue 105, further referred to as the N-isoform. Further preferred is 
the polypeptide of SEQ ID NO: 277 and 535 in which the amino acid aspartate (Asp) is located at 
residue 105 further referred to as the D-isoform. The rat homologue of the human D-isoform of the 
present invention was shown to have a greater affinity to lysophospholipids, prostaglandins, 
retinoids, bilirubin and bile salts compared to the rat homologue of the human N-isoform of the 

30 present invention by methods described by DiPietro and Santome, Biochim Biophys Acta 

1478 : 186-200 (2000) which disclosure is hereby incorporated by reference in its entirety. The rat 
homologues share only 82% identity with the of the human D- and N-isoforms, therefore it is not 
predictable to find that the human D-isoform has equal or greater affinity to lysophospholipids, 
prostaglandins, retinoids, bilirubin, bile salts and fatty acid compared to the human N-isoform. 

35 Further preferred polypeptides of the present invention include the D-isoform polypeptide 

and fragments thereof which have an equal or at least 10%, 20%, 30%, 40%, 50%, 60% or 75% 
greater affinity for fatty acids, and lipophilic compounds selected from a group including but not 
limited to lysophospholipids, prostaglandins, retinoids, bilirubin, bile salts, steroid hormones (such 
as testosterone and estradiol), and cholesterol compared to the N-isoform. 
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Another embodiment of the invention includes polynucleotides or polypeptides of the 
invention or fragments thereof which bind lipophilic compounds selected from a group including 
but not limited to free fatty acids, lysophospholipids, prostaglandins, retinoids, bilirubin, bile salts, 
steroid hormones (such as testosterone and estradiol), and cholesterol in serum or plasma. Further 
5 preferred are polypeptides of the invention which bind lipophilic compounds in serum or plasma 
separated from whole blood in a process of purifying serum or plasma for use in vitro or in vivo. 
Further preferred are polypeptides of the invention which bind lipophilic compounds in serum or 
plasma in vivo. 

In a further embodiment, polynucleotides or polypeptides of the invention or fragments 
10 thereof, in physiological appropriate formulations, are useful in the prevention, treatment or 
attenuation of conditions in which lipophilic compounds are elevated in the serum of mammals, 
preferably humans. Such conditions are selected from a group which include but are not limited to 
obesity, hyperlipidemia, hypercholesterolemia, hypertriglyceridemia, diabetes type I (IDDM) 
diabetes type II (NIDDM), atherosclerosis, and hypertension. 
1 5 Mammary-derived growth inhibitor (MDGI) and heart-fatty acid binding protein (FABP) 9 

which belong to the FABP family, specifically inhibit growth of normal mouse mammary epithelial 
cells (MEC) and promote morphological differentiation, stimulates its own expression and promotes 
milk protein synthesis (US Patent 5977309, 24 March 1995). In further preferred embodiments, 
polypeptides of the invention include those which locally signal growth cessation and stimulate 
20 differentiation of the developing epithelium. Further preferred polypeptides of the invention 
suppress the mitogenic effects of EGF family members, and inhibit c-fos, c-myc and c-ras 
expression. 

In a further aspect of the present invention, there is provided a method for producing such 
polypeptide by recombinant techniques comprising culturing recombinant prokaryotic and/or 
25 eukaryotic host cells, containing a human fatty acid binding polypeptides or polynucleotides of the 
invention acid under conditions promoting expression of said protein and subsequent recovery of 
said protein. 

In a further embodiment of the present invention, there is provided a method for utilizing 
such polypeptides, or polynucleotides of the invention for therapeutic purposes, for example, as a 

30 cell growth inhibitor and as to cause differentiation stimulatory activity on various responsive types 
of tissues and cells in vitro. Further preferred are methods for use of polypeptides, or 
polynucleotides of the invention, in appropriate physiological form, for therapeutic purposes for to 
inhibit cell proliferation or to induce cell differentiation in mammals, preferably humans. 

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 

35 NO: 1 08 and SEQ ID NO:42 1 and polypeptides of SEQ ID NO:277 and SEQ ID NO:535, described 
throughout the present application also pertain to the human cDNA of clone 653966, and the 
polypeptides encoded thereby. 
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Proteins of SEP ID NO:313 (internal designation Clone ID:633418), SEP ID NO:314 (internal 

designation Clone ID:422878) and related protein of SEP ID NO:552. 

The polynucleotides of SEQ IDNG:144, 145 and 438 and polypeptides of SEQ IDNP:313, 
314and 552 encode a cleavage stimulation factor important in mRNA processing and protein 
5 expression. Protein kinase C phosphorylation increases activity of said polypeptides and preferred 
amino acids include SEK and SGR. Further, sites of tyrosine kinase phosphorylation increase 
activity of said polypeptides and preferred amino acids of SEQ ID NP:3 14, 552 include 
KKLEENPY. 

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
10 NP:144, 145 and 438 and polypeptides of SEQ ID NO:3 13, 3 14and 552, described throughout the 
present application also pertain to the human cDNA of clones 63341 8 and 422878, and the 
polypeptides encoded thereby. 

Proteins of SEP ID NO:219 (internal designation Clone ID:589848V SEP ID NP:220 (internal 
15 designation Clone ID:21 1883V SEP ID NP:221 (internal designation Clone ID:642603), SEP ID 
NP:222 (internal designation Clone ID: 1933 161 and related protein of SEP 1DNP:497. 

Polynucleotides of SEQ ID NO:50, 51, 52, 53, 380 and polypeptides of SEQ ID NP:219, 
220, 221 and 497 encode RNA associated proteins with a ribosomal L34 domain comprising the 
amino acid sequence NEYQPSNIKRKNKHGWVRRLXTPAGXXXILRRMLKGRKSLSH or 
20 NEYQPSNIKRKNKHGWVRR1.XTPAGVQVILRJRJVILKGRKSLSH. It will be appreciated that 
all characteristics and uses of the polynucleotides of SEQ ID NCs:50, 51, 52, 53, 380 and 
polypeptides of SEQ ID NGs:219, 220, 221 and 497, described throughout the present application 
also pertain to the human cDNA of clones 589848, 21 1883, 642603 and 193316, and the 
polypeptides encoded thereby. 

25 

Proteins of SEP ID NP:302 (internal designation Clone ID: 1000891 255) and related protein of 
SEP IDNP:543. 

Polynucleotides of SEQ ID NP:133 and 429 and polypeptides of SEQ ID NG:302 and 543 
encode human ribosomal protein, hRIBPRT. An embodiment of the invention includes the 

30 compositions of the polypeptides of SEQ ID NP:302 and 543, comprising the amino acid sequence 
MVAAKKTKXSLESIKSRLQLVMKSGKYVLGYKQTLKMIRQGKAKLVILANNCPALRKSEI 
EYYAMLAKTGVHHYSGNNIELGTACGKYYRVCTLA1IDPXDSXIIRSMPEQTGEK, and the 
polynucleotides of SEQ IDNG:133 and 429, respectively, which encode human ribosomal protein, 
hRIBPRT. The polypeptides of the invention contain the ribosomal protein 

35 L30e/L7Ae/S12e/Gadd4 signature (Koonin EV, J Mol Med 75:236-238 (1997) and Nakanishi et aL, 
Gene 35:289-96 (1985)). 

Preferred polypeptides of the invention include the amino acid sequence comprising 
KSLESIKSRLQLVMKSGKWLGYKQTLKM1RQGKAKLVILANNCPALRKSEIEYYAMLAK 
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TGVHHYSGNNIELGTACGKYYRVCTLAIIDPXDSXIIR; 

KSLESIKSRLQLVMKSGKYVLGYKQT^ ; and 

SEIEYYAMLAKTGVHHYSGNNIELGTACGKYYRVCTLAIIDPXDSXIIR- Further preferred 
amino acids of the invention include sites of PKC phosphorylation, comprising the amino acid 
5 sequences of SEQ ID NO:302 and 543 including TKK (positions 7-13); SIK (positions 13-15); 
SGK (positions 24-26); and TLK (positions 34-36). Further preferred amino acids of the invention 
include sites of Casein Kinase II phosphorylation, comprising the amino acid sequences SEIE 
(positions 58-61) and SMPE (positions 107-1 10). 

In another embodiment, the proteins of SEQ ID NO:302 and 543 can be used to bind to 

10 nucleic acids, preferably RNA, alone or in combination with other substances. For example, the 
proteins of the invention or part thereof can be added to a sample containing RNAs in optimum 
conditions for binding, and allowed to bind to RNAs. In a preferred such embodiment, the proteins 
of the invention or part thereof may be used to purify mRNAs, for example to specifically isolate 
RNA, e.g. from a specific cell type or from cells grown under particular conditions. Such RNAs 

1 5 could then be reverse transcribed and cloned, could be analyzed for relative expression analyses, 
etc. In addition, such methods may be used to specifically remove RNA from a sample, for 
example during the purification of DNA. To carry out any of these methods, the proteins of the 
invention or part thereof may be bound to a chromatographic support, either alone or in 
combination with other RNA binding proteins, to form an affinity chromatography column. A 

20 sample containing a mixture of nucleic acids to purify is then run through the column. 

Immobilizing the proteins of the invention or part thereof on a support is particularly advantageous 
for embodiments in which the method is to be practiced on a commercial scale. This 
immobilization facilitates the removal of RNAs from the batch of resin-coupled protein after 
binding, and allows subsequent re-use of the protein. Immobilization of the proteins of the 

25 invention or part thereof can be accomplished, for example, by inserting any matrix binding domain 
in the protein according to methods known to those skilled in the art. The resulting fusion product 
including the proteins of the invention or part thereof is then covalently, or by any other means, 
bound to a protein, carbohydrate or matrix (such as gold, "Sephadex" particles, polymeric surfaces). 
Another embodiment of the present invention relates to methods and compositions using 

30 the proteins of the invention, or part thereof, to associate specific mRNAs to the inner face of lipidic 
bilayers of liposomes in order to further introduce these mRNAs into the cytoplasm of eukaryotic 
cells. Preferably, specific mRNAs are first associated with the protein of the invention and the 
RNA/protein complex formed in that way is then mixed with liposomes according to methods 
known to those skilled in the art. These liposomes are added to an in vitro culture of eukaryotic 

35 cells. In vivo, such a method might treat and/or prevent disorders linked to dysregulation of gene 
transcription such as cancer and other disorders relating to abnormal cellular differentiation, 
proliferation, or degeneration. 

A decrease in ribosome function results in a significant inhibition of cell growth. 
Therefore, in another embodiment, the present proteins and nucleic acids can be used to modulate 
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the rate of cell growth in vitro or in vivo. Accordingly, compounds that inhibits the expression or 
function of the proteins of the invention can be used to inhibit the growth rate of cells, and can thus 
be used, e.g. in the treatment or prevention of diseases or conditions associated with excessive cell 
growth, such as cancer or inflammatory conditions. Such compounds include, but are not limited 
5 to, antibodies, antisense molecules, dominant negative forms of the proteins, and any heterologous 
compounds that inhibit the expression or the activity of the proteins. 

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NO: 133 and 429 and polypeptides of SEQ ID NO:302 and 543, described throughout the present 
application also pertain to the human cDNA of clone 1000891255, and the polypeptides encoded 
1 0 thereby. 

Proteins of SEP IDNO:271 (Internal designation Clone ID:493328), related clones 153261, 
152042, 599054, and 650872 and related protein of SEP ID NO:533. 

1 5 The polynucleotides of SEQ ID Nos: 1 02, 1 03, 1 04, 1 05, and 1 06 and polypeptides of SEQ 

ID Nos:271, 272, 273, 274, 275 and 533 encode for the HUMAN GENSET BINDING PRGTEIN 
or HGBP-1. Preferred polypeptides comprise the amino acid sequence 

MKVKIKCWNGVATWLWVANDENCGICRMAFNGCCPDCKVPGDDCPLVWGQCSHCFHM 
HCILKWLHAQQVQQHCPMCRQEWKFKE. It will be appreciated that all characteristics and 
20 uses of the polynucleotides of SEQ ID Nos:102, 103, 104, 105, and 106 and polypeptides of SEQ 
ID Nos:271, 272, 273, 274, 275 and 533, described throughout the present application also pertain 
to the human cDNA of clones 493328, 153261, 152042, 599054, and 650872, and the polypeptides 
encoded thereby. 

The protein of SEQ ID NG: 271 encoded by the extended cDNA SEQ ID NG: 1 02 is the 
25 same as a hepatocellular carcinoma associated ring finger protein (EMBL AF247565) and Genset 
protein in WOO 100806 (Genpep accession AX061622) with homology to an anaphase-promoting 
complex (APC) subunit from Drosophila (Embl accession number AJ251510). In addition, HGBP- 
1 exhibits the pfam PHD zinc finger signature from positions 33 to 79. 

Zinc binding domains which contain a C3HC4 sequence motif are known as RING domains 
30 (Lovering, R. et al. (1993) Proc. Natl. Acad. Sci. USA 90:21 12-21 16). Zinc finger domains are 
found in numerous zinc binding proteins which are involved in protein-protein and protein-nucleic 
acid interactions. They are independently folded zinc-containing mini-domains which are used in a 
modular repeating fashion to achieve sequence-specific recognition of DNA (Klug 1993 Gene 135, 
83-92). Such zinc binding proteins are commonly involved in the regulation of gene expression, 
35 and usually serve as transcription factors, either by directly affecting transcription or recruiting co- 
activators or co-repressors (see US patents 5,866,325; 6,013,453 and 5,861,495). PHD fingers are 
C4HC3 zinc fingers spanning approximately 50-80 residues and distinct from RING fingers or LIM 
domains. They are thought to be mostly DNA or RNA binding domain but may also be involved in 
protein-protein interactions (for a review see Aasland et al, Trends Biochem Sci 20:56-59 (1995)). 
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HGBP-1 or part thereof is a zinc binding protein, which is able to bind nucleic acids, more 
preferably a transcription factor. Preferred polypeptides of the invention are polypeptides 
comprising the amino acids of SEQ ID NO: 271 from positions 33 to 79. Other preferred 
polypeptides of the invention are fragments of SEQ ID NO: 271 having any of the biological 
5 activity described herein. The nucleic acid binding activity of the protein of the invention or part 
thereof may be assayed using any of the assays known to those skilled in the art including those 
described in US patent 6,013,453. 

The invention relates to methods and compositions using the protein of the invention or part 
thereof to bind to nucleic acids, preferably DNA, alone or in combination with other substances. 
1 0 For example, the protein of the invention or part thereof is added to a sample containing nucleic 
acid in conditions allowing binding, and allowed to bind to nucleic acids. In a preferred 
embodiment, the protein of the invention or part thereof may be used to purify nucleic acids such as 
restriction fragments. 

In another preferred embodiment, HGBP polypeptides or parts thereof may be used to 

15 visualize nucleic acids when the polypeptide is linked to an appropriate fusion partner, or is 
detected by probing with an antibody. Thus, HGBP polypeptides can be used to diagnose... 

Alternatively, the protein of the invention or part thereof may be bound to a 
chromatographic support, either alone or in combination with other DNA binding proteins, using 
techniques well known in the art, to form an affinity chromatography column. A sample containing 

20 nucleic acids to purify is run through the column. Immobilizing the protein of the invention or part 
thereof on a support advantageous is particularly for those embodiments in which the method is to 
be practiced on a commercial scale. This immobilization facilitates the removal of the protein from 
the batch of product and subsequent reuse of the protein. Immobilization of the protein of the 
invention or part thereof can be accomplished, for example, by inserting a cellulose-binding domain 

25 in the protein. One of skill in the art will understand that other methods of immobilization could 
also be used and are described in the available literature. 

In another embodiment, the present invention relates to compositions and methods using the 
protein of the invention or part thereof, especially the zinc binding domain, to alter the expression 
of genes of interest in a target cells. Such genes of interest may be disease related genes, such as 

30 oncogenes or exogenous genes from pathogens, such as bacteria or viruses using any techniques 
known to those skilled in the art including those described in US patents 5,861,495; 5,866,325 and 
6,013,453. 

In a further embodiment, the protein of the invention or part thereof may be used to 
diagnose, treat and/or prevent disorders linked to dysregulation of gene transcription such as cancer 
35 and other disorders relating to abnormal cellular differentiation, proliferation, or degeneration, 
including hyperaldosteronism, hypocortisolism (Addison's disease), hyperthyroidism (Grave's 
disease), hypothyroidism, colorectal polyps, gastritis, gastric and duodenal ulcers, ulcerative colitis, 
and Crohn's disease. The invention relates to methods of diagnosing, treating and/or preventing 
disorders described herein, comprising delivering to a patient, or causing to be present therein, a 
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zinc finger polypeptide which inhibits the expression of a gene enabling the cells to divide. The 
target could be, for example an oncogene or a normal gene, which is overexpressed in the cancer 
cells. 



5 ISPG Iron-sulfur cluster protein (Clone ID: 1000872335) 

The polynucleotides of SEQ ID NOs:43 and 374 encodes the amino acids sequence of SEQ 
ID NOs:212 and 491 respectively, an iron-sulfur protein which mediates electron transfer in 
metabolic reactions, also referred to as ISPG. It will be appreciated that all characteristics and uses 
of the polynucleotides of SEQ ID NOs 43 and 374 and polypeptides of SEQ ID NOs:212 and 491, 

10 described throughout the present application also pertain to the human cDNA of clone 1000872335, 
and the polypeptides encoded thereby. 

ISPG is an iron-sulfur protein that belongs to the broad family of the 2Fe-2S-type 
ferredoxins. The 2Fe-2S-type ferredoxins are proteins or domains of around one hundred amino 
acid residues that bind a single 2Fe-2S iron-sulfur cluster and are found in plants, animals and 

15 bacteria. Iron-sulfur cluster proteins are well known classes of proteins and are recognized as ideal 
devices for accepting, donating, storing and shifting electrons. It will be appreciated by the skilled 
artisan that stuctural aspect of iron-sulfur proteins have been studied extensively (reviewed in 
Beinert et al, Nature 1 Aug 1997, 277:653-659), allowing modifications of the ISPG proteins to fine 
tune its properties for desired uses while retaining its biological function in mediating electron 

20 transfer and possible protein stabilization and iron or sulfide storage functions. The ISPG protein of 
SEQ ID NO 491 comprisese a glycosaminoglycan attachment site at amino acid positios 34 
(SGSG); protein kinase C phosphorylation sites at amino acid positions 14 (SAR), 44 (TTR), and 
86 (SGR); N-myristoylation sites at amino acid positions 1 1 (GGVSAR), 24 (GTXWNR), 31 
(GGTSGS), 39 (GVALGT), 106 (G ACE AS), a cytochrome c family heme-binding site at amino 

25 acid positions 114-119 and an iron-sulfur binding region signature at amino acid positions 108-1 18. 

In view of its role in electron transfer reactions, ISPG is thought to be involved in a wide 
variety of metabolic reactions and disorders, and may be useful in the treatment of disorders of 
metabolism such as obesity, in the detection of toxic compounds, in prediction, diagnosis or 
treatment of conditions or traits related to drug metabolism or in treatments related to the synthesis 

30 of eg. steroid hormones. 

In a preferred example, overexpression or administration of the ISPG protein may be used 
as a therapeutic treatment for obesity by accelerating the metabolic rate of a subject in need of 
treatment. There is accumulating evidence to support the hypothesis that a low-energy-output 
phenotype is at high risk of weight gain and obesity, irrespective of whether this is owing to a low 

35 resting metabolic rate and/or physical inactivity. The low-energy-output phenotype is associated 
with impaired appetite control, which is improved if energy output is increased, serving as the 
background for pharmacologic stimulation of energy expenditure as a tool to improve the results of 
obesity management. The ISPG protein and agonists or stimulators thereof may serve as a means to 
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increase electron transfer and hence the metabolic rate of an individual in a similar goal as 
commonly cited targets such as leptin receptors, the sympathetic nervous system and its peripheral 
beta-adrenoceptors, selective thyroid hormone derivatives, and stimulation of the mitochondrial 
uncoupling proteins. 

5 In addition, iron-sulfur proteins such as ISPG are generally recognized as being capable of 

several functions that are not of an oxidoreductive nature such as the binding and activation of 
substrates at the unique iron site (in the catalytic function of aconitase and relates enzymes), and 
apparently stabilizing radicals in reactions occurring by a free-radical pathway. There is also 
evidence suggesting that such iron-sulfur clusters can function in coupling electron transfer to 

10 proton transport. By binding Cys ligands from different subunits, iron-sulfur clusters effect dimer 
formation, as in the Fe protein of nitrogenase. Further, by straddling protein structural elements, 
iron-sulfur clusters are able to stabilize structures that are required for specific functions (eg. 
endonuclease III of E. coli). Proteins of ISPG's class have also been shown to protect proteins from 
the attack of intracellular proteases. Finally, proteins of ISPG's class are thought to be capable of 

1 5 serving as storage devices for iron and possibly sulfide. 

Thus, in a few examples, ISPG may be used advantageously as an iron or metal biosensor, 
for the treatment and/or diagnosis of iron overload disorders, or in applications involving stabilizing 
target proteins such as for protein production or for mediating protein interactions. 

Additionally, structural aspects of ISPG suggest that it may be capable of mediating steroid 

20 hormone synthesis, either in human or animals, or in engineered cell culture systems for the large 
scale production of hormones. ISPG may be used to act as an adrenal ferredoxin (known as 
adrenodoxin (ADX)), a vertebrate mitochondrial protein which transfers electrons from 
adrenodoxin reductase to cytochrome P450scc, which is involved in cholesterol side chain cleavage. 
Its primary function as a soluble electron carrier between the NADPH-dependent adrenodoxin 

25 reductase and several cytochromes P450 makes it an irreplaceable component of the steroid 
hormones biosynthesis in the adrenal mitochondria of vertebrates. 

DRUG METABOLISM 

Previous studies have revealed that cytochrome P-450 isozymes are responsible for drug 
metabolism, and oxidation by P-450 isozymes is a common aspect of the overall clearance of drugs. 

30 Further studies have revealed that genetic polymorphism of cytochrome P-450 isozymes underlies a 
wide spectrum of substrates specificity in drug oxidation. In certain cases, genetic mutation and/or 
deletion of one critical isozyme gene results in a significant alteration of a phenotype projected on 
substrate specificity. It has been reported that CYP2D6 oxidizes more than 30 drugs (for example, 
M. Eichelbaum et al., Pharmacol. Ther., Vol. 46, pp. 377-, 1990). Many anti-cancer drugs are 

35 known to be oxygenated by cytochrome P450 enzymes to yield metabolites that are cytotoxic or 
cytostatic toward tumor cells. These include several commonly used cancer chemotherapeutic 
drugs, such as cyclophosphamide (CPA), its isomer ifosfamide (IFA), dacarbazine, procarbazine, 
thio-TEPA, etoposide, 2-aminoanthracene, 4-ipomeanol, and tamoxifen (LeBlanc, G. A. and 
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Waxman, D. J., Drug Metab. Rev. 20:395-439 (1989); Ng, S. F. and Waxman D. J., Intl. J. 
Oncology 2:731-738 (1993); Goeptar, A. R., et al., Cancer Res. 54:241 1-2418 (1994); van Maanen, 
J. M., et al., Cancer Res. 47:4658-4662 (1987); Dehal, S. S., et al., Cancer Res. 57:3402-3406 
(1997); Rainov, N. G. ? et al., Human Gene Therapy 9:1261-1273 (1998)). Bioreductive metabolism 
5 that results in drug activation is also catalyzed by cytochrome P450 enzymes for a variety of anti- 
cancer drugs. Examples of such drugs include Adriamycin, mitomycin C, and 
tetramethylbenzoquinone (Goeptar, A. R., et al., Crit. Rev. Toxicol. 25:25-65 (1995); Goeptar, A. 
R. ? et al., Mol. Pharmacol. 44:1267-1277 (1993)). Those who have homozygous alteration in this 
recessive gene, are so-called "poor metabolizers (PMs)" and may suffer from severe side effects due 

10 to poor metabolism of drugs (for example, see M. Eichelbaum et al., Pharmacol. Ther., Vol. 46, pp. 
377-, 1990). Such genetic alterations occur at rates of from 1 to 30% in different ethnic populations 
(for example, L. M. Distlerath et al., J. Biol. Chem., Vol. 260, pp. 9057-, 1985). 

The ISPG protein of the invention is thought to be capable of functioning as a soluble 
electron carrier in the electron transport chain involving one or more of the several available 

1 5 cytochromes P450 enzymes. The ISPG protein may thus be useful in methods of killing neoplastic 
cells involving P450 (and ISPG) gene transfer and the use of bioreductive drugs that are activated 
by cytochrome P450, and in methods for evaluating the susceptibility of a sample compound to 
metabolism with respect to a specific cytochrome P-450 isozyme system. 

Thus, in a first aspect, a drug activation/gene therapy strategy has been developed based on 

20 a cytochrome P450 gene ("CYP M or "P450") in combination with a cancer chemotherapeutic agent 
that is activated through a P450-catalyzed monoxygenase reaction (Chen, L. and Waxman, D. J., 
Cancer Research 55:581-589 (1995); Wei, M. X., et al., Hum. Gene Ther. 5:969-978 (1994); U.S. 
Pat. No. 5,688,773, issued Nov. 18, 1997). Presently known drug-enzyme combinations can utilize 
established chemotherapeutic drugs widely used in cancer therapy. Such methods to obtain 

25 enhanced chemosensitivity have been demonstrated both in vitro and in studies using a 

subcutaneous rodent solid tumor model and human breast tumor grown in nude mice in vivo, and is 
strikingly effective in spite of the presence of a substantial liver-associated capacity for drug 
activation in these animals (Chen, L., et al., Cancer Res. 55:581-589 (1995); Chen, L., et al., Cancer 
Res. 56:1331-1340 (1996)). The P450-based approach also shows significant utility for gene 

30 therapy applications in the treatment of brain tumors (Wei, M. X., et al., Human Gene Ther. 5:969- 
978 (1994); Manome, Y., et al., Gene Therapy 3:513-520 (1996); Chase, M, et al., Nature 
Biotechnol. 16:444-448(1998)). 

Although the P450/drug activation system has shown great promise against several tumor 
types, further enhancement of the activity of this system is needed to achieve clinically effective, 

35 durable responses in cancer patients. This requirement is necessitated by two characteristics that are 
inherent to the P450 enzyme system: (1) P450 enzymes metabolize drugs and other foreign 
chemicals, including cancer chemotherapeutic drugs, at low rates, with a typical P450 turnover 
number (moles of metabolite formed/mole P450 enzyme) of only 10-30 per minute; and (2) P450 
enzymes metabolize many chemotherapeutic drugs with high Km values, typically in the millimolar 
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range. This compares to plasma drug concentrations that are only in the micromolar range for many 
chemotherapeutic drugs, including drugs such as CPA and IFA. Thus, current approaches to P450 
gene therapy may result in intratumoral drug activation at a low absolute rate and under conditions 
that are not saturating with respect to drug substrate. Furthermore, since P450 is expressed at a very 
5 high level in liver tissue, only a very small fraction of the administered chemotherapeutic drug is 
metabolized via the tumor cell P450 gene product using the currently available methods for P450 
gene therapy (Chen, L. and Waxman, D. J., Cancer Res. 55:581-589 (1995)). As described in U.S. 
Patent No. 6,207,648, one enhancement involves introducing a P450 reductase (RED) gene in 
combination with a cytochrome P450 gene (and thus a P450 gene product) into neoplastic cells, the 

1 0 enzymatic conversion of a P450-activated chemotherapeutic drug to its therapeutically active 
metabolites is greatly enhanced within the cellular and anatomic locale of the tumor, thereby 
increasing both the selectivity and efficiency with which neoplastic cells are killed. 

In a preferred embodiment, further enhancements to known prodrug-enzyme strategies may 
be achieved by introducing an ISPG gene into neoplastic cells, either alone or in combination with a 

1 5 P450 gene and/or a P450 reductase gene. Suitable vectors for the introduction and expression of 
said ISTG and P450 genes are known to one of skill in the art. The introduction of the ISPG gene 
and subsequent expression of the ISPG gene product may increase the enzymatic conversion of a 
P450-activated chemotherapeutic drug to its therapeutically active metabolites. Thus, the invention 
comprises a method for killing neoplastic cells comprising: (a) infecting the neoplastic cells with a 

20 vector for gene delivery, the vector comprising an ISPG gene capable of mediating enzymatic 
conversion of a chemotherapeutic agent by a P450 enzyme; (b) optionally infecting the neoplastic 
cells with a vector for gene delivery, the vector comprising a cytochrome P450 gene and/or a gene 
encoding RED; (b) treating the neoplastic cells with a chemotherapeutic agent that is activated by 
the product of the cytochrome P450 gene; and (c) killing the neoplastic cells. 

25 The present invention also provides a reagent composition for use in evaluating drug 

metabolism by a specific cytochrome P-450 isozyme, which comprises a liver microsome lacking 
said specific P-450, said specific P-450 isozyme and a carrier material. The liver microsome may be 
of human source lacking CYP2D6, CYP2C 19, or CYP2A6. The CYP2D6 isozyme, CYP2C 19 
isozyme, and CYP2A6 isozyme to be added may be a recombinant CYP2D6-expressing 

30 microsome, a recombinant CYP2C 19-expressing microsome, or a recombinant CYP2A6- 
expressing microsome. The reagent composition may comprise more than one kind of PM 
microsomes. 

According to the present invention, there can be provided a reagent composition and a 
method for accurately quantitating the contribution of certain P-450 isozymes such as CYP2D6, 
35 CYP2C 19, and CYP2A6 in drug metabolism. The present invention provides a method for 
evaluating the susceptibility of a sample compound to metabolism with respect to a specific 
cytochrome P-450 isozyme, which comprises contacting the sample compound with a reagent 
composition prepared by adding said specific cytochrome P-450 isozyme and an ISPG protein to 
liver microsomes lacking said specific cytochrome P-450 isozyme in a carrier material. ISPG 
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would be useful in order to enhance efficiency of the P-450 isozyme in drug metabolism, thereby 
effectively amplifying the power of the assay to detect the contribution of a particular P-450 
enzyme. In another embodiment the contribution to drug metabolism of a particular P-450 system 
can be assessed by focussing on a particular iron-sulfur protein associated with said specific P-450 
5 system. In this aspect, the method comprises contacting the sample compound with a reagent 
composition prepared by adding said specific ISPG protein to liver microsomes lacking said ISPG 
protein in a carrier material. The method may further comprise (a) incubating a mixture of the 
sample compound and the reagent composition; (b) extraction of the reaction mixture obtained in 
Step (a); and (c) analyzing the reaction products isolated in Step (b). For the purposes of 
10 quantitating the assay, a plurality of the reagent compositions having different amount of the 
specific P-450 isozyme or ISPG protein may be subjected to Step (a) to (c), respectively. For 
example, the specific P-450 isozyme to be used in the method may be selected from CYP2D6, 
CYP2C19, CYP2A6, CYP1A1 and CYP2E1. 

IRON BIOSENSOR 

15 Iron-sulfur clusters have been found to serve as sensors of iron, dioxygen, superoxide ion 

and possibly nitric oxide. Two main mechanisms of sensing have been described. In one example, 
the oxidation [Fe^S^]^ + --> [Fe^S^]2+ by dioxygen provides the signal for activation of a defense 
mechanism against superoxide, as observed with the SoxR protein of E. coli, and thus may serve a 
cytoprotective function (eg. useful for treatment of ischemia, etc.). In an alternative mechanism, 

20 oxidative disassembly or reassembly of a cluster provides the controlling signal as in the FNR 
protein of E.coli. 

In a further detailed example, the ISPG protein of the invention maybe be used as an iron 
biosensor. An example of an iron biosensor is provided in U.S. Patent No. 5,5 16,697 (Kruzel et 
al.). In summary, ISPG can be immobilized in the vicinity of a device to measure the change in pH, 

25 causing a detectable variation of the potential upon the binding of iron. 

In the process of sequestering iron, the sensing element, the ISPG protein is expected to 
release a number of protons of hydrogen (H + ) directly proportional to the atoms of iron bound. The 
release of protons during the binding of iron by ISPG becomes the operative feature which is 
measured by the biosensors. The release of protons causes a change in pH and is measured by an 

30 ion-selective field effect transistor or by pH sensitive paper. A sample containing iron is placed into 
a buffered solution, usually water. The sample may be diluted one or more times. In one 
embodiment of the sensors of the present invention, the release of protons is measured as the 
variation of the potential on the surface of an ion-selective field effect transistor (an ISFET) 
(Reviewed in: Biosensor Technology, edited by Buck et al. and published by Marcel Decker, Inc., 

35 1990, entitled "Solid State Potentiometric Sensors" by Jiri Janata, pp 17-34). In another 
embodiment of the present invention, the protons released upon binding of iron by ISPG are 
detected by the change in pH using pH sensitive paper. Preferably, the iron selective element 
(ISPG) is incorporated in close proximity or integrated with the signal transducer, to give a 
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reagentless sensing system for iron. Since the signal can be amplified, only small quantities of ISPG 
are needed for detection of iron. The ISFET is modified by immobilizing ISPG on the surface of the 
ISFET or by a disposable membrane with immobilized ISPG that is in close proximity to the ISFET 
by attaching the ISPG -modified membrane to the surface of the ISFET. A sample containing iron, 
5 for example a biological sample such as body fluid from a mammal, particularly a human, is then 
contacted with the ISPG -modified ISFET. 

In order to produce an ISPG-modified ISFET as an independent sensor, an existing system 
which uses an ISFET designed to measure pH can be modified. Systems which presently use an 
ISFET to measure pH are the Sentron 2001 pH system, manufactured by Integrated Sensor 

10 Technology, Federal Way, Wash.; the Corning 360i pH system, manufactured by Corning 
Incorporated, Corning N.Y.; or Orion 610 pH system, manufactured by Orion Analytical 
Technology, Inc., Boston, Mass. The modification required to measure the amount of iron in a 
sample is either to place an immobilized layer of ISPG on the ISFET of such a system or, 
alternatively, to provide a ISPG -modified membrane, i.e. a membrane coated with ISPG, which 

1 5 will be in close proximity to the existing ISFET so as to detect the release of protons when the ISPG 
binds iron in a sample and records the change in potential. 

Iron sensitive biosensors (as well as treatments for iron overload disorders) are extremely 
valuable It is estimated that 30,000,000 Americans suffer from different types of iron related 
disorders, including a substantial proportion with profound iron deficiency syndrome. Detection of 

20 bioaccessible iron is one of the most important measurements that doctors can use for early 

detection if iron deficiency, iron overload or other types of immunological disorders. To date iron 
is measured through a combination of blood tests that detect iron and iron binding capacity of 
transferrin, the protein that transports iron through the body. The current technology involves very 
sophisticated instrumentation which make this analysis prohibitively expensive and often requires 

25 qualified personnel to analyze the sample. Therefore, there is a need for the direct assay of iron that 
combines simplicity and economics. ISPG may therefore be advantageously used in development of 
a biosensor for detecting the amount of iron in a sample. 

STEROID BIOSYNTHESIS 

As noted above, ISPG may be used as an adrenal ferredoxin (known as adrenodoxin 
30 (ADX)), a vertebrate mitochondrial protein which transfers electrons from adrenodoxin reductase to 
cytochrome P450scc, which is involved in cholesterol side chain cleavage and is an irreplaceable 
component of the steroid hormones biosynthesis in the adrenal mitochondria. 

In therapeutic embodiments, ISPG may have particular importance in treatment of disorders 
where it is desired to increase the level of steroid hormone synthesis. As P450scc has a critical role 
35 in synthesis of the conversion of cholesterol into pregnenolone, ISPG may be used as a limiter or 
enhancer of steroid synthesis. In but one example, evidence has been shown that cytochrome 
P450scc activity in the human placenta is limited by the supply of electrons to the P450scc. 
Furthermore, Tuckey et al. Eur. J. Biochem. 1999 Jul;263(2): 319-325 have shown that p450scc 
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activity can be increased considerably by adding adrenodoxin reductase and adrenodoxin. Thus, 
ISPG may be useful in the treatment of reproductive disorders by augmenting the electron supply to 
P450scc, and thus increasing the level of progesterone synthesis. Accordingly, in another example, 
ISPG may be used to limit steroid synthesis, whether for therapeutic or for research uses. 
5 ISPG may also be used in biological steroid synthesis processes for the production of 

steroid hormones. For example, Duport et al, Nat Biotechnol 1998 Feb; 16(2): 1 86-9 report a system 
for self-sufficient biosynthesis of pregnenolone and progesterone in engineered yeast wherein the 
first two steps of the steroidogenic pathway were reproduced in Saccharomyces cerevisiae. 
Engineering of sterol biosynthesis by disruption of the delta 22-desaturase gene and introduction of 
10 the Arabidopsis thaliana delta 7-reductase activity and coexpression of bovine side chain cleavage 
cytochrome P450, adrenodoxin, and adrenodoxin reductase, lead to pregnenolone biosynthesis from 
simple carbon source. As ISPG is thought to be capable of functioning as an adrenodoxin protein, 
ISPG may be used as a function substitute in the system of Duport et al for adrenodoxin. 



15 MTG (METALLOTHIONEIN) (Clone ID:654627^ 

SEQ IDNOS 96 and 413 and clone FL1 1 :654627_1 82-5-3-0-F 1 0-F encode the polypeptide 
of SEQ ID NOs:265 and 527 respectively, a metallothionein protein which binds heavy metal. Said 
polypeptide of the invention is also referred herein as MTG. It will be appreciated that all 
characteristics and uses of the polynucleotides of SEQ ID NOs:96 and 413 and polypeptides of SEQ 

20 ID NO:265 and 527, described throughout the present application also pertain to the human cDNA 
of clone 654627, and the polypeptides encoded thereby. 

Metal loth ioneins (MT) [1,2,3] are small proteins which bind heavy metals such as zinc, 
copper, cadmium, nickel, etc., through clusters of thiolate bonds. MPs occur throughout the 
animal kingdom and are also found in higher plants, fungi and some prokaryotes and are thought to 

25 play a role in metal detoxification or in the metabolism and homeostasis of metals. On the basis of 
structural relationships MT's have been subdivided into three classes. Class I includes mammalian 
MPs as well as MT's from crustacean and molluscs, but with clearly related primary structure. 
Class II groups together MT's from various species such as sea urchins, fungi, insects and 
cyanobacteria which display none or only very distant correspondence to class I MT's. 

30 Class 111 MT's are atypical polypeptides containing gamma-glutamylcysteinyl units. 

Vertebrate class I MT's such as the MTG protein of the invention are proteins of typically 
60 to 68 amino acid residues, 20 of these residues are cysteines that bind to 7 bivalent metal ions. 
As a signature pattern we chose a region that spans 19 residues and which contains seven of the 
metal-binding cysteines, this region is located in the N-terminal section of class-I MT's. A 

35 consensus pattern for class I MT's is as follows: C-x-C-[GSTAP]-x(2)-C-x-C-x(2)-C-x-C-x(2)-C-x- 
K. 

The MTG protein of SEQ ID NO 527 has a metallothionein domain (Prosite ref. PS00203) 
at amino acid positions 13-3 1 ; an N-glycosylation site at position 4 (NCSC), a protein kinase C 
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phosphorylation site at amino acid positions 18 (SCK), 28 (SCK) and 55 (SQR); a casein kinase II 
phosphorylation site at amino acid position 41 (TLVD); an N-myristoylation site at amino acid 
positions 10 (GVSCTC); and a prokaryotic membrane lipoprotein lipid attachment site at amino 
acid position 3 (PNCSCAAGVSC). 

5 THERAPEUTICS 

Discovery of new proteins related to metallothioneins, and the polynucleotides that encode 
them, satisfies a need in the art by providing new diagnostic or therapeutic compositions useful in 
diagnosing and treating heavy metal toxicity, cancer, inflammatory disease and immune disorders. 

Acute or chronic exposure to heavy metals such as lead, arsenic, mercury or cadmium leads 

10 to a variety of diseases and disorders involving neuromuscular, CNS, cardiovascular, and 

gastrointestinal effects. MTs may play a role in the prevention or alleviation of these conditions. In 
addition, MTs are transcriptionally regulated by glucocorticoids, which suggests that MTs have a 
direct role in the effects of glucocorticoids to treat inflammatory disease, immune disorders, and 
cancer. It is therefore thought that MTG may have important applications in the treatment of 

15 inflammatory disease, immune disorders, and cancer as well as in cytoprotection in a variety of 
therapeutic applications. 

In one preferred example, the MTG nucleic acids and protein may be used for suppressing 
the production of sunburn cells which is applicable in various manners with minimal adverse side 
effects, a method of inducing metallothionein, a method of treating skin diseases and a method of 

20 screening ultraviolet rays, and further relates to cosmetic compositions and UV screening 
compositions. 

Conventionally, steroids and zinc oxide formulations have been topically used as medicines 
for treating skin diseases such as dermatitis, sunburn, neurodermatitis, eczema and anogenital 
pruritus. Steroids, however, have been difficult to administer in large quantities for a prolonged 

25 period due to their strong adverse side effects. Zinc oxide formulations, which have local astringent 
action, involve problems with respect to the manufacture of pharmaceuticals, since they are 
insoluble in water and are not usually administered internally. 

Zinc, one of the indispensable trace metals in the living body, is known to participate in the 
development of sexual organs, promotion of wound healing and is also known to be a component of 

30 a metal loenzy me, an accelerator for dehydrogenase, and to have various functions such as activating 
the immune system. Zinc is further known to be an inducing factor of metallothionein (MT), a 
metal-combining protein. It is reported that MT functions as a scavenger of free radicals which are 
generated at the onset of inflammations ["Dermatologica", Hanada, k., et a!., 179 (suppl. 1) 143 
(1989)]. 

35 As proposed in U.S. Patent No. 5,582,817 (Otsu et al), MTG may be useful in treatment of 

dermatological inflammations caused by external irritative stimulants, such as sunburn or the like, 
where MTG could act to quench the free radicals released from leukocytes, especially granulocytes 
which gather at the inflamed region, and thereby exhibit an anti-oxidation action to diminish cell 
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damage, especially to normal lymphocytes, to activate the immune system and further to prevent the 
accelerated aging of the skin. Formation of sunburn cells (SBCs) could be suppressed by 
administering zinc for inducing MTG to be present, or to increase MTG in the epidermal keratinous 
layer. Anti-oxidation action of MTG can also be useful in the treatment of skin problems resulting 
5 from radiation therapy by X rays, alpha rays, beta rays, gamma rays, neutron rays and accelerated 
electron rays. 

Various zinc compounds have been studied by Otsu et al (supra) with respect to their 
pharmacological activities, who reported that zinc salts or zinc complexes of a certain compound 
have an excellent action of inducing metallothionen (MT) and suppressing sunburn cell (SBC) 
10 production due to UV rays, and thereby useful as components of cosmetic compositions or 

medicines for purposes of ameliorating sunburn, preventing sunburn, ameliorating sufferings from 
skin diseases and ameliorating other radiation induced disorders, leading to completion of the 
invention. 

There are two different types of dermatological reactions caused by sunlight, one is an acute 
1 5 inflammatory change in the skin called sunburn, and the other is a subsequent melanin pigmentation 
called suntan. The light having a wave length in the range of 320 nm or less, called UVB, induces 
sunburn and is responsible for erythematous change. The erythemic reaction caused by UV rays, as 
opposed to a burn injury, does not occur immediately after the exposure to the sunlight, but rather 
occurs after a latent period of several hours. When sunburned skin is histopathologically examined, 
20 various degrees of inflammatory changes are recognized in the epidermis and dermis depending on 
the dose of radiation. Among such changes, a notable one is the generation of so-called sunburn 
cells (SBC) in the epidermis. A histologically stained tissue sample presents strongly and 
acidophilically stained cells which have pyknotic nuclei. This phenomenon indicates the necrosis of 
epidermal cells ("Fragrance Journal", 9, 15-20 (1991). In order to prevent sunburn, para- 
25 aminobenzoic acid derivatives, cinnamic acid derivatives or the like UV absorbers mentioned above 
are used, but their UV absorbing effects are not necessarily satisfactory. What is more, they raise 
problems of cumbersome handling upon use, poor stability, low compatibility with other 
components of the composition, and also involve unsolved problems in water-resistance and oil- 
resistance. 

30 In the field of medicines for the treatment of skin diseases, development of medicines 

which have minimal adverse side effects, and which have novel functions obtainable by both 
external and internal administrations has been desired. Also, in the field of the therapy and 
prevention of radiation disorders, medicines which can suppress and cure the disorders caused by 
oxidative reactions have been desired. Lastly, in the field of the manufacture of cosmetics, 

35 cosmetics which overcome the above-mentioned problems such as handling upon use and stability 
of the composition have been desired. Accordingly, the present invention encompasses providing 
therapeutic agents for treating skin diseases having the above-mentioned characteristics, wherein 
said agents are capable of inducing MTG for suppressing the formation of sunburn cells, and for use 
in cosmetic compositions. Also encompassed are methods of screening for therapeutic agents for 
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treating skin diseases comprising bringing a test compound into contact with a cell, tissue or animal 
model of disease, and detecting induction of MTG expression or function. 

GENE EXPRESSION SYSTEMS 

The MTG nucleic acid and proteins of the invention may also be advantageously used in the 
5 production of recombinant proteins as biopharmaceutical products at commercial scale. 

Previously, genes have been extensively expressed in mammalian cell lines, particularly in 
mutant Chinese Hamster Ovary (CHO) cells deficient in the dihydrofolate reductase gene (dhfr) as 
devised by the method of Urlaub et al, PNAS U.S.A. 77, 4216-4220, 1980. A variety of expression 
systems have been used. Many vectors for the expression of genes in such cells are therefore 

10 available. Typically, the selection procedures used to isolate cells transformed with the expression 
vectors rely on using methotrexate to select for transformants in which both the dhfr and the target 
genes are coamplified. The dhfr gene, which enables cells to withstand methotrexate, is usually 
incorporated in the vector with the gene whose expression is desired. Selection of cells under 
increasing concentrations of methotrexate is then performed. This leads to amplification of the 

1 5 number of dhfr genes present in each cell of the population, as cells with higher copy numbers 

withstand greater concentrations of methotrexate. As the dhfr gene is amplified, the copy number of 
the gene of interest increases concomitantly with the copy number of the dhfr gene, so that 
increased expression of the gene of interest is achieved. Unfortunately, these amplified genes have 
been reported to be variably unstable in the absence of continued selection (Schimke, J. Biol. Chem. 

20 263, 5989-5992, 1988). This instability is inherent to the presently available expression systems of 
CHO dhfr.sup.- cells. For many years, several promoters have been used to drive the expression of 
the target genes such as the SV40 early promoter, the CMV early promoter and the SR.alpha. 
promoter. The CMV and SR.alpha. promoters are claimed to be the strongest (Wenger et al, Anal. 
Biochem. 221,416-418, 1994). 

25 In one report, the .beta.-interferon promoter has also been used to drive the expression of 

the .beta.-interferon gene in the mutant CHO dhfr.sup.- cells (U.S. Pat. No. 5,376,567). In this 
system, however, the selected CHO dhfr cells had to be superinduced by the method of Tan et al 
(Tan et al, PNAS U.S.A. 67, 464-471, 1970; Tan et al, U.S. Pat. No. 3,773,924) to effect a higher 
level of .beta.-interferon production. In this system a significant percentage of the superinduced 

30 .beta.-interferon produced by the CHO dhfr- cells was not glycosylated. The mouse metallothionein 
gene (mMTl) promoter has also been used for the expression of beta-interferon genes in CHO cells, 
BHK and LTK.sup.- mouse cells (Reiser et al 1987 Drug Res. 37, 4, 482-485). However, the 
expression of .beta.-interferon with this promoter was not as good as the SV40 early promoter in 
CHO cells. Further, .beta.-interferon expression from these cells mediated by the mMTl promoter 

35 was inducible by heavy metals. Heavy metals are however extremely toxic to the cells and this 
system was therefore abandoned. Instead, Reiser et al used the CHO dhfr- expression system in 
conjunction with the SV40 early promoter (Reiser et al, Drug Res. 37,4, 482-485 (1987) and EP-A- 
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0529300) to produce .beta.-interferon in CHO dhfr- cells as derived by the method of Urlaub et al 
(1980). 

As described in U.S. Patent No. 6,207,146 (Tan et al) beta.-interferon was expressed in 
wild-type CHO cells using a metallothionein based system. MTG may thus be used in similar 
5 applications so as to provide a system for expression of recombinant proteins. Tan et al 

demonstrates wild-type CHO cells transfected with a vector comprising a .beta.-interferon gene 
under the control of a mouse sarcoma viral enhancer and mouse metallothionein promoter (MSV- 
mMTl), a neo gene under the control of promoter capable of driving expression of the neo gene in 
both E. coli and mammalian cells and a human metallothionein gene having its own promoter. 

10 Transfected cells capable of expressing .beta.-interferon were selected by first exposing cells to 
geneticin (antiobiotic G41 8) and thus eliminating cells lacking the neo gene and then exposing the 
surviving cells to increasing concentrations of a heavy metal ion. 

The heavy metal ion enhanced the MSV-mMTl promoter for the .beta.-interferon gene, 
thus increasing .beta.-interferon expression. The heavy metal ion also induced the human 

15 metallothionein gene promoter, causing expression of human metallothionein. The human 

metallothionein protected the cells against the toxic effect of the heavy metal ion. The presence of 
the heavy metal ion ensured that there was continual selection of cells which had the transfecting 
vector, or at least the .beta.-interferon gene and the human metallothionein gene and their respective 
promoters, integrated into their genome. 

20 The selected cells that had been successfully transfected expressed .beta.-interferon. 

Expression was surprisingly improved when the cells were cultured in the presence ofZn 2 + The 
.beta.-interferon had improved properties, in particular a higher bioavailability, than prior .beta.- 
interferons. 

These findings have general applicability and suggest that the MTG gene of the present 
25 invention may be used accordingly in expression systems. Accordingly, the present invention 
provides a nucleic acid vector comprising: 

(i) a coding sequence which encodes a protein of interest and which is operably linked to a 
promoter capable of directing expression of the coding sequence in a mammalian cell in the 
presence of a heavy metal ion; (ii) a first selectable marker sequence which comprises an MTG 
30 gene of the invention and which is operably linked to a promoter capable of directing expression of 
the MTG gene in a mammalian cell in the presence of a heavy metal ion; and optionally (iii) a 
second selectable marker sequence which comprises a neo gene and which is operably linked to a 
promoter capable of directing expression of the neo gene in a mammalian cell; 

35 CDPG (GLYCOSYL PHOSPH ATID YLINOSITOL-LIN KED GLYCOPROTEIN) (Clone 
ID: 10009029 17) 

SEQ ID NOS 3 and 341 and clone FL1 1 : 10009029 17_223-52-4-0-G3-F encode the 
polypeptide of SEQ ID NOS 172 and 458 respectively, a glycosyl phosphatidylinositol-linked 
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glycoprotein protein which is thought to be a signal transducing polypeptide expressed in lymphoid, 
myeloid, and erythroid cells. Said polypeptide comprises a CD24 signal transducing domain as 
well as a GPI-anchor of the invention is also referred herein as CDPG. CDPG is believed to be 
highly glycosylated, and it is expected that CDPG molecular weight will vary among cell types and 
5 cell developmental stage due to differences in glycosylation patterns, providing further specificity in 
its use as a therapeutic target. It will be appreciated that all characteristics and uses of the 
polynucleotides of SEQ ID NOs: 3 and 341 and polypeptides of SEQ ID NO: 172 and 458, 
described throughout the present application also pertain to the human cDNA of clone 1000902917, 
and the polypeptides encoded thereby. 

10 It is suggested that CDPG may have a specific role to play in early thymocyte development. 

The CDPG protein is thought to be extensively o-glycosylated may be capable of modulating b-cell 
activation responses. As a signaling transducer, the CDPG polypeptide's signal transducing 
function may in some embodiments be triggered by the binding of a lectin-like ligand to the CD24 
domain carbohydrates, and the release of second messengers allowing signaling. The CDPG 

1 5 polypeptide is thought to have important functions in regulating the differentiation and/or growth of 
lymphoid, myeloid, and erythroid cells, including specifically promoting antigen dependent 
proliferation of b-cells, and preventing their terminal differentiation into antibody-forming cells. 

Additionally, based on a growing body of evidence characterizing the CD24 domain 
function, it is proposed that CDPG may have a role as a potent stimulator of neurite outgrowth, and 

20 thus may be useful in the treatment of central nervous system disorders. 

Fragments of CDPG may also be useful, eg. GPI-anchor domain for example in the 
development of soluble T-cell receptors (U.S. Patent No. 6,080,840) or any suitable application 
where a temporally controlled solubilization of a protein of interest is desired. For example, a 
fragment comprising the CDPG GPI-anchor domain can be used in the production of soluble 

25 molecules by replacing the transmembrane domains of the cDNA of a protein of interest with a 
sequence comprising the CDPG glycosylphosphatidyl inositol (GPI) linkage. These chimeric 
cDNAs are then transferred into an expression vector containing a strong promoter and a mutant 
(e.g. DHFR) gene allowing high levels of transcriptional expression and amplification of the gene. 
These chimeric genes are then cotransfected into a selected cell type, preferably lacking the 

30 endogenous protein of interest, and transfectants are selected and can also be screened with 

antibodies for the protein of interest. These GPI linked proteins of interest can then be solubilized 
by cleavage with the enzyme phosphatidyl inositol specific phospholipase C (PI-PLC) and 
purified/concentrated from the supernatant (e.g. by passage over a protein of interest-reactive 
antibody affinity column). 

35 THERAPEUTICS 

CDPG or inhibitors of CDPG may be used in the treatment of any disorder where it is 
desired to regulate B-cell proliferation or differentiation. CDPG or inhibitors thereof may be useful 
in the treatment of B-cell neoplasms, a heterogeneous group of diseases characterized by different 
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maturation states of the B-cell, which are related to the aggressiveness of the disorder. Chronic 
lymphocytic leukemia (CLL) is characterized by proliferation and accumulation of B-lymphocytic 
leukemia (BLL) is characterized by proliferation and accumulation of B-lymphocytes that appear 
morphologically mature but are biologically immature. This disorder accounts for 30% of leukemias 
5 in Western countries. The disorder is characterized by proliferation of biologically immature 
lymphocytes, unable to produce immunoglobulins, which cause lymph node enlargement. As a 
regulator of B-cell proliferation and differentiation, CDPG and/or inhibitors of CDPG may be 
useful for inhibiting proliferation of leukemic B-cells in CLL patients. 

CDPG may also be useful in the modulation of cell growth in the CNS. CD24 is known to 
10 be highly expressed in neurons and has been demonstrated as capable of inhibiting neurite 

outgrowth of dorsal root ganglion neurons while promoting neurite outgrowth of cerebellar neurons 
via interaction with an LI protein. 

SELECTABLE CELL MARKERS 

In one aspect, the CDPG polypeptide may be used as a selectable cell marker and to a 

15 method of using the selectable marker to identify a cell. Viruses such as recombinant retroviruses 
have been used as a vehicle for gene transfer based on their potential for highly efficient infection 
and non-toxic integration of their genome into a wide range of cell types. The transfer of exogenous 
genes into mammalian cells may be used, for example in gene therapy to correct an inherited or 
acquired disorder through the synthesis of missing or defective gene products in vivo. The 

20 expression of exogenous genes in cells may be useful in somatic gene therapy, to correct hereditable 
disorders at the level of the gene. Hemopoietic stem cells are particularly suited to somatic gene 
therapy as regenerative bone marrow cells may be readily isolated, modified by gene transfer and 
transplanted into an immunocompromised host to reconstitute the host's hemopoietic system. 

Gene therapy involving hone marrow transplant with recombinant primary hemopoietic 

25 stem cells requires efficient gene transfer into the stem cells. As a very small number of primary 
stem cells can reconstitute the entire host hemopoietic system it is important that the transferred 
gene be efficiently expressed in the recombinant stem cells transferred. The transfer of foreign 
genes into a reconstituted host hemopoietic system has been limited by the availability of a 
selectable marker which permits the rapid and non-toxic selection of cells which are efficiently 

30 expressing the transferred gene. Currently available selection markers may not be suitable for 
primary hemopoietic stem cells since they may alter the proliferative ability or biological 
characteristics of the cells. The transfer of foreign genes into a reconstituted host hemopoietic 
system has also been limited by the availability of a viral vector capable of expression in 
hemopoietic stem cells, especially where more than one transcriptional unit is present in the vector 

35 (Botrell, D. R. L. et al., 1987, Mol. Biol. Med. 4:229). 

U.S. Patent No. 5,804,177 (Humphries et al) has demonstrated that the cell surface protein 
CD24 (also M1/69-J1 Id heat stable antigen) can be used as a dominant marker in a recombinant 
viral vector. A nucleotide sequence encoding the cell surface protein CD24 in a recombinant viral 
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vector was used to infect hematopoietic stem cells and cells infected with the recombinant viral 
vector were rapidly and non-toxically selected for in vitro using fluorescence activated cell sorting 
(FACS), demonstrating a good correlation between proviral copy number and expression of 
selectable marker. 

5 CD24 is a signal transducing molecule found on the surface of most human B cells that can 

modulate their responses to activation signals, and is structurally closely related to CDPG. The 
CD24 CDNA (approximately 300 bps) has been cloned (Kay, R. et al, 1991, J. Immunol. 147:1412) 
and encodes a mature peptide of only 3 1 to 35 amino acids that is extensively glycosylated and 
attached to the outer surface of the plasma membrane by a glycosyl phosphatidylinositol lipid 

10 anchor. M1/69-J1 Id heat stable antigen is a genetically similar homologous murine peptide widely 
expressed on a variety of hemopoietic cell types (Kay, R. et al., 1990, J. Immunol. 145:1952). 

It is thus proposed that a recombinant viral vector can be used to successfully transfer and 
express the CDPG gene in primitive hemopoietic stem cells such that they are able to repopulate 
lethally irradiated recipients. Preferably foreign CDPG antigen expression in repopulated animals 

15 persists post transplantation such that the biological function of the repopulated hemopoietic cells is 
not affected by the expression of the CDPG antigen. CDPG may subsequently be found to be 
expressed in any or all of hemopoietic lineages including granulocytes, macrophages, pro- 
erythrocytes, erythrocytes and T and B lymphocytes. Therefore, the cell surface protein CDPG may 
be particularly useful as a marker for hematopoietic stem cells capable of repopulation in vivo and 

20 as a selectable marker in gene therapy. The recombinant viral vectors also have the advantage that 
the nucleotide sequence encoding the marker is very small, leaving a large amount of space for the 
insertion of additional genes of interest such as those coding for exogenous genes. 

In a preferred embodiment of the invention a recombinant viral vector is used to introduce 
the nucleotide sequence into the cell. Preferably, the CDPG nucleotide sequence is operatively 

25 linked to one or more regulatory elements. The recombinant viral vector of the invention may be 
used as a marker for an exogenous gene to be expressed in a host cell. The invention further 
provides a method of identifying a cell and progeny thereof comprising: providing a cell; infecting 
the cell with a recombinant viral vector of the invention under suitable conditions to allow 
expression of the cell surface protein CDPG on the cell; and, identifying the cell and progeny 

30 thereof by detecting expression of the cell surface protein CDPG on the cell or progeny thereof. 
Cells infected with a recombinant viral vector of the invention and expressing the cell surface 
protein may be transplanted into a host, and the cell and progeny thereof may be identified after 
transplantation by removing biological samples from the host, and assaying for cells expressing the 
cell surface protein. A recombinant viral vector of the invention may be directly introduced into a 

35 host. 

ENRICHING STEM CELL COMPOSITIONS 

As it is proposed that CDPG is involved in early thymocyte development and is found on 
the cell surface due to its GPI-anchor, CDPG nucleic acids and polypeptides of the present 
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invention may also be used to obtain novel antibody compositions useful for preparing cell 
preparations containing human hematopoietic cells. 

There is a continued interest in developing stem cell purification techniques. Pure 
populations of stem cells will facilitate studies of hematopoiesis. Transplantation of hematopoietic 
5 cells from peripheral blood and/or bone marrow is also increasingly used in combination with high- 
dose chemo- and/or radiotherapy for the treatment of a variety of disorders including malignant, 
nonmalignant and genetic disorders. Very few cells in such transplants are capable of long-term 
hematopoietic reconstitution, and thus there is a strong stimulus to develop techniques for 
purification of hematopoietic stem cells. Furthermore, serious complications and indeed the success 
10 of a transplant procedure is to a large degree dependent on the effectiveness of the procedures that 
are used for the removal of cells in the transplant that pose a risk to the transplant recipient. Such 
cells include T lymphocytes that are responsible for graft versus host disease (GVHD) in allogenic 
grafts, and tumour cells in autologous transplants that may cause recurrence of the malignant 
growth. 

15 Hematopoietic cells have been separated on the basis of physical characteristics such as 

density and on the basis of susceptibility to certain pharmacological agents which kill cycling cells. 
The advent of monoclonal antibodies against cell surface antigens has greatly expanded the 
potential to distinguish and separate distinct cell types. There are two basic approaches to separating 
cell populations from bone marrow and peripheral blood using monoclonal antibodies. They differ 

20 in whether it is the desired or undesired cells which are distinguished/labeled with the antibody(s). 
In positive selection techniques the desired cells are labeled with antibodies and removed from the 
remaining unlabeled/unwanted cells. In negative selection, the unwanted cells are labeled and 
removed. Antibody/complement treatment and the use of immunotoxins are negative selection 
techniques, but FACS sorting and most batch wise immunoadsorption techniques can be adapted to 

25 both positive and negative selection. In immunoadsorption techniques cells are selected with 
monoclonal antibodies and preferentially bound to a surface which can be removed from the 
remainder of the cells e,g. column of beads, flasks, magnetic particles. Immunoadsorption 
techniques have won favor clinically and in research because they maintain the high specificity of 
targeting cells with monoclonal antibodies, but unlike FACSorting, they can be scaled up to deal 

30 directly with the large numbers of cells in a clinical harvest and they avoid the dangers of using 
cytotoxic reagents such as immunotoxins, and complement. 

Current positive selection techniques for the purification of hematopoietic stem cells target 
and isolate cells which express CD34. However, positive selection procedures suffer from many 
disadvantages including the presence of materials such as antibodies and/or magnetic beads on the 

35 CD34 4 " cells, and damage to the cells resulting from the removal of these materials. 

Negative selection has been used to remove minor populations of cells from clinical grafts. 
These cells are either T-cells or tumour cells that pose a risk to the transplant recipient. The 
efficiency of these purges varies with the technique and depends on the type and number of 
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antibodies used. Typically, the end product is very similar to the start suspension, missing only the 
tumor cells or T-cells. 

As described in U.S. Patent No. 5,877,299, Thomas et al developed a negative selection 
technique that uses an antibody composition containing antibodies specific for glycophorin A, CD3, 
5 CD24, CD1 6, CD14 and optionally CD45RA, CD36, CD2, CD19, CD56, CD66a, and CD66b, 
which reportedly gave a cell preparation highly enriched for human hematopoietic and progenitor 
cells. Maximum enrichment of early progenitor and stem cells (CD34 + , CD38" cells) was observed 
when anti-CD45R and anti-CD36 were included in the antibody composition. However, as CDPG 
is proposed as acting in early thymocyte development, CDPG may be used advantageuously to 
10 develop more effective antibody compositions for selecting hematopoietic stem cells. Accordingly, 
the invention encompasses antibodies specific for CDPG polypeptides of the invention and antibody 
compositions comprising, consisting of or consisting essentially of antibodies specific for CDPG, 
glycophorin A, CD3, CD24, CD16, CD14 and optionally CD45RA, CD36, CD2, CD19, CD56, 
CD66a, and CD66b. 

1 5 Use of the antibody composition comprising CDPG in a negative selection technique to 

prepare a cell preparation which is enriched for hematopoietic stem cells and progenitor cells may 
offer significant advantages over conventional techniques. The antibody composition is applied in 
one step to a sample of peripheral blood, bone marrow, cord blood or frozen bone marrow, 
preferably without additional enrichments steps which could result in loss of, or damage to, 

20 progenitor and stem cells. 

PRSG (PROLINE-RICH CALCIUM-BINDING PROTEIN) and HSTG (BASIC HISTIDINE RICH 
SALIVARY GLAND PEPTIDE) (Clone ID:3381 12 and 1000839315 respectively) 

SEQ ID NOS 22 and 358 and clone FL1 1 : 10008393 1 5_220-26-l-0-F3-F encode the 
25 polypeptide of SEQ ID NOS 191 and 475 respectively, a basic histidine rich salivary gland peptide 
referred to herein as HSTG and expected to have potent antimicrobial properties. Preferably, the 
amino acid sequence of HSTG (SEQ ID NO 475) comprises a tyrosine at amino acid position 40. 
The HSTG protein also comprises a Pattern-DE: Protein kinase C phosphorylation site at amino 
acid position 51 (SSK). It will be appreciated that all characteristics and uses of the polynucleotides 
30 of SEQ ID NOs: 22 and 358 and polypeptides of SEQ ID NO: 1 91 and 475, described throughout 
the present application also pertain to the human cDNA of clone 1000839315, and the polypeptides 
encoded thereby. 

SEQ ID NOS 8 and 345 and clone FL1 1:3381 12_174-l-l-0-A1 1-F encode the polypeptide 
of SEQ ID NOS 1 77 and 462 respectively, a proline-rich protein referred to herein as PRSG 
35 believed to be a component of saliva and a calcium binding protein also possessing potent 
antimicrobial properties. Preferably, the amino acid sequence of PRSG (SEQ ID NO 462) 
comprises a proline residue at amino acid position 96, an arginine residue at amino acid position 
100, a glutamine residue at amino acid positoin 102, and/or a glycine residue at amino acid position 
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103. The PRSG protein also comprises a casein kinase II phosphorylation site at amino acid 
positions 15 (SAQD), 24 (SQED), and 59 (SAGD) and an N-myristoylation site at amino acid 
position 52 (GGQQSQ). It will be appreciated that all characteristics and uses of the 
polynucleotides of SEQ ID NOs:8 and 345 and polypeptides of SEQ ID NO: 1 77 and 462, 
5 described throughout the present application also pertain to the human cDNA of clone 3381 12, and 
the polypeptides encoded thereby. 

A study of saliva and its tooth-protective components reveals at least four important 
functions of saliva: (1) buffering ability, (2) a cleansing effect, (3) antibacterial action, and (4) 
maintenance of a saliva supersaturated in calcium phosphate. Several salivary constituents serve one 

10 or more of these functions. Research has yielded important information about organic and inorganic 
secretory products. It is also clear that saliva as a unique biologic fluid has to be considered in its 
entirety to account fully for its effects on teeth. Saliva is greater then the sum of its parts. One 
reason for this is that salivary components display redundancy of function, each often having more 
than one function. This redundancy, however, does not imply that proteins that share functional 

1 5 roles all contribute to the same degree. For instance, when comparing proteins that inhibit calcium 
phosphate precipitation, statherin and acidic proline-rich proteins are most potent, whereas histatins, 
cystatins, and mucins appear to play lesser roles. The complex interaction between proteins is 
another major factor contributing to saliva's function. In this regard, heterotypic complexes of 
various proteins have been shown to form on hydroxyapatite. Mucin binding to other salivary 

20 proteins, including proline-rich proteins, histatins, cystatins, and statherin, is well documented. The 
complexes, whether adsorbed to the tooth surface or in saliva, have important implications for 
bacterial clearance, selective bacterial aggregation on the tooth surface, and control of 
mineralization and demineralization. Finally, proteolytic activity of saliva generates numerous 
products whose biologic activities are often different from their parent compounds. The ability of 

25 saliva to deliver fluoride to the tooth surface constantly makes salivary fluoride an important player 
in caries protection largely by promoting remineralization and reducing demineralization. Saliva is 
well adapted to protection against dental caries. Saliva's buffering capability; the ability of the saliva 
to wash the tooth surface, to clear bacteria, and to control demineralization and mineralization; 
saliva's antibacterial activities; and perhaps other mechanisms all contribute to its essential role in 

30 the health of teeth. The fact that the protective function of saliva can be overwhelmed by bacterial 
action indicates the importance of prevention and therapy as in other infectious diseases. With 
knowledge of salivary components and their interactions, the use of modified oral molecules as 
therapeutic agents may become a important contributor to oral health. 

Proline-rich proteins are major components of parotid and submandibular saliva in humans 

35 as well as other animals. They can be divided into acidic, basic and glycosylated proteins. The 
proline-rich proteins are apparently synthesized the acinar cells of the salivary glands and their 
phenotypic expression is under complex genetic control. The acidic proline-rich proteins will bind 
calcium with a strength which indicates that they may be important in maintaining the concentration 
of ionic calcium in saliva. Moreover they can inhibit formation of hydroxyapatite, whereby growth 
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of hydroxyapatite crystals on the tooth surface in vivo may be avoided. Both of these activities as 
well as the binding site for hydroxyapatite are located in the N-terminal proline-poor part of the 
protein. 

Basic histidine rich salivary gland peptides such as the peptide of SEQ ID NO. 191 and 
5 475, also referred to as histatins are a group of electrophoretically distinct histidine-rich 
polypeptides with microbicidal activity found in human parotid and submandibular gland 
secretions. Histatins 1, 3, and 5 are homologous proteins that consist of 38, 32, and 24 amino acid 
residues, respectively, that have been shown to kill the pathogenic yeast, Candida albicans. More 
recently histatins 2, 4, 6, and 7-12 were isolated and characterized Troxler RF et al, J Dent Res 1990 

10 Jan;69(l):2-6. Histatin 2 was found to be identical to the carboxyl terminal 26 residues of histatin 1 ; 
histatin 4 was found to be identical to the carboxyl terminal 20 residues of histatin 3; and histatin 6 
was found to be identical to histatin 5, but contained an additional carboxyl terminal arginine 
residue. The amino acid sequences of histatins 7-12 formally corresponded to residues 12-24, 13- 
24, 12-25, 13-25, 5-1 1, and 5-12, respectively, of histatin 3, but could also arise proteolytically from 

15 histatin 5 or 6. Troxler et al provides further guidance on the structural elements and relationship of 
histatins to one another in the context of their genetic origin, biosynthesis and secretion into the oral 
cavity, and potential as reagents in anti-candidal studies. The HSTG polypeptide and fragments 
thereof are therefore expected to have valuable properties and uses in antimicrobial applications, 
particularly in antifungal applications. Supporting such uses is a considerable body of evidence, 

20 including MacKay BJ et al, Infect Immun. 1984 Jun;44(3):695-701 , Growth-inhibitory and 

bactericidal effects of human parotid salivary histidine-rich polypeptides on Streptococcus mutants; 
MacKay BJ et al, Infect Immun. 1984 Jun;44(3):688-94, Isolation of milligram quantities of a group 
of histidine-rich polypeptides from human parotid saliva; Pollock JJ et al, Infect Immun. 1984 
Jun;44(3):702-7, Fungistatic and fungicidal activity of human parotid salivary histidine-rich 

25 polypeptides on Candida albicans; and Xu T et al, Infect Immun. 1991 Aug;59(8):2549-54, 

Anticandidal activity of major human salivary histatins. Furthermore, tissue distribution of RNAs 
for cystatins, histatins, statherin, and proline-rich salivary proteins in humans and macaques is 
further discussed in Sabatini et al, J Dent Res 1989 Jul;68(7):l 138-45. 

In further embodiments, the skilled artisan will appreciate that fragments and analogues of 

30 PRSG and HSTG may readily be generated and selected. Selection of preferred fragments and 

analogies may be carried out by assaying for a desired antimicrobial activity. For example, synthetic 
histatin analogues and methods for obtaining such analogies with broad-spectrum antimicrobial 
activity are described in Helmerhorst EJ et al, Biochem J 1997 Aug 15;326 ( Pt l):39-45, where 
histatin analogies inhibited the growth of the second most common yeast found in clinical isolates, 

35 Torulopsis glabrata, of oral- and non-oral pathogens such as Prevotella intermedia and 
Streptococcus mutants, and of a methicillin-resistant Staphylococcus aureus. 

Thus, in preferred embodiments, the PRSG and/or HSTG polypeptides or fragments thereof 
may be used in oral, injectable, topical or edible compositions for the treatment of infection PRSG 
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and/or HSTG polypeptides may also be used as antimicrobial/antifungal compositions for 
disinfection of surfaces (e.g. in industrial settings). 

In a preferred example further discussed below, the PRSG and/or HSTG polypeptides or 
fragments thereof are used in oral, topical (e.g. mouthwash) or edible compositions optionally 
5 containing additional salivary proteins to provide an anticaries effect. While there is an interest in 
developing and marketing products which reduce caries without reliance on a high level of fluoride 
ions (such as in fluoridated water and fluoride toothpastes), there have not been many reports of 
such approaches meeting with success. While certain cysteine-rich proteins have been proposed 
useful in the treatment of dental caries (U.S. Patent No. 5,688,766, Revis et al), the present 

10 invention provides PRSG and HSTG polypeptides which may provide higher potency, efficacy and 
range of disinfection and protection. 

PRSG and HSTG polypeptide compositions can be administered in a formulation 
comprising a carrier. A preferred carrier composition for the active(s) of this invention are oral 
compositions. Such compositions include toothpastes, mouthrinses, liquid dentifrices, lozenges, 

1 5 chewing gums or other vehicle suitable for use in the oral cavity. Toothpastes and mouthrinses are 
the preferred systems. The abrasive polishing material contemplated for use in the toothpaste 
compositions of the present invention can be any material which does not excessively abrade dentin. 
These include, for example, silicas including gels and precipitates, calcium carbonate, dicalcium 
orthophosphate dihydrate, calcium pyrophosphate, tricalcium phosphate, calcium 

20 polymetaphosphate, insoluble sodium polymetaphosphate, hydrated alumina, and resinous abrasive 
materials such as particulate condensation products of urea and formaldehyde, and others such as 
disclosed by Cooley et al. in U.S. Pat. No. 3,070,510, Dec. 25, 1962, incorporated herein by 
reference. Mixtures of abrasives may also be used. Silica dental abrasives, of various types, can 
provide the unique benefits of exceptional dental cleaning and polishing performance without 

25 unduly abrading tooth enamel or dentin. For these reasons, they are preferred for use herein. 

Flavoring agents can also be added to toothpaste compositions. Suitable flavoring agents include oil 
of wintergreen, oil of peppermint, oil of spearmint, oil of sassafras, and oil of clove. Sweetening 
agents which can be used include aspartame, acesulfame, saccharin, dextrose, levulose and sodium 
cyclamate. Flavoring and sweetening agents are generally used in toothpastes at levels of from 

30 about 0.005% to about 2% by weight. Toothpaste compositions can also contain emulsifying agents. 
Suitable emulsifying agents are those which are reasonably stable and foam throughout a wide pH 
range, including non-soap anionic, nonionic, cationic, zwittefionic and amphoteric organic synthetic 
surfactants. Water is also present in the toothpastes of this invention. Water employed in the 
preparation of commercially suitable toothpastes should preferably be deionized and free of organic 

35 impurities. In preparing toothpastes, it is necessary to add some thickening material to provide a 
desirable consistency. Preferred thickening agents are carboxyvinyl polymers, carrageenan, 
hydroxyethyl cellulose and water soluble salts of cellulose ethers such as sodium carboxymethyl 
cellulose and sodium carboxymethyl hydroxyethyl cellulose. Natural gums such as gum karaya, 
xanthan gum, gum arabic, and gum tragacanth can also be used. Colloidal magnesium aluminum 
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silicate or finely divided silica can be used as part of the thickening agent to further improve 
texture. Thickening agents in ane amount from 0.2% to 5.0% by weight of the total composition can 
be used. It is also desirable to include some humectant material in a toothpaste to keep it from 
hardening. Suitable humectants include glycerin, sorbitol, and other edible polyhydric alcohols at a 
5 level of from about 1 5% to about 70%. 

Another preferred embodiment of the present invention is a mouthwash composition. 
Conventional mouthwash composition components can comprise the carrier for the agents of the 
present invention. Mouthwashes generally comprise from about 20:1 to about 2:1 of a water/ethyl 
alcohol solution or be alcohol free and preferably other ingredients such as flavor, sweeteners, 

10 humectants and sudsing agents such as those mentioned above for dentifrices. The humectants, such 
as glycerin and sorbitol give a moist feel to the mouth. Generally, on a weight basis the 
mouthwashes of the invention comprise 0% to 60% (preferably 5% to 20%) ethyl alcohol, 0% to 
20% (preferably 5% to 20%) of a humectant, 0% to 2% (preferably 0.01% to 1 .0%) emulsifying 
agents, 0% to 0.5% (preferably 0.005% to 0.06%) sweetening agent such as saccharin or natural 

15 sweeteners such as stevroside 0% to 0.3% (preferably 0.03% to 0.3%) flavoring agent, and the 
balance water. 

The pH of the present compositions and/or the pH in the mouth can be any pH which is safe 
for the mouth's hard and soft tissues. Such pH's are generally from about 3 to about 10, preferably 
from about 4 to about 8. Other acceptable oral carders include gums, lozenges, as well as other 

20 forms. Such suitable forms are disclosed in U.S. Pat. No. 4,083,955, Apr. 11, 1978 to Grabenstetter 
et al. incorporated herein in its entirety by reference. Edible compositions are also suitable for use 
as the carrier compositions herein. Edible compositions include many types of solid as well as 
liquid compositions. Such compositions include, for example, soft drinks, citrus drinks, cookies, 
cakes, breads among many others. Such compositions may contain sugar or another sweetener, 

25 water, flour, shortening, other fibers such as wheat, corn, barley, rye, oats, psyllium and mixtures 
thereof. 

HSDG (HYDROXY STEROID DEHYDROGENASE) (Clone ID:495917) 

SEQ ID NOS 54 and 381 and clone FL1 1:49591 7 1 60-22-4-0-D8-F encode the polypeptide 
30 of SEQ ID NOS 223 and 498 respectively, a hydroxy steroid dehydrogenase referred to herein as 
HSDG. As the HSDG polypeptide is implicated in steroid hormone regulation, preferably 
glucocorticoid metabolism, HSDG may be useful in any applications where steroid hormones levels 
are to be increased or inhibited. HSDG may be useful in the treatment of disease treatable by steroid 
hormones. HSDG inhibitors may also be useful in systems for self-sufficient biosynthesis of steroid 
35 hormones such as glucocorticoids such as in engineered cells comprising elements of the synthesis 
pathway. HSDG inhibitors of endogenous HSDG activity may allow the recovery of higher 
amounts of glucocorticoids and/or other synthesized steroid hormones from these cell systems. 



146 



WO 02/083898 PCT/IBO 1/009 14 

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NOs: 54 and 381 and polypeptides of SEQ ID NO: 223 and 498, described throughout the present 
application also pertain to the human cDNA of clone 495917 , and the polypeptides encoded 
thereby. 

5 The HSDG polypeptides of the invention comprise leucine zipper pattern (Prosite ref. 

PS00029) at amino acid positions 58-79 as well as a N-myristoylation site at amino acid position 36 
(GANAGV) of SEQ ID NO 498. In one example, HSDG polypeptides may be used in the 
production or therapeutic modulation of glucocorticoids. The skilled artisan will recognize that any 
suitable HSDG polypeptides or variants or fragments thereof capable of metabolizing 

10 glucocorticoids can readily be used. 

In one embodiment HSDG activity can be determined by detecting levels of glucocorticoids 
or metabolites or glucocorticoids. Moreover, structural aspects of 1 1 beta-hydroxysteroid 
dehydrogenase have been documented in the art and may serve as a guidelines for developing 
suitable HSDG variants and fragments. The HSDG polypeptides of the invention may allow 

1 5 modulation of glucocorticoid activity or identification (e.g. drug screening) of compounds capable 
of specifically modulating glucocorticoid levels or activity. HSDG may be useful in allowing the 
modulation of steroid hormone synthesis, or glucocorticoid synthesis to be carried in a tissue 
specific manner, thereby offering improved methods for treating disease with decreased risk of side 
effects, 

20 Corticosteroids, also referred to as glucocorticoids are steroid hormones, the most common 

form of which is Cortisol. Modulation of glucocorticoid activity is important in regulating 
physiological processes in a wide range of tissues and organs. Glucocorticoids act within the gonads 
to directly suppress testosterone production (Monder, C et al, (1994) Steroids 59, 69-73). High 
levels of glucocorticoids may also result in excessive salt and water retention by the kidneys, 

25 producing high blood pressure. 

Glucocorticoid action is mediated via binding of the molecule to a receptor, defined 
hereinafter as either a mineralocorticoid receptor (MR) or a glucocorticoid receptor (GR). 
Krozowski, Z. S. et al, ((1983) Proc. Natl. Acad. Sci. USA 80, 6056-6060) and Beaumont, K. et al, 
((1983) Endocrinology 1 13, 2043-2049) showed that MR of adrenalectomised rats have an equal 

30 affinity for the mineralocorticoid aldosterone and glucocorticoids, for example corticosterone and 
Cortisol. Confirmatory evidence has been found for human MR (Arriza, J. L et al, (1988) Neuron 1 , 
887-900). In patients suffering from the congenital syndrome of Apparent Mineralocorticoid 
Excess (AME), Cortisol levels are reportedly elevated and bind to and activate MRs normally 
occupied by aldosterone, the steroid that regulates salt and water balance in the body. Salt and water 

35 are retained in AME patients causing severe hypertension. 

Like HSDG, the enzyme 1 1 p-hydroxysteroid dehydrogenase (1 IpHSD), also discussed in 
U.S. Patent No. 5,965,372 (Funder et al) may be involved in converting glucocorticoids into 
metabolites that are unable to bind to MRs (Edwards et al, (1988) Lancet. 2: 986-9; Funder et al, 
(1988) Science 242, 583,585), present in mineralocorticoid target tissues, for example kidney, 
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pancreas, small intestine, colon, as well as the hippocampus, placenta and gonads. For example, in 
aldosterone target tissues 1 1(3HSD inactivates glucocorticoid molecules, allowing the much lower 
circulating levels of aldosterone to maintain renal homeostasis. When the 1 lpHSD enzyme is 
inactivated, for example in AME patients or following administration of glycyrrhetinic acid, a 
5 component of licorice, severe hypertension results. Further, placental 1 lpHSD activity may protect 
the foetus from high circulating levels of glucocorticoid which may predispose to hypertension in 
later life (Edwards et al., 1993). Biochemical characterisation of activity has indicated the presence 
of at least two 1 1 pHSD isoenzymes (11 PHSD1 and 1 1 PHSD2) with different cofactor requirements 
and substrate affinities. The 1 1 pHSDl enzyme is a low affinity enzyme that prefers NADP+ as a 

10 cofactor (Agarwal et al., 1989). The 1 1PHSD2 enzyme is a high affinity enzyme (Km for 

glucocorticoid=10 nM), requiring NAD+, not NADP+ as the preferred cofactor, belonging to a class 
of glucocorticoid dehydrogenase enzymes hereinafter referred to as "NAD+ dependent 
glucocorticoid dehydrogenase" enzymes. 

Inverse correlation between 1 1 PHSD enzyme activity in human granulosa-lutein cells and 

1 5 the success of 1VF has further been shown, suggesting that activity of this enzyme might be related 
to the success of embryo attachment and implantation following IVF. The measurement of ovarian 

I lpHSD enzyme activity as a prognostic indicator for the outcome of assisted conception in all 
species, is the subject of UK Patent Application No 9305984. However, the disclosure of Michael et 
al. ((1993) Lancet 342, 711-712), and corresponding UK Patent Application No 9305984 do not 

20 identify, or even suggest which isoenzyme in the ovary might be a predictive indicator of IVF 
embryo transfer, or a means of distinguishing isoenzymes of 1 1 PHSD in the prediction of IVF 
embryo transfer outcomes. In fact, the enzyme assay procedure might detect all isoenzymes of 

I I PHSD activity in the cell, some of which may be hitherto uncharacterised. 

Thus, the human HSDG hydroxy steroid dehydrogenase enzyme and the nucleic acids 
25 encoding it providing novel means for the development of gene therapies and identification of 
HSDG activators and inhibitors which alter the endogenous activity of this hydroxysteroid 
dehydrogenase enzyme in a cell. The present invention also permits the screening, through genetic 
or immunological means, levels of expression of genes encoding the NAD+ dependent 
glucocorticoid dehydrogenase enzyme in various tissue or organ types, including for example, skin, 
30 colon, kidney, placenta, and gonads, amongst others. 

S100G CALCIUM BINDING PROTEIN (Clone ID:200895) 

The polynucleotides of clone FL1 1 :200895_1 1 6-055-1 -0-H 1 1-F and SEQ ID NOS 132 and 
428 encode for the S100 calcium binding protein referred to herein as S100G. SEQ ID NOS 301 
35 and 542 provide the amino acid sequence corresponding to the nucleic acid sequences of SEQ ID 
NOS 132 and 428, respectively. It will be appreciated that all characteristics and uses of the 
polynucleotides of SEQ ID NOs:132 and 428 and polypeptides of SEQ ID NO: 301 and 542, 
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described throughout the present application also pertain to the human cDNA of clone200895, and 
the polypeptides encoded thereby. 



BACKGROUND 

In nearly all eukaryotic cells, calcium (Ca2 + ) functions as an intracellular signaling 
5 molecule in diverse cellular processes including cell proliferation and differentiation, 

neurotransmitter secretion, glycogen metabolism, and skeletal muscle contraction. Within a resting 
cell, the concentration of Ca2 + in the cytosol is extremely low, <10~7 M. However, when the cell is 
stimulated by an external signal, such as a neural impulse or a growth factor, the cytosolic 
concentration of Ca^ + increases by about 50-fold. This influx of Ca^ + is caused by the opening of 

10 plasma membrane Ca^ -1 " channels and the release of Ca^ + from intracellular stores such as the 
endoplasmic reticulum. Ca^ + directly activates regulatory enzymes, such as protein kinase C, 
which trigger signal transduction pathways. 

The protein of SEQ ID NOS 301 and 542 is a calcium binding SI 00 protein typically found 
in heart and muscle. SI 00 proteins are low-molecular weight calcium binding proteins that are 

15 believed to play an important role in various cellular processes such as cytoskeletal/membrane 
interactions, cell division and differentiation. The expression of SI 00 proteins has been evaluated 
in a variety of disorders. For example, SI 00 protein have been evaluated as markers of 
inflammatory disease, including ulcerative colitits, Crohn's disease, and as serum markers for 
subjects with infections diseases including AIDS and malaria and for subjects with hematological 

20 disease. Evidence has accumulated that indicates that SI 00 proteins can alter cellular invasion and 
metastatic spead of cancer. For example, SI 00 protein is expressed in dendritic cells in human 
transitional cell carcinoma of the bladder and the invasive potential of these tumor has been found 
to correlate with the presence of SI 00 protein expressing cells. 

THERAPEUTICS AND DIAGNOSTICS 

25 The S100G protein of SEQ ID NOS 301 and 542 disclosed herein provides new calcium 

binding SI 00 protein compositions useful in the diagnosis, prevention, and treatment of cancer, 
reproductive disorders, immune disorders, neuronal disorders, vesicle trafficking disorders and 
developmental disorders. 

CBPs are implicated in a variety of disorders and several CBPs have proven to be effective 

30 therapeutic targets for which small molecule inhibitors could be developed. However, while several 
CBPs are targets for widely-used therapeutic treatments, it would be advantageous to provide 
further CBPs allowing more selective therapeutic treatments for disease to be developed. In one 
example, calcineurin is found in the cells of all eukaryotes ranging from yeast to mammals. 
Calcineurin is a target for inhibition by the immunosuppressive agents cyclosporin A and FK506 

35 emphasizing its importance in immune disorders (Kissinger, C. R. et al. (1995) Nature 378:641- 
644). Calcineurin also plays a critical role in transcriptional regulation and growth control in T- 
lymphocytes (Wang, M. G. et al. (1996) Cytogenet. Cell Genet. 72:236-241). However, inhibition 
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of calcineurin phosphatase activity has been implicated both in the mechanism of 
immunosuppression and in the observed toxic side effects of FK506 in nonlymphoid cells, 
suggesting that identification of a new (FK binding proteins (FKBPs) that can mediate calcineurin 
inhibition and are restricted in its expression to T cells could provide new immunosuppressive drugs 
5 may be identified that, by virtue of their specific interaction with the FKBP, would be targeted in 
their site of action (Baughman G, et al Mol Cell Biol 1995 Aug;15(8):4395-402). In another CBP 
example, levels of CaM are increased several-fold in tumors and tumor-derived cell lines for 
various types of cancer (Rasmussen, C. D. and Means, A. R. (1989) Trends in Neuroscience 
12:433-438). Calcium binding S100P is another example of a CBP involved in a variety of 

10 disorders. Like the S100G protein of the invention, S100p contains an EF-hand motif S100p is 
abundantly expressed in the nervous system. S100P levels are increased in the blood and 
cerebrospinal fluid of patients with neurological injury resulting from cerebral infarction, transient 
ischemic attacks, hemorrhagia, head trauma, and Down's syndrome. Furthermore, S100P and other 
neural-specific CBPs may also protect against neurodegenerative disorders, such as Alzheimer's, 

15 Parkinson's, and Huntington's diseases. S100P is produced and secreted by glial cells in the central 
and peripheral nervous systems (Allore, R. J. et al. (1990) J. Biol. Chem. 265:15537-15543). The 
accumulation of S100p in mature glial cells is associated with the microtubule network. S100P 
promotes neuronal differentiation and survival but may be detrimental to cells if overexpressed. The 
selective overproduction has been implicated in the progression of the neuropathological changes in 

20 Alzheimer's disease which may involve mitotic protein kinases (Marshak, D. R. and Pena, L. A. 
(1992) Prog. Clin. Biol. Res. 379:289-307). Adult T-cell leukaemia (ATL) is a mature T-cell 
malignancy which is caused by human T lymphotrophic virus type-1. Diminished surface 
expression of the T-cell receptor alpha beta (TCRaP+) complex is a specific feature of ATL cells. 
SI OOP is not detectable in CD4+, TCRapH- ATL cells, but is expressed in CD4-, CD8-, TCRap+ 

25 leukaemic cells from four ATL patients. This suggested that increased levels of S100p may be 

associated with the diminished surface expression of the TCRaP+ complex in ATL (Suzushima, H. 
et al. (1994) Leuk. Lymphoma 13:257-262). Elevated serum levels of S100P are associated with 
disseminated malignant melanoma metastases, suggesting that serum S100P may be of value as a 
clinical marker for progression of metastatic melanoma (Henze, G. et al. (1997) Dermatology 

30 194:208-212). In yet another example, messenger RIM A levels encoding human calgizzarin (an 
SlOO-like protein), as well as those encoding phospholipase A2, are elevated in colorectal cancers 

compared with those of normal colorectal mucosa (Tanaka, M. et al. (1995) Cancer Lett. 89:195- 
200). Finally, an intracellular SI 00 calcium-binding protein has been isolated from rat peritoneum. 
This protein, MRP 14, is one of two migration inhibitory factor-related proteins that are expressed in 
35 peritoneal macrophages in the arthritis-susceptible Lewis/N rat (Imamichi, T. et al. (1993) Biochem. 
Biophys. Res. Comm. 194:819-825). 

However, despite the many uses and therapeutics based on SI 00 proteins and other CBPs, it 
would be advantageous to selectively target CBPs which are involved in a disorder and which are 
found specifically in the targeted cells or tissues. Thus, in a first embodiment, the S100G protein of 
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the invention can be used for the development of selective inhibitors of calcium signaling, eg. 
preferably inhibitors of transcriptional regulation and cell growth control. 

In one embodiment, an antagonist of S100G may be administered to a subject to prevent or 
treat a neuronal disorder. Such disorders may include, but are not limited to, akathesia, Alzheimer's 
5 disease, amnesia, amyotrophic lateral sclerosis, bipolar disorder, catatonia, cerebral neoplasms, 
dementia, depression, Down's syndrome, tardive dyskinesia, dystonias, epilepsy, Huntington's 
disease, multiple sclerosis, neurofibromatosis, Parkinson's disease, paranoid psychoses, 
schizophrenia, and Tourette's disorder. In one aspect, an antibody which specifically binds S100G 
may be used directly as an antagonist or indirectly as a targeting or delivery mechanism for bringing 

1 0 a pharmaceutical agent to cells or tissue which express S 1 00G. 

In one embodiment, an antagonist of S100G may be administered to a subject to prevent or 
treat a vesicle trafficking disorder. Such disorders may include, but are not limited to, cystic 
fibrosis, glucose-galactose malabsorption syndrome, hypercholesterolemia, diabetes mellitus, 
diabetes insipidus, hyper- and hypoglycemia, Grave's disease, goiter, Cushing's disease, and 

15 Addison's disease; gastrointestinal disorders including ulcerative colitis, gastric and duodenal 
ulcers; other conditions associated with abnormal vesicle trafficking including AIDS; allergies 
including hay fever, asthma, and urticaria (hives); autoimmune hemolytic anemia; proliferative 
glomerulonephritis; inflammatory bowel disease; multiple sclerosis; myasthenia gravis; rheumatoid 
and osteoarthritis; scleroderma; Chediak-Higashi and Sjogren's syndromes; systemic lupus 

20 erythematosus; toxic shock syndrome; traumatic tissue damage; and viral, bacterial, fungal, 

helminth, and protozoal infections. In one aspect, an antibody which specifically binds S100G may 
be used directly as an antagonist or indirectly as a targeting or delivery mechanism for bringing a 
pharmaceutical agent to cells or tissue which express S100G. 

In one embodiment, an antagonist of S100G may be administered to a subject to prevent or 

25 treat an immunological disorder. Such disorders may include, but are not limited to, AIDS, 

Addison's disease, adult respiratory distress syndrome, allergies, anemia, asthma, atherosclerosis, 
bronchitis, cholecystitis, Crohn's disease, ulcerative colitis, atopic dermatitis, dermatomyositis, 
diabetes mellitus, emphysema, erythema nodosum, atrophic gastritis, glomerulonephritis, gout, 
Graves' disease, hypereosinophilia, irritable bowel syndrome, lupus erythematosus, multiple 

30 sclerosis, myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis, 
pancreatitis, polymyositis, rheumatoid arthritis, scleroderma, Sjogren's syndrome, Werner 
syndrome, and autoimmune thyroiditis; complications of cancer, hemodialysis, and extracorporeal 
circulation; viral, bacterial, fungal, parasitic, protozoal, and helminthic infections; and trauma. In 
one aspect, an antibody which specifically binds SI OOP may be used directly as an antagonist or 

35 indirectly as a targeting or delivery mechanism for bringing a pharmaceutical agent to cells or tissue 
which express S100G. 

In one embodiment, an antagonist of S100G may be administered to a subject to prevent or 
treat a neoplastic disorder. Such disorders may include, but are not limited to, adenocarcinoma, 
leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of 



151 



WO 02/083898 PCT/IBO 1/009 14 

the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall bladder, ganglia, 
gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, 
prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus. In one aspect, an antibody 
which specifically binds S100P may be used directly as an antagonist or indirectly as a targeting or 
5 delivery mechanism for bringing a pharmaceutical agent to cells or tissue which express S100G. 

In addition, the S100G nucleic acids and polypeptides of the present invention can be used 
to identify compounds for the treatment of a subject experiencing negative side effects from the 
administration of other pharmaceuticals, such as those drugs that disrupt the body's calcium 
homeostasis. Co-administration of said compounds would be useful to counter-effect iatrogenically 

10 caused dysfunction of calcium metabolism. 

An antagonist of S100G may be produced using methods which are generally known in the 
art. In particular, purified S100G may be used to produce antibodies or to screen libraries of 
pharmaceutical agents to identify those which specifically bind S100G. In one example, a positive 
screening for drugs that specifically inhibit the Ca2+-signaling activity was carried out on the basis 

1 5 of the growth promoting effect on a yeast mutant with a peculiar phenotype (Shitamukai A et al, 
Biosci Biotechnol Biochem 2000 Sep;64(9): 1942-6). An inappropriate activation of a signaling 
pathway in yeast often has a deleterious physiological effect and causes various defects, including 
growth defects. In a certain genetic background (deltazdsl) of Saccharomyces cerevisiae, the cell- 
cycle progression in G2 is specifically blocked in the medium with CaC12 by the hyperactivation of 

20 the Ca2+-signaling pathways. Shitamukai et al provide an example of a drug screening procedure 
designed to detect the active compounds that specifically attenuate the Ca2+-signaling activity on 
the basis of the ability to abrogate the growth defect of the cells suffering from the hyperactivated 
Ca2+ signal. Screening conditions were established for the drugs that suppress the Ca2+-induced 
growth inhibition using known calcineurin inhibitors as model compounds, and an indicator strain 

25 with an increased drug sensitivity was constructed with a syrl/erg3 null mutation. 

In another embodiment, a vector expressing the complement of the polynucleotide encoding 
SI OOP may be administered to a subject to treat or prevent a neuronal disorder, immunological 
disorder, neoplastic disorder or vesicle trafficking disorder including, but not limited to, those 
described above. In other embodiments, any of the proteins, antagonists, antibodies, agonists, 

30 complementary sequences or vectors of the invention may be administered in combination with 
other appropriate therapeutic agents. Selection of the appropriate agents for use in combination 
therapy may be made by one of ordinary skill in the art, according to conventional pharmaceutical 
principles. The combination of therapeutic agents may act synergistically to effect the treatment or 
prevention of the various disorders described above. Using this approach, one may be able to 

35 achieve therapeutic efficacy with lower dosages of each agent, thus reducing the potential for 
adverse side effects. 

Expression of SI 00 proteins have been evaluated as serum markers for melanoma (Henze et 
al, Dermatology 194:208-212; Bueretal, 1997 Brit. J. Cancer 75: 1373-1376; Sherbert et al. 1998; 
Anticancer Res. 18:2415-2422) and more recently as serum markers for cancer in general 
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(particularly breast, colon and lung cancers) and that their detectability in serum can have 
prognostic and/or therapeutic significance in cancer. Furthermore auto-antibodies to SI 00 proteins 
were found in cancer patients. (International Patent Publication No. WO 00/26668). The S100G 
protein of the invention may thus be used in the diagnosis and prevention of cancer, for 
5 identification of subjects predisposed to cancer, for monitoring patients undergoing treatment for 
cancer based on the increased level of S100G protein(s) in biological fluid samples of subjects. 
Methods for diagnosis and prognosis of cancer in a subject may comprise 

a) detecting a S100G protein in a biological fluid sample obtained from a subject, and 

b) comparing the level of protein detected in the subjects sample to the level of protein 
1 0 detected in a control sample, 

wherein an increase in the level of S100G protein detected in the subject's sample as 
compared to control samples is an indicator of a subject with cancer of at increased risk for cancer. 

The invention also comprises methods for diagnosis and prognosis of a subject with cancer 
comprising: 

15 a) contacting a serum sample derived from a subject with a sample containing S100G 

protein antigens under conditions such that a specific antigen-antibody complex binding can occur; 
and 

b) detecting the presence of immunospecific binding of autoantibodies present in the 
suject's serum samples to the S100G protein; 
20 wherein the presence of immunospecific binding of autoantibodies indicates the presence of 

cancer. 

Assays for detection of S100G protein in a sample can be accomplished by any suitable 
method, including immunoassays where in S100G proteins are detected by their interaction with an 
S100G specific antibody. In addition, reagents other than antibodies such as for example 

25 polypeptides that specifically bind S100G may be used. 

In yet further embodiments, the S100G protein of the invention may be useful for the 
development of specific anti-SlOO antibodies which are specific for SI 00 proteins other than 
S100G. As several S100 proteins have been implicated in disease, it may be advantageous to 
develop a panel of SI 00 specific antibodies to characterize disease, eg. cancers. Thus, S100G 

30 proteins and antibodies thereto may be advantageous to distinguish cancer types. SI 00G proteins 
and antibodies may also be useful in the screening of SI 00 specific antibodies by determining the 
selectivity of a given anti-SlOO antibody for its target, and eliminating antibodies which are cross- 
reactive with S100G proteins. 

STUCTURAL ASPECTS OF SI 00 PROTEINS OF THE INVENTION 

35 The SI 00 proteins are a group of low molecular mass (approximately 10-12 kDa) acidic 

Ca^" 1 " -binding proteins, so named after the solubility of the first isolated protein in 100% saturated 
ammonium sulfate. The most striking conserved feature of these proteins is the presence of an EF- 
hand. The SI 00 proteins have two Ca^ + -binding domains. One of these domains is a basic helix- 
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loop-helix domain, the other domain is an acidic helix-loop-helix EF-hand (Kligman, D. and Hilt, 
D. C. (1988) Trends Biochem. Sci. 13:437-442). The EF-hand domain also encompasses a part of a 
region within S100 proteins which specifically identifies members of the S100 family of proteins 
which have a low affinity for Ca 2+ ions (SlOO/ICaBP; PROSITE PS00303, SWISSPROT, PFAM 
5 PF01023). The EF-hand is characterized by a twelve amino acid residue-containing loop, flanked 
by two alpha-helices, orientated approximately 90 degrees with respect to one another. Aspartate 
(D) and glutamate (E) residues are usually found bordering the twelve amino acid loop. In addition, 
a conserved glycine residue in the central portion of the loop is found in most Ca 2 " 1 " -binding EF- 
hand domains. Oxygen ligands within this domain coordinate the Ca2+ ion (Kretsinger, R. H. and 

10 Nockolds, C. E. (1973) J. Biol. Chem. 248:3313-3326). It will also be appreciated by the skilled 
artisan that modifications of the S100G polypeptide may readily be made based on extensive 
knowledge of CBP stucture and Ca(2+) binding mechanisms, such as Sastry M et al, (Structure 
1998;6:223-231), describing the three-dimensional structure of Ca(2+)-bound calcyclin and its 
implications for Ca(2+)-signal transduction by SI 00 

15 Recently, a protein designated S100A13 has been discovered and characterized which 

shared significant primary structure similarity with the S100G polypeptide of the invention. (Wicki 
et al, (1996) BBRC 227:594-599). 

The binding to calcium induces a conformational change in the SI 00 proteins, and this may 
then affect the secondary effector proteins. This mode of protein-protein interaction and modulation 

20 of the activity of the secondary effector protein is similar to that seen with calmodulin, also 

containing the EF-hands. The S100G protein of the invention comprises an ICaBP type calcium 
binding domain domain at amino acid positions 9 to 52 and casein kinase II phosphorylation site 
patterns at amino acid position 7 (TELE); 34 (SVNE); and 55 (SLDE) of SEQ ID NO 542. 

The S100G polypeptides of the invention may thus also be used in any situation in vivo or 

25 in vitro where it is desired to modulate, preferably decrease the level of free calcium, or in 

applications involving sensing a change in calcium concentration. In one aspect, S100G nucleic 
acids and polypeptides may be used to develop a calcium biosensor. Calcium biosensors may have 
particular utility in detecting abnormalities in calcium transport that result in uncompensated influx 
into, or efflux from, the extracellular fluid, will result in hypercalcaemia or hypocalcaemia, 

30 respectively. Such abnormalities in serum calcium concentration may have profound effects on 
neurological, gastrointestinal, and renal function (Bushinsky DA et al, Lancet 1998 Jul 25;352 
(9124):306-1 1). Calcium biosensors may be developed by using any suitable means known in the 
art to detect a conformational change induced by the binding of calcium to the EF-hand domains of 
the S100G polypeptides. Preferably, the conformational change is detected by detecting a change in 

35 the ability of the S100G protein to bind a selected secondary effector protein 

As the distribution of particular SI 00 proteins is dependent on specific cell types, the SI 00 
proteins may be involved in transducing the signal of an increase in intracellular calcium in a cell 
type-specific fashion (Wu, T. et al. (1997) J. Biol. Chem. 272:17145-17153). 
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Calcium is one of the "second messengers" which relays chemical and electrical signals 
within a cell. This signal transduction and, hence the regulation of biological processes, involves 
interaction of calcium ion with high-affinity calcium-binding proteins (CBPs). Disclosed herein in 
5 SEQ ID NOS 184 and 469 is one such protein, encoded by the nucleic acid sequences of SEQ ID 
NOS 15 and 352, respectively, and the clone FL1 l:500742698_204-61-4-0-B2-F, and further 
referred to herein as CaMLP, which is thought to act as a Ca^" 1 " sensing and binding protein 
involved in diverse aspects of cell proliferation (such as for example of hepatocytes, melanoma 
cells, leukemic lymphocytes, and HUVEC (human umbilical vein endothelial cells)) and 

10 differentiation. It will be appreciated that all characteristics and uses of the polynucleotides of SEQ 
ID NOs: 1 5 and 352 and polypeptides of SEQ ID NO: 1 84 and 469, described throughout the present 
application also pertain to the human cDNA of clone 500742698, and the polypeptides encoded 
thereby. Notably, the CaMLP polypeptide contains EF hand calcium-binding domains (PROSITE 
PS000 1 8) at amino acid positions 8 1 -93 and at position 1 29- 1 4 1 of SEQ ID NO 469. CaMLP 

15 The cellular processes in which Ca2 + functions as an intracellular signaling molecule are 

diverse, including cell proliferation and differentiation, neurotransmitter secretion, glycogen 
metabolism, and skeletal muscle contraction. Within a resting cell, the concentration of Ca^ + in the 
cytosol is extremely low, <10.sup.-7 M. However, when the cell is stimulated by an external signal, 
such as a neural impulse or a growth factor, the cytosolic concentration of Ca^ + increases by about 

20 50-fold. This influx of Ca^" 1- is caused by the opening of plasma membrane Ca^+ channels and the 
release of Ca^" 1 " from intracellular stores such as the endoplasmic reticulum. Ca^" 1 " directly activates 
regulatory enzymes, such as protein kinase C, which trigger signal transduction pathways. Ca^ + 
also binds to specific Ca^ + -binding proteins (CBPs) such as calbindins, troponin C, calmodulin, 
and S-100 proteins which then activate multiple target proteins including enzymes, membrane 

25 transport pumps, and ion channels. Calmodulin (CaM) is the most widely distributed and the most 
common mediator of calcium effects and appears to be the primary sensor of Ca^ + changes in 
eukaryotic cells. The binding of Ca^ + to CaM induces marked conformational changes in the 
protein permitting interaction with, and regulation of over 100 different proteins. CBP interactions 
are involved in a multitude of cellular processes including, but not limited to, gene regulation, DNA 

30 synthesis, cell cycle progression, mitosis, cytokinesis, cytoskeletal organization, muscle contraction, 
signal transduction, ion homeostasis, exocytosis, and metabolic regulation (Celio, M. R. et al. 
(1996) Guidebook to Calcium-binding Proteins, Oxford University Press, Oxford, UK, pp. 15-20). 

THERAPEUTICS 

The CaMLP protein of SEQ ID NOS 1 84 and 469 disclosed herein provides new calcium 
35 binding protein compositions useful in the diagnosis, prevention, and treatment of cancer, 
reproductive disorders, immune disorders, neuronal disorders and developmental disorders. 

Calcium binding proteins (CBPs) are implicated in a variety of disorders and several CBPs 
have proven to be effective therapeutic targets for which small molecule inhibitors could be 
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developed. However, while several CBPs are targets for widely-used therapeutic treatments, it 
would be advantageous to provide further CBPs allowing more selective therapeutic treatments for 
disease to be developed. Evidence has accumulated for a large number of CBPs suggesting 
involvement in cell proliferative disorders. It is proposed that CaMLP may be useful as a tissue 
5 specific calmodulin homologue allowing the development of specific inhibitors and activators 
having increased selectivity and safety (decreased side effect profile). To date, calmodulin 
antagonists are reportedly useful for the treatment of some malignant tumors, particularly those of 
the central nervous system, as well as lung tumors. The antitumor activity of calmodulin 
antagonists, as well as successful chemotherapy using the same, has been described, for example, in 

10 Sculler et al. Cancer Res., 50:1645-1649 (1990) and Hait et al. Cancer Res., 50:6636-6640 (1990). 
U.S. Pat. No. 5,340,565, additionally describes the use of calmodulin antagonists or inhibitors as 
agents which enhance the effectiveness of a chemotherapeutic agent or radiation treatment. 
Specifically, described therein is a method of inhibiting or killing a tumor or cancer cell in a human 
patient undergoing radiation therapy or chemotherapy, for example with such chemotherapeutic 

1 5 agents as cisplatin (Platinol®), by additionally administering a calmodulin binding agent which 
inhibits calmodulin activity. 

Calmodulin is also believed to play a pathogenic role in the tissue damage caused by burns 
and frostbite (Beitner et ah, Gen. Pharmac. 20: 641-646, 1989), as well as in dermatitis and other 
conditions involving keratinocyte hyperproliferation. The methods of the present invention may be 

20 applied to the treatment of these and other conditions wherein antagonism of calmodulin activity is 
desirable. 

Thus, as discussed above, CaMLP protein of the invention shares structural similarity with 
the ubiquitous intracellular receptor protein calmodulin suggesting that CaMLP may be useful in the 
development of selective CaMLP inhibitors. The nucleic acids and polypeptides of the invention 

25 thus provide a novel therapeutic target in particular for cell proliferative disorders. In one aspect, 
said nucleic acids and protein may be used in drug screening processes to develop selective 
calmodulin and other calcium binding protein antagonists which do not inhibit the polypeptide of 
the invention. In another aspect, the nucleic acids and polypeptides of the invention may be used in 
drug screening processes to identify selective modulators of the CaMLP without inhibiting 

30 calmodulin, thereby identifying compounds less likely to cause unwanted side effects. In yet 

another aspect, the nucleic acids and polypeptides of the invention may be used in drug screening 
processes to identify selective modulators of both calmodulin and CaMLP, thereby identifying 
compounds having increased potency. 

Upon calcium binding, CaMLP may interact with a number of protein targets in a calcium 

35 dependent manner, thereby altering a number of complex biochemical pathways that can affect the 
overall behavior of cells. The calcium-calmodulin complex for example controls the biological 
activity of more than thirty different proteins including several enzymes, ion transporters, receptors, 
motor proteins, transcription factors, and cytoskeletal components in eukaryotic cells. 
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As described in U.S. Patent No. 5,840,697, Blondelle et al have peptide inhibitors of 
calmodulin. A number of other calmodulin targeted compounds are known and used for a variety of 
therapeutic applications. For instance, chlorpromazine (Thorazine.RTM.) and related phenothiazine 
derivatives, disclosed, for example, in U.S. Pat. No. 2,645,640, are calmodulin antagonists useful as 
5 tranquilizers and sedatives. Naphthalenen-sulfonamides, also calmodulin antagonists, are known to 
inhibit cell proliferation, as disclosed, for example, in Hidaka et al. ((1981), PNAS, 78:4354-4357) 
and are useful as antitumor agents. In addition, the cyclic peptide cyclosporin A (Sandimmune®), 
disclosed in U.S. Patent No. 4,1 17,1 18, is as an immunosuppressive agent which is thought to work 
by inhibiting calmodulin mediated responses in lymphoid cells. 

10 Many existing calmodulin inhibitors have undesirable biological effects when administered 

at concentrations sufficient to block calmodulin. These undesirable biological effects include non- 
specific binding to other proteins or receptors, as described, for example, in Polak et al, ((1991), J. 
Neurosci. 1 1 : 534-542.) In addition, negative side effects such as toxicity can occur. A specific 
example is the toxic side effects from cyclosporin A. Therefore, a need exists for calmodulin 

15 targeted agents, and in particular calmodulin antagonists which inhibit calmodulin without having 
additional, undesirable biological or side effects. In particular there is a need for inhibitors which 
are specific to calmodulin and which do not have toxic side effects. 

In addition, the CaMLP nucleic acids and polypeptides of the present invention can be used 
to identify compounds for the treatment of a subject experiencing negative side effects from the 

20 administration of other pharmaceuticals, such as those drugs that disrupt the body's calcium 

homeostasis. Co-administration of said compounds would be useful to counter-effect iatrogenically 
caused dysfunction of calcium metabolism. Such disorders include, but are not limited to, organ 
damage, autoimmune disorders, psychotic disorders, tumors and drug induced dysfunction, such as 
negative side effects subsequent to administration of pharmaceuticals. For example, organ or tissue 

25 transplantation can result in autoimmune disorders, such as tissue graft (allograft) rejections. 

It is well known that calmodul in-targeted compounds which are antagonists can be used as 
immunosuppressive agents. In addition, also as described above, such compounds are widely used 
as sedative or anti-psychotic agents. Furthermore, there is evidence that calmodulin (and hence 
CaMLP) antagonists are useful for the treatment of some malignant tumors, particularly those of the 

30 central nervous system, as well as lung tumors. The antitumor activity of calmodulin antagonists, as 
well as successful chemotherapy using the same, has been described, for example, in Sculler et al. 
Cancer Res., 50:1645-1649 (1990) and Hait et al. Cancer Res., 50:6636-6640 (1990), both of which 
are incorporated herein by reference. U.S. Pat. No. 5,340,565, which is incorporated herein by 
reference, additionally describes the use of calmodulin antagonists or inhibitors as agents which 

35 enhance the effectiveness of a chemotherapeutic agent or radiation treatment. Specifically, 
described therein is a method of inhibiting or killing a tumor or cancer cell in a human patient 
undergoing radiation therapy or chemotherapy, for example with such chemotherapeutic agents as 
cisplatin (Platinol), by additionally administering a calmodulin binding agent which inhibits 
calmodulin activity. 
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It has also been found that extracellular calmodulin inhibits TNF release and facilitates 
elastase release, providing further suggestion that CaMLP, CaMLP analogues and CaMLP receptor 
agonists are useful agents for regulating the inflammatory process. CaMLP antagonists, which 
include CaMLP receptor antagonists and CaMLP -binding molecules, may be used to block the 
5 interaction of CaMLP with a receptor, thus providing the opposite effect from CaMLP, its 
analogues and receptor agonists. CaMLP may serve as a potent modulator of self-directed 
inflammation by assisting in the recognition of self vs. non-self as prokaryotes (e.g., bacterial 
pathogens) do not contain CaMLP. In some situations such as in tumor necrosis, release of 
extracellular CaMLP may lead to an inappropriate host response and failure of the 

10 immune/inflammatory systems to eradicate tumor cells. Further, a diagnostic test has been 
developed which can discern patient variabilities in TNF inhibition by calmodulin and other 
substances. This test can be utilized in monitoring individual patients for determining effective 
therapies, and for predicting efficacy of therapy with extracellular CaMLP, CaMLP analogues or 
CaMLP receptor agonists on the one hand and CaMLP antagonists on the other. A diagnostic test 

1 5 for elastase has also been developed with similar utility. 

Lvsozvme C Protein of SEP ID NO: 196 (internal designation 482181) and related protein of SEP 
ID NO:479 

The polypeptides of SEQ ID NP : 196 and SEQ ID NO:479encoded by the cDNA of SEQ 

20 ID NP:27 and 362, respectively, belong to the widely conserved family of iysozyme C precursors 
(Prager and Jolles, Lysozymes: model enzymes in biochemistry and biology, ed. Jolles, 9-321 
(1996), Qasba and Kumar, Crit. Rev. Biochem. Mol Biol 32:255-306 (1997)), which disclosures 
are hereby incorporated by reference in their entireties. The protein of SEQ ID NPs: 196 and 479 or 
part thereof plays a role in glycoprotein and/or peptidoglycan metabolism, probably as a glycosyl 

25 hydrolase of family 22. Thus, the protein of the invention or part thereof is involved in immune and 
inflammatory responses and has antiviral, antibacterial, anti-inflammatory and/or anti-histaminic 
functions. Preferred polypeptides of the invention are polypeptides comprising the amino acids of 
SEQ ID NP: 196 from positions 19 to 100, or from positions 1 to 100. Pther preferred polypeptides 
of the invention are fragments of SEQ ID NP: 1 96 having any of the biological activities described 

30 herein. The glycolytic activity of the protein of the invention or part thereof may be assayed using 
any of the assays known to those skilled in the art including those described in Gold and Schweiger, 
M. Methods in Enzymology, Vol. XX, Part C pp. 537-542, Ed. Moldave, Academic Press,New 
York and London, 1971 and in the US patent 4,255,517, which disclosures are hereby incorporated 
by reference in their entireties. 

35 Lysozymes, which are ubiquitous proteins found in most body secretions, are defined as 

1,4-beta-N-acetylmuramidase which cleave the glycoside bond between the C-l of N-acetyl- 
muramic acid and the C-4 of N-acetylglucosamine in the peptidoglycan of bacteria. They have 
various therapeutic properties, such as antiviral, antibacterial, anti-inflammatory and antihistaminic 
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effects. The activity of lysozymes as an anti-bacterial agent appears to be based on both its direct 
bacteriolytic activity and also on stimulatory effects in connection with phagocytosis of 
polymorphonuclear leucocytes and macrophages (Biggar and Sturgess, J. M. Infect Immunol. 16: 
974-982 (1977); Thacore and Willet, Am. Rev. Resp. Dis. 93: 786-790 (1966); Klockars and 
5 Roberts, P. Acta Haematol 55: 289-292 (1976)), which disclosures are hereby incorporated by 
reference in their entireties. Lysozymes have proven to be not only a selective factor but also an 
effective factor against microorganisms of the mouth (Iacono et al, J. J. Infect. Immunol. 29: 623- 
632 (1980)), which disclosure is hereby incorporated by reference in its entirety. Lysozymes can 
also kill pathogens by acting synergistically with other proteins such as complement or antibody to 

10 lyse pathogenic cells. Lysozymes, also inhibit chemotaxis of polymorphonuclear leukocytes and 
limit the production of oxygen free radicals following an infection. This limits the degree of 
inflammation, while at the same time enhances phagocytosis by these cells. Other postulated 
functions of lysozymes include immune stimulation (Jolles, P. Biomedicine 25: 275-276 (1976) 
Ossermann, E. F. Adv. Pathobiol 4: 98-102 (1976)) and immunological and non-immunological 

15 monitoring of host membranes for any neoplastic transformation (Jolles, P. Biomedicine 25: 275- 
276 (1976); Ossermann, E. F. Adv. Pathobiol 4: 98-102 (1976)), which disclosures are hereby 
incorporated by reference in their entireties. Lysozymes may thus be used in a wide spectrum of 
applications (see US patent 5,618,712, which disclosure is hereby incorporated by reference in its 
entirety). Determination of the lysozymes from serum and/or urine is used to diagnose various 

20 diseases or as an indicator for their development. In acute lymphoblastic leukaemia the lysozyme 
serum level is significantly reduced, whereas in chronic myelotic leukaemia and in acute 
monoblastic and myelomonocytic leukaemia the lysozyme concentration in the serum is greatly 
increased. The therapeutically effective use of lysozyme is possible in the treatment of various 
bacterial and virus infections (Zona, Herpes zoster), in colitis, various types of pain, in allergies, 

25 inflammation and in pediatrics (the conversion of cows milk into a form suitable for infants by the 
addition of lysozyme). 

The invention relates to methods and compositions using the protein of the invention or part 
thereof to hydrolyze one or several substrates, alone or in combination with other substances, 
preferably antiviral, antifungal and/or antibacterial substances including but not limited to 

30 immunoglobulins, lactoferrin, betalysin, fibronectin, and complement components. Such substrates 
are glycosylated compounds, preferably containing beta-l-4-glycoside bonds, more preferably 
containing beta-l-4-glycoside bonds between n-acetylomuraminic acid and n-acetyloglucosamine. 
For example, the protein of the invention or part thereof is added to a sample containing the 
substrate(s) in conditions allowing hydrolysis, and allowed to catalyze the hydrolysis of the 

35 substrate(s). In a preferred embodiment, the hydrolysis is carried out using a standard assay such as 
those described by Gold and Schweiger, supra, and US patents 5,871,477 and 4,255,517, which 
disclosures are hereby incorporated by reference in their entireties. In a preferred embodiment, the 
protein of the invention or part thereof may be used to lyze recombinant bacteria in order to recover 
the recombinant DNA, the recombinant protein of interest, or both using, for example, any of the 
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assays described in Sambrook, et ah, Molecular Cloning: A Laboratory Manual, Second Edition, 
Cold Spring Harbor Laboratory Press (1989), which disclosure is hereby incorporated by reference 
in its entirety. 

In an embodiment, the protein of the invention or part thereof is used to hydrolyze 
5 contaminating substrates, preferably exogenous substrates from bacterial, fungal or viral origins, in 
an aqueous sample or onto a material, preferably glassware and plasticware. In particular, the 
protein of the invention or part thereof may be used as a disinfectant in dental rinse, in protection of 
aqueous systems or in preparing material for medical applications using any of the methods and 
compositions described in US patents 5,069,717, 4,355,022 and 5,001,062, which disclosures are 

10 hereby incorporated by reference in their entireties. In a preferred embodiment, the protein of the 
invention is used as a host resistance factor in infants' formulas to convert cow's milk into a form 
more suitable for infants as described in US patent 6,020,015, which disclosure is hereby 
incorporated by reference in its entirety. In another preferred embodiment, the protein of the 
invention or part thereof may be used as a food preservative (see Hayashi et al., Agric. Biol. Chem. 

15 (European Edition of Japanese Journal of Agriculture, Biochemistry and Chemistry), Vol. 53, pp. 
3173-3177, 1989), which disclosure is hereby incorporated by reference in its entirety. In addition, 
the protein of the invention or part thereof may be used to clarify xanthan gum fermented broth for 
applications in food and in cosmetic industries using the method described in US patent 5,994,107, 
which disclosure is hereby incorporated by reference in its entirety. In another preferred 

20 embodiment, compositions comprising the protein of the present invention or part thereof are added 
to samples or materials as a "cocktail" with other antimicrobial substances, preferably antibiotics or 
hydrolytic enzymes such as those described in US patents 5,458,876 and 5,041,326, which 
disclosures are hereby incorporated by reference in their entireties, to decontaminate the samples. 
For example, the protein of the invention or part thereof may be used in place or in combination 

25 with antibiotics in cell cultures. The advantage of using a cocktail of hydrolytic enzymes is that one 
is able to hydrolyze a wide range of substrates without knowing the specificity of any of the 
enzymes. Using a cocktail of hydrolytic enzymes also protects a sample or material from a wide 
range of future unknown contaminants from a vast number of sources. For example, the protein of 
the invention or part thereof is added to samples where contaminating substrates, preferably 

30 exogenous substrates from bacterial, fungal or viral origins, is undesirable in an amount sufficient to 
promote hydrolysis of said substrates. Alternatively, the protein of the invention or part thereof 
may be bound to a chromatographic support, either alone or in combination with other hydrolytic 
enzymes, using techniques well known in the art, to form an affinity chromatography column. A 
sample containing the undesirable substrate is run through the column to remove the substrate. 

35 Immobilizing the protein of the invention or part thereof on a support advantageous is particularly 
for those embodiments in which the method is to be practiced on a commercial scale. This 
immobilization facilitates the removal of the enzyme from the batch of product and subsequent 
reuse of the enzyme. Immobilization of the protein of the invention or part thereof can be 
accomplished, for example, by inserting a cellulose-binding domain in the protein. One of skill in 
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the art will understand that other methods of immobilization could also be used and are described in 
the available literature. Alternatively, the same methods may be used to identify new substrates. 

In addition, the protein of the invention or part thereof may be useful to identify or quantify 
the amount of a given substrate in biological fluids, foods, water, air, solutions and the like. In a 
5 preferred embodiment, the protein of the invention or part thereof is used in assays and diagnostic 
kits for the identification and quantification of exogenous substrates in bodily fluids including 
blood, lymph, saliva or other tissue samples, in addition to bacterial, fungal, plant, yeast, viral or 
mammalian cell cultures. In a preferred embodiment, the protein of the invention or part thereof is 
used to detect, identify, and or quantify eubacteria using reagents and assays described in US patent 

10 5,935,804, which disclosure is hereby incorporated by reference in its entirety. Briefly, the protein 
of the invention of part thereof is catalytically inactived , i.e. capable of binding but not cleaving a 
peptidoglycan comprising NAc-muramic acid in the eubacteria, using any of the methods known to 
those skilled in the art including those which produce a mutant enzyme, a recombinant-enzyme, or a 
chemically inactivated enzyme. The catalytically inactive protein of the invention is then incubated 

1 5 with an aliquot of a biological sample under conditions suitable for binding of the inactive enzyme 
to the peptidoglycan substrate. Then, the bound enzyme is detected to assess the presence or 
amount of the eubacteria in the biological sample. 

In another embodiment, the nucleic acid of the invention or part thereof may be used to 
increase disease resistance of plants to bacterial, fungal and/or viral infections. A polynucleotide 

20 containing the nucleic acid of the invention or part thereof is introduced into the plant genome in 
conditions allowing correct expression of the transgenic protein using any methods known to those 
skilled in the art including those disclosed in US patents 5,349,122 and 5,850,025, which 
disclosures are hereby incorporated by reference in their entireties. 

In another preferred embodiment, the protein of the invention or part thereof may be useful 

25 to treat and/or prevent bacterial, fungal and viral infections in humans or in animals caused by 
various agents including but not limited to Streptococcus, Veillonella alcalescens, Actinomyces, 
Herpes simplex, Candida albicans, Micrococcus lysodeikticus and HIV by hydrolyzing the 
glycosylated compounds contained in such micro-organisms. In still a preferred embodiment, the 
protein of the invention or part thereof is used to prevent and/or treat bacterial, fungal and viral 

30 infections in immunocompromised individuals who lack fully functional immune systems, such as 
neonates or geriatric patients or HIV-infected individuals, or who suffer from a disease affecting the 
respiratory tract such as cystic fibrosis or the gastrointestinal tract such as ulcerative colitis or sprue. 

In still another embodiment, the protein of the invention or part thereof may be used as a 
growth factor for in vitro cell culture, preferably for T cells and T cell lines, using techniques and 

35 methods taught in US patent 5,468,635, which disclosure is hereby incorporated by reference in its 
entirety. 

In addition, the protein of the invention or part thereof may be used to identify inhibitors for 
mechanistic and clinical applications. Such inhibitors may then be used to identify or quantify the 
protein of the invention in a sample, and to diagnose, treat or prevent any of the disorders where the 
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protein's hydrolytic, immunostimulatory and/or inflammatory activities is/are undesirable and/or 
deleterious including but not limited to amyloidosis, colitis, lysosomal diseases, inflammatory and 
immune disorders including allergies and leukaemia. The protein of the invention may also be used 
to monitor host cell membranes for neoplastic transformation. 
5 It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 

NOs:27 and 362 and polypeptides of SEQ ID NO: 196 and 479, described throughout the present 
application also pertain to the human cDNA of clone 482181, and the polypeptides encoded 
thereby. 



10 Angiogenin Protein of SEP ID NO: 176 (internal designation 1 14180) and related protein of SEQ 
ID NO:461 

The polypeptides of SEQ ID NO: 1 76 and SEQ ID NO:461 encoded by the extended cDNA 
SEQ ID NO:7 and SEQ ID NO:344, respectively, are ribonucleases that belongs to the pancreatic 
ribonuclease family (see reviews from Beintema, (1998) Cell. Mol Life Sci. 54:763-5; Beintema 

1 5 and Kleineidam, (1998) Cell. Mol. Life Sci. 54:825-32, which disclosures are hereby incorporated 
by reference in their entireties). It will be appreciated that all characteristics and uses of the 
polynucleotides of SEQ ID NO:7 and SEQ ID NO:344 and polypeptides of SEQ ID NO: 1 76 and 
SEQ ID NO:461, described throughout the present application also pertain to the human cDNA of 
clone 1 14180, and the polypeptides encoded thereby. In addition, the protein of the invention plays 

20 a role in angiogenesis as an angiogenin variant protein (see review from Badet, (1999) Pathol. Biol. 
74:345-51, which disclosure is hereby incorporated by reference in its entirety). Preferred 
polypeptides of the invention are polypeptides comprising the amino acids of SEQ ID NO: 1 76 
from positions 19 to 75, from positions 26 to 75, or from positions 63 to 69. Other preferred 
polypeptides of the invention are fragments of SEQ ID NO: 1 76 having any of the biological 

25 activity described herein. The ribonuclease activity of the protein of the invention or part thereof 
may be assayed using any of the assays known to those skilled in the art including those described 
in US patent 5,866,1 19, which disclosure is hereby incorporated by reference in its entirety. The 
angiogenic activity of the protein of the invention or part thereof may be assayed using any of the 
assays known to those skilled in the art including those described by Fett et al. (1985) Biochem. 24, 

30 5480-5486, which disclosure is hereby incorporated by reference in its entirety. 

Ribonucleases are proteins which catalyze the hydrolysis of phosphodiester bonds in RNA 
chains. Pancreatic ribonucleases are pyrimidic-specific ribonucleases present in high quantity in the 
pancreas of a number of mammalia taxa and of a few reptiles. In addition to their function in 
hydrolysis of RNA, ribonucleases have evolved to support a variety of other physiological 

35 activities. Such activities include anti-parasite, anti-bacterium, anti-virus, anti-neoplastic activities, 
neurotoxicity, and angiogenesis. For example, bovine seminal ribonuclease is anti-neoplastic 
(Laceetti, et al. (1992) Cancer Res. 52: 4582-4586, which disclosure is hereby incorporated by 
reference in its entirety). Some frog ribonucleases display both anti-viral and anti-neoplastic activity 
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(Youle, et al. (1994) Proc. Natl. Acad. Sci. USA 91 : 6012-6016; Mikulski, et al. (1990) J. Natl. 
Cancer Inst. 82: 151-152; and Wu, et al. (1993) J. Biol. Chem. 268: 10686-10693), which 
disclosures are hereby incorporated by reference in their entireties. Eosinophil-derived neurotoxin 
(EDN) and eosinophil cationic protein (ECP) are related ribonucleases which possess neurotoxicity 
5 (Beintema, et al. (1988) Biochemistry 27: 4530-4538; Ackerman, (1993) In Makino, and Fukuda, 
Eosinophils: Biological and Clinical Aspects. CRC Press, Boca Raton, Fla., pp 33-74), which 
disclosures are hereby incorporated by reference in their entireties. In addition, ECP exhibits 
cytotoxic, anti-parasitic, and anti-bacterial activities. A EDN-related ribonuclease, named RNase 
k6, is shown to express in normal human monocytes and neutrophils, suggesting a role for this 

10 ribonuclease in host defense (Rosenberg, and Dyer, (1996) Nuc. Acid. Res. 24: 3507-3513), which 
disclosure is hereby incorporated by reference in its entirety. 

Angiogenin is a tRNA-specific ribonuclease which binds protein partners on the surface of 
endothelial cells for endocytosis. Potential partners of angiogenin include heparin, plasminogen, 
elastase, angiostatin, actin, and a 170 kDa receptor on the surface of endothelial cells [Strydom, 

15 (1998) Cell. Mol. Life Sci. 54, 81 1-824, which disclosure is hereby incorporated by reference in its 
entirety ]. Endocytosed angiogenin is translocated to the nucleus where it promotes endothelial 
invasiveness required for blood vessel formation (Moroianu, and Riordan, (1994) Proc. Natl. Acad. 
Sci. USA 91 : 1217-1221, which disclosure is hereby incorporated by reference in its entirety ). 

Although originally isolated from medium conditioned by human colon cancer cells (Fett et 

20 al. (1985), supra), and subsequently shown to be produced by several other histological types of 
human tumors [Rybak, et al. (1987) Biochem. Biophys. Res, Commun. 146, 1240-1248; Olson, et 
al., (1995) Proc. Natl. Acad. Sci. U.S.A. 92, 442-446, which disclosures are hereby incorporated by 
reference in their entireties], angiogenin also is a constituent of human plasma and normally 
circulates at a concentration of 250-360 ng/ml [Shimoyama, et al. (1996) Cancer Res. 56, 2703- 

25 2706; Blaser, et al. (1993) Eur. J. Clin. Chem. Clin. Biochem. 31, 513-516, which disclosures are 
hereby incorporated by reference in their entireties]. It has also been shown that recurrent gastric 
cancer patients had a much higher serum concentration of angiogenin than primary gastric cancer 
patients [Shimoyama, and Kaminishi, (2000) J. Cancer Res. Clin. Oncol. 126, 468-474, which 
disclosure is hereby incorporated by reference in its entirety]. 

30 Angiogenin is a potent inducer of angiogenesis [Fett, et al. supra]. Angiogenesis is a 

complex process of blood vessel formation comprising of several separate but interconnected steps 
at the cellular and biochemical level including: (i) activation of endothelial cells by the action of an 
angiogenic stimulus, (ii) adhesion and invasion of activated endothelial cells into the surrounding 
tissues and migration toward the source of the angiogenic stimulus, and (iii) proliferation and 

35 differentiation of endothelial cells to form a new microvasculature [Folkman, and Shing, (1992) J. 
Biol. Chem. 267, 10931-10934; Moscatelli, and Rifkin, (1988) Biochim. Biophys. Acta 948, 67-85, 
which disclosures are hereby incorporated by reference in their entireties]. While angiogenesis is a 
tightly-controlled process under usual physiological conditions, abnormal angiogenesis can have 
devastating consequences in pathological conditions such as arthritis, diabetic retinopathy and 
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tumor growth. It is now well-established that the growth of virtually all solid tumors is angiogenesis 
dependent [Folkman, (1989) J. Natl. Cancer Inst. 82, 4-6, which disclosure is hereby incorporated 
by reference in its entirety]. Angiogenesis is also a prerequisite for the development of metastasis, 
since it provides the means whereby tumor cells disseminate from the original primary tumor and 
5 establish at distant sites [Mahadevan, and Hart, (1990) Rev. Oncol. 3, 97-103; Blood, and Zetter 
(1990) Biochim. Biophys. Acta 1032, 89-1 1 8, which disclosures are hereby incorporated by 
reference in their entireties]. Therefore, interference with the process of tumor-induced 
angiogenesis can be an effective therapy for both primary and metastatic cancers. Indeed, several 
anti-angiogenic agents have been produced and are currently in the clinical trial stage. 

10 The invention relates to methods and compositions using the protein of the invention or part 

thereof to hydrolyze one or several substrates, preferably nucleic acids, more preferably RNA, alone 
or in combination with other substances. For example, the protein of the invention or part thereof is 
added to a sample containing the substrate(s) in conditions allowing hydrolysis, and allowed to 
catalyze the hydrolysis of the substrate(s). Hydrolysis conditions as described in the US patent 

1 5 5,866,1 19 may be used, which disclosure is hereby incorporated by reference in its entirety. 

In a preferred embodiment, the protein of the invention or part thereof may be used to 
remove contaminating RNA in a biological sample, alone or in combination with other nucleases. 
In a more preferred embodiment, the protein of the invention or part thereof may be used to purify 
DNA preparations from contaminating RNA, to remove RNA templates prior to second strand 

20 synthesis and prior to analysis of in vitro translation products. Compositions comprising the protein 
of the present invention or part thereof are added to biological samples as a "cocktail" with other 
nucleases. The advantage of using a cocktail of hydrolytic enzymes is that one is able to hydrolyze 
a wide range of substrates without knowing the specificity of any of the enzymes. Such cocktails of 
nucleases are commonly used in molecular biology assays, for example to remove unbound RNA in 

25 RNAse protection assays. Using a cocktail of hydrolytic enzymes also protects a sample from a 
wide range of future unknown RNA contaminants from a vast number of sources. For example, the 
protein of the invention or part thereof is added to samples where contaminating substrates is 
undesirable. Alternatively, the protein of the invention or part thereof may be bound to a 
chromatographic support, either alone or in combination with other hydrolytic enzymes, using 

30 techniques well known in the art, to form an affinity chromatography column. A sample containing 
the undesirable substrate is run through the column to remove the substrate. Immobilizing the 
protein of the invention or part thereof on a support is particularly advantageous for those 
embodiments in which the method is to be practiced on a commercial scale. This immobilization 
facilitates the removal of the enzyme from the batch of product and subsequent reuse of the enzyme. 

35 Immobilization of the protein of the invention or part thereof can be accomplished, for example, by 
inserting a cellulose-binding domain in the protein. One of skill in the art will understand that other 
methods of immobilization could also be used and are described in the available literature. 
Alternatively, the same methods may be used to identify new substrates. 
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In another embodiment, the protein of the invention or part thereof may be used to 
decontaminate or disinfect samples infected by undesirable parasite, bacteria and/or viruses using 
any of the methods known to those skilled in the art including those described in Youle et al, 
(1994), supra; Mikulski et al (1990) supra, Wu et al (1993) supra. 
5 In another embodiment, the present invention relates to compositions and methods using the 

protein of the invention or part thereof to selectively kill cells. The protein of the invention or part 
thereof is linked to a recognition moiety capable of binding to a chosen cell, such as lectins, 
receptors or antibodies thus generating cytotoxic reagents using methods and techniques described 
in US patent 5,955,073, which disclosure is hereby incorporated by reference in its entirety. 
10 In still another embodiment, the invention relates to compositions and methods using the 

protein of the invention or part thereof to stimulate cell proliferation both in vitro and in vivo, 
especially endothelial cell growth. For example, soluble forms of the protein of the invention or 
part thereof may be added to cell culture medium in an amount effective to stimulate cell 
proliferation. 

15 In still another embodiment, the protein of the invention or part thereof may be used in the 

diagnosis, prevention and/or treatment of disorders associated with excessive angiogenesis such as 
tumor growth, arthritis or diabetic retinopathy. 

In a preferred embodiment, the protein of the invention may be used as a diagnostic marker 
to evaluate the risk of a given individual to develop a tumor, to evaluate the risk of recurrence of 

20 tumors or to evaluate the degree of cancer aggressiveness based on the facts that the level of 

circulating angiogenin is lower in normal individuals than in patients bearing tumors, and is lower 
in patients with primary cancers compared to patients with reoccurent tumors, as stated above. 
Thus, quantitative immunoassays can be used for the detection of abnormal levels of either the 
protein of SEQ ID NO: 176 or the mRNA encoding such protein as the polynucleotide of SEQ ID 

25 No:7, thereby identifying those individuals at risk for the development of tumors or the recurrence 
of tumors. Detection of abnormal levels of the protein of the invention may be performed using any 
techniques known to those skilled in the art including those described elsewhere in the application. 
For example, antibodies binding specifically to the protein of the invention, or fragments thereof, 
may be used in routine immunoassays to screen for the presence or absence of the protein of the 

30 invention, or fragments thereof. Alternatively, the nucleic acids which encode the protein of the 
invention, or fragments thereof, may be used in hybridization assays to detect and/or quantity the 
expression of said protein. 

Another aspect of the invention provides for molecules which inhibit, or reduce, the 
biological activity or expression of SEQ ID NO: 276. Such molecules may be be administered to 

35 patients to prevent vascularization, especially tumor vascularization, thereby limiting tumor growth. 
Such antagonists and/or inhibitors may be antibodies specific for the protein of the invention that 
can be used directly as an antagonist, or indirectly as a targeting or delivery mechanism for bringing 
a pharmaceutical agent to cells or tissue which express the protein of the invention. Neutralizing 
antibodies, (i.e., those which inhibit protein-protein interactions) are especially preferred for 
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therapeutic use. Alternatively, such molecules may be mutated forms of the protein of the invention 
or truncated forms which will be able to bind to the partners of the protein of the invention and 
compete with if for partners but without eliciting any of its biological activities. Other methods to 
inhibit the expression of the protein of the invention include antisense and triple helix strategies as 
5 described herein. Other antagonists or inhibitors of the protein of the invention may be produced 
using methods which are generally known in the art, including the screening of libraries of 
pharmaceutical agents to identify those which specifically bind the protein of the invention. The 
protein of the invention, or part thereof, preferably its functional or immunogenic fragments, or 
oligopeptides related thereto, can be used for screening libraries of compounds in any of a variety of 
10 drug screening techniques including those described herein. 

Protease Inhibitor Protein of SEO ID NO: 181 (internal designation 1000771934) and related 
protein of SEP ID NO:466 

The protein of SEQ ID NOs:181 and 466 encoded by the extended cDNA SEQ ID NO: 12 

1 5 and 349, respectively, is a protease inhibitor belonging to the WAP-type disulfide core family. It 
will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID NOs: 12 and 
349 and polypeptides of SEQ ID NOs: 181 and 466, described throughout the present application 
also pertain to the human cDNA of clone 1000771934, and the polypeptides encoded thereby. 
Preferred polypeptides of the invention are polypeptides comprising the amino acid fragments of 

20 SEQ ID NO: 1 8 1 from positions 32 to 73, 49 to 62, 76 to 1 22, 97 to 1 1 0 or any combination thereof. 
Other preferred polypeptides of the invention are fragments of SEQ ID NO: 1 8 1 having any of the 
biological activity described herein. The protease inhibitor activity of the protein of the invention 
or part thereof may be assessed using any techniques known to those skilled in the art. Possible 
substrates for the protein of the invention include are not limited to trypsin, chymotrypsin, 

25 leukoproteinase, elastase, subtilisin, type IV collagenase and other serine proteases. 

Proteases, which cleave proteins, are largely used in industry including in food processing, 
brewing, and alcohol production. Proteases are important components of laundry detergents and 
other products. Within biological research, proteases are used in purification processes to degrade 
unwanted proteins. It is often desirable to employ proteases of low specificity or mixtures of more 

30 specific proteases to obtain the necessary degree of degradation. 

Proteases are also key components of a broad range of biological pathways, including blood 
coagulation and digestion. For example, the absence or insufficiency of a protease can result in a 
pathological condition that can be treated by replacement or augmentation therapy. Such therapies 
include the treatment of hemophilia with clotting factors VIII, IX, and Vila. In another application, 

35 the proteolytic enzyme tissue plasminogen activator (t-PA) is used to activate the body's clot lysing 
mechanism, thereby reducing morbitity resulting from myocardial infarction. The protease thrombin 
is used to initiate the clotting of fibrinogen-based tissue adhesives during surgery. Neutrophils 
produce several antibacterial serine proteases (Gabay, Ciba Found. Symp. 186:237-247, 1994; 
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Scocchi et al., Eur. J. Biochem. 209:589-595, 1992, which disclosures are hereby incorporated by 
reference in their entireties). Proteases also regulate cellular processes through receptor-mediated 
pathways by proteolytic activation of the cognate receptor (Vu et al., Cell 64:1057-1068, 1991; 
Blackhart et al., J. Biol. Chem. 271:16466-16471, 1996, which disclosures are hereby incorporated 
5 by reference in their entireties). 

Overproduction or lack of regulation of proteases can also have pathological consequences. 
Elastase, released within the lung in response to the presence of foreign particles, can damage lung 
tissue if its activity is not tightly regulated. Emphysema in smokers is believed to arise from an 
imbalance between elastase and its inhibitor, alpha- 1 -antitrypsin. This balance may be restored by 
10 administration of exogenous alpha- 1 -antitrypsin. 

In addition, protease inhibitors have been shown to inhibit the growth of microorganisms 
including human pathogenic bacteria such as strains of group A streptococci, including antibiotic- 
resistant strains (Merigan, T. et al (1996) Ann Intern Med 124:1039-1050; Stoka, V. (1995) FEBS. 
Lett 370:101-104; Vonderfecht, S. et al (1988) J Clin Invest 82:2011-2016; Collins, A. et al (1991) 
1 5 Antimicrob Agents Chemother 35:2444-2446, which disclosures are hereby incorporated by 
reference in their entireties). 

In view of the growing use of proteases in industry, research, and medicine, there is an 
ongoing need in the art for new enzymes and new enzyme inhibitors. The present invention 
addresses these needs. 

20 The invention relates to compositions and methods using the protein of the invention or part 

thereof to inhibit proteases, both in vitro or in vivo. Since proteases play an important role in the 
regulation of many biological processes in virtually all living organisms as well as a major role in 
diseases, inhibitors of proteases are useful in a wide variety of applications. 

In one embodiment, the protein of the invention or part thereof may be useful to quantify 

25 the amount of a given protease in a biological sample, and thus used in assays and diagnostic kits 
for the quantification of proteases in bodily fluids or other tissue samples, in addition to bacterial, 
fungal, plant, yeast, viral or mammalian cell cultures. In a preferred embodiment, the sample is 
assayed using a standard protease substrate. A known concentration of protease inhibitor is added, 
and allowed to bind to a particular protease present. The protease assay is then rerun, and the loss 

30 of activity is correlated to the protease inhibitor activity using techniques well known to those 
skilled in the art. 

In addition, the protein of the invention or part thereof may be used to remove, identify or 
inhibit contaminating proteases in a sample. Compositions comprising the polypeptides of the 
present invention may be added to biological samples as a "cocktail" with other protease inhibitors 
35 to prevent degradation of protein samples. The advantage of using a cocktail of protease inhibitors 
is that one is able to inhibit a wide range of proteases without knowing the specificity of any of the 
proteases. Using a cocktail of protease inhibitors also protects a protein sample from a wide range 
of future unknown proteases which may contaminate a protein sample from a vast number of 
sources. For example, the protein of the invention or part thereof are added to samples where 
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proteolytic degradation by contaminating proteases is undesirable. Such protease inhibitor cocktails 
(see for example the ready to use cocktails sold by Sigma) are widely used in research laboratory 
assays to inhibit proteases susceptible of degrading a protein of interest for which the assay is to be 
performed. Alternatively, the protein of the invention or part thereof may be bound to a 
5 chromatographic support, either alone or in combination with other protease inhibitor, using 

techniques well known in the art, to form an affinity chromatography column. A sample containing 
the undesirable protease is run through the column to remove the protease. Alternatively, the same 
methods may be used to identify new proteases. 

In a preferred embodiment, the protein of the invention or part thereof may be used to 

10 inhibit proteases implicated in a number of diseases where cellular proteolysis occur such as 
diseases characterized by tissue degradation including but not limited to arthritis, muscular 
dystrophy, inflammation, tumor invasion, glomerulonephritis, parasite-borne infections, 
Alzheimer's disease, periodontal disease, and cancer metastasis. 

In another preferred embodiment, the protein of the invention or part thereof may be useful 

15 to inhibit exogenous proteases, both in vivo and in vitro, implicated in a number of infectious 
diseases including but not limited to gingivitis, malaria, leishmaniasis, filariasis, osteoporosis and 
osteoarthritis, and other bacterial, and parasite-borne or viral infections. In particular, the protein of 
the invention or part thereof may offer applications in viral diseases where the proteolysis of 
primary polypeptide precursors is essential to the replication of the virus, as for HIV and HCV. 

20 In another embodiment, the protease inhibitors of the present invention may be used as 

antibacterial agents to retard or inhibit the growth of certain bacteria either in vitro or in vivo. 
Particularly, the polypeptides of the present invention may be used to inhibit the growth of group A 
streptococci on non-living matter such as surgical instruments, laboratory glassware and 
plasticware, and in culture of living plant, fungi, and animal cells. 

25 Furthermore, the protease inhibitors of the present invention find use in drug potentiation 

applications. For example, therapeutic agents such as antibiotics or antitumor drugs can be 
inactivated through proteolysis by endogenous proteases, thus rendering the administered drug less 
effective or inactive. Accordingly, the protease inhibitors of the invention may be administered to a 
patient in conjunction with a therapeutic agent in order to potentiate or increase the activity of the 

30 drug. This co-administration may be by simultaneous administration, such as a mixture of the 
protease inhibitor and the drug, or by separate simultaneous or sequential administration. 

Serpin Protein of SEP ID NO: 1 79 (internal designation 784093) and related protein of SEP ID 
NP:464 

35 The protein of SEQ ID NPs: 1 79 and 464 and encoded by the extended cDNA SEQ ID 

NPs: 1 0 and 347, respectively, is a serine protease inhibitor. It will be appreciated that all 
characteristics and uses of the polynucleotides of SEQ ID NPs: 1 0 and 347 and polypeptides of SEQ 
ID NOs: 1 79 and 464, described throughout the present application also pertain to the human cDNA 
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of clone 784093, and the polypeptides encoded thereby. Preferred polypeptides of the invention are 
polypeptides comprising the amino acid fragments of SEQ ID NO: 179 from positions 47 to 139. 
Other preferred polypeptides of the invention are fragments of SEQ ID NO: 1 79 having any of the 
biological activity described herein. The protease inhibitor activity of the protein of the invention 
5 or part thereof may be assessed using any techniques known to those skilled in the art including 
those disclosed in the US patent 5,955,284. Possible substrates for the protein of the invention 
include are not limited to serine proteases such as elastase, trypsin, chymotrypsin, thrombin III, 
plasmin, heparin, complement II, plasminogen activator, protein C, interleukin-lB coverting 
enzyme, preferably trypsin, elastase, and chymotrypsin. 

10 Proteases are key components of a broad range of biological pathways, including blood 

coagulation and digestion. For example, the absence or insufficiency of a protease can result in a 
pathological condition that can be treated by replacement or augmentation therapy. Such therapies 
include the treatment of hemophilia with clotting factors VIII, IX, and Vila. In another application, 
the proteolytic enzyme tissue plasminogen activator (t-PA) is used to activate the body's clot lysing 

15 mechanism, thereby reducing morbitity resulting from myocardial infarction. The protease thrombin 
is used to initiate the clotting of fibrinogen-based tissue adhesives during surgery. Neutrophils 
produce several antibacterial serine proteases (Gabay, Ciba Found. Symp. 186:237-247, 1994; 
Scocchi et al., Eur. J. Biochem. 209:589-595, 1992, which disclosures are hereby incorporated by 
reference in their entireties). Proteases also regulate cellular processes through receptor-mediated 

20 pathways by proteolytic activation of the cognate receptor (Vu et al., Cell 64: 1 057-1 068, 1991 ; 
Blackhart et al., J. Biol. Chem. 271:16466-16471, 1996, which disclosures are hereby incorporated 
by reference in their entireties). 

Overproduction or lack of regulation of proteases can also have pathological consequences. 
Elastase, released within the lung in response to the presence of foreign particles, can damage lung 

25 tissue if its activity is not tightly regulated. Emphysema in smokers is believed to arise from an 
imbalance between elastase and its inhibitor, alpha- 1 -antitrypsin. This balance may be restored by 
administration of exogenous alpha- 1 -antitrypsin. 

The serine proteases (SP) are a large family of proteolytic enzymes that include the 
digestive enzymes, trypsin and chymotrypsin, components of the complement cascade and of the 

30 blood-clotting cascade, and enzymes that control the degradation and turnover of macromolecules 
of the extracellular matrix. SP are so named because of the presence of a serine residue in the active 
catalytic site for protein cleavage. They are characterized by a catalytic triad of serine, histidine, and 
aspartic acid residues. SP have a wide range of substrate specificities and can be subdivided into 
subfamilies on the basis of these specificities. The main sub-families are trypases (cleavage after 

35 arginine or lysine), aspases (cleavage after aspartate), chymases (cleavage after phenylalanine or 
leucine), metases (cleavage after methionine), and serases (cleavage after serine). 

Serine proteases are used for a variety of industrial purposes. For example, the serine 
protease subtilisin is used in laundry detergents to aid in the removal of proteinaceous stains (e.g., 
Crabb, ACS Symposium Series 460:82-94, 1991, which disclosure is hereby incorporated by 
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reference in its entirety). In the food processing industry, serine proteases are used to produce 
protein-rich concentrates from fish and livestock, and in the preparation of dairy products (Kida et 
al., Journal of Fermentation and Bioengineering 80:478-484, 1995; Haard and Simpson, in Martin, 
A. M., ed., Fisheries Processing: Biotechnological Applications, Chapman and Hall, London, 1994, 
5 1 32-1 54; Bos et al., European Patent Office Publication 494 149 Al , which disclosures are hereby 
incorporated by reference in their entireties). 

Serpins are irreversible serine protease inhibitors which are principally located 
extracellularly. Proteins which have been assigned to the serpin family include the following: 
.alpha.-l protease inhibitor, .alpha.- 1 -antichymotrypsin, antithrombin III, .alpha.-2-antiplasmin, 

10 heparin cofactor II, complement CI inhibitor, plasminogen activator inhibitors 1 and 2, glia derived 
nexin, protein C inhibitor, rat hepatocyte inhibitors, crmA (a viral serpin which inhibits interleukin 
l-.beta. cleavage enzyme), human squamous cell carcinoma antigen which may modulate the host 
immune response against tumor cells, human maspin which seems to function as a tumor 
suppressor, lepidopteran protease inhibitor, leukocyte elastase inhibitor (the only known 

15 intracellular serpin), and products from three orthopoxviruses (these products may be involved in 
the regulation of the blood clotting cascade and/or of the complement cascade in the mammalian 
host). 

In view of the growing use of proteases in industry, research, and medicine, there is an 
ongoing need in the art for new enzymes and new enzyme inhibitors. The present invention 

20 addresses these needs. 

In one embodiment, the protein of the invention or part thereof may be useful to quantify 
the amount of a given protease in a biological sample, and thus used in assays and diagnostic kits 
for the quantification of proteases in bodily fluids or other tissue samples, in addition to bacterial, 
fungal, plant, yeast, viral or mammalian cell cultures. In a preferred embodiment, the sample is 

25 assayed using a standard protease substrate. A known concentration of protease inhibitor is added, 
and allowed to bind to a particular protease present. The protease assay is then rerun, and the loss 
of activity is correlated to the protease inhibitor activity using techniques well known to those 
skilled in the art. Preferred proteases in this embodiment are serine protease, more preferably 
elastase, trypsin and chymotrypsin. 

30 In addition, the protein of the invention or part thereof may be used to remove, identify or 

inhibit contaminating proteases in a sample. Compositions comprising the polypeptides of the 
present invention may be added to biological samples as a "cocktail" with other protease inhibitors 
to prevent degradation of protein samples. The advantage of using a cocktail of protease inhibitors 
is that one is able to inhibit a wide range of proteases without knowing the specificity of any of the 

35 proteases. Using a cocktail of protease inhibitors also protects a protein sample from a wide range 
of future unknown proteases which may contaminate a protein sample from a vast number of 
sources. For example, the protein of the invention or part thereof are added to samples where 
proteolytic degradation by contaminating proteases is undesirable. Such protease inhibitor cocktails 
(see for example the ready to use cocktails sold by Sigma) are widely used in research laboratory 
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assays to inhibit proteases susceptible of degrading a protein of interest for which the assay is to be 
performed. Alternatively, the protein of the invention or part thereof may be bound to a 
chromatographic support, either alone or in combination with other protease inhibitor, using 
techniques well known in the art, to form an affinity chromatography column. A sample containing 
5 the undesirable protease is run through the column to remove the protease. Alternatively, the same 
methods may be used to identify new proteases. 

In a preferred embodiment, the protein of the invention or part thereof may be used to 
inhibit proteases implicated in a number of diseases where cellular proteolysis occur such as 
diseases characterized by tissue degradation including but not limited to arthritis, muscular 

10 dystrophy, inflammation, tumor invasion, glomerulonephritis, parasite-borne infections, 

Alzheimer's disease, periodontal disease, and cancer metastasis. In a more preferred embodiment, 
the invention relates to compositions and methods to use the protein of the invention or part thereof 
in diseases characterized by an abnormally elevated levels of trypsin, chymotrypsin or elastase, 
including but not limited to chronic emphysema of the lungs, cirrhosis, liver failure, cystic fibrosis, 

1 5 alpha 1 -antitrypsin deficiency associated disorders such as aneurysm or toxic shock. For prevention 
and/or treatment purposes, the protein of the invention may be used using any of the gene therapy 
methods described herein or known to those skilled in the art. 

In another preferred embodiment, the protein of the invention or part thereof may be useful 
to inhibit exogenous proteases, both in vivo and in vitro, implicated in a number of infectious 

20 diseases including but not limited to gingivitis, malaria, leishmaniasis, filariasis, osteoporosis and 
osteoarthritis, and other bacterial, and parasite-borne or viral infections. In particular, the protein of 
the invention or part thereof may offer applications in viral diseases where the proteolysis of 
primary polypeptide precursors is essential to the replication of the virus, as for HIV and HCV. 
In another embodiment, the protease inhibitors of the present invention may be used as 

25 antibacterial agents to retard or inhibit the growth of certain bacteria either in vitro or in vivo. 

Particularly, an amount of the polypeptides of the present invention effective to inhibit proliferation 
may be used to inhibit the growth of group A streptococci on non-living matter such as surgical 
instruments, laboratory glassware and plasticware, and in culture of living plant, fungi, and animal 
cells. 

30 Furthermore, the protease inhibitors of the present invention find use in drug potentiation 

applications. For example, therapeutic agents such as antibiotics or antitumor drugs can be 
inactivated through proteolysis by endogenous proteases, thus rendering the administered drug less 
effective or inactive. Accordingly, the protease inhibitors of the invention may be administered to a 
patient in conjunction with a therapeutic agent in order to potentiate or increase the activity of the 

35 drug. This co-administration may be by simultaneous administration, such as a mixture of the 
protease inhibitor and the drug, or by separate simultaneous or sequential administration. 
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ATPKf Protein sequence of SEP ID No.253 (internal designation 1000867870) 

The protein of SEQ ID NO:253 encoded by the extended cDNA SEQ ID NO:84 and relate 
polynucleotides of SEQ ID NO:404 is a variant of the human mitochondrial ATP synthase f subunit 
or ATPK (E.C. 3.6.1 .34) and, as such, plays a role in cellular respiration. Preferred polypeptides of 
5 the invention are are polypeptides comprising the amino acids of SEQ ID NO: 253 from positions 5 
to 88. Other preferred polypeptides of the invention are fragments of SEQ ID NO: 253 having any 
of the biological activity described herein. It will be appreciated that all characteristics and uses of 
the polynucleotides of SEQ ID NOs:84 and 404 and polypeptides of SEQ ID NO: 253 described 
throughout the present application also pertain to the human cDNA of clone 1000867870, and the 

1 0 polypeptides encoded thereby. 

The mitochondrial electron transport (or respiratory) chain is a series of enzyme complexes 
in the mitochondrial membrane that is responsible for the transport of electrons from NADH to 
oxygen and the coupling of this oxidation to the synthesis of ATP (oxidative phosphorylation). ATP 
then provides the primary source of energy for driving a cell's many energy-requiring reactions. 

1 5 ATP synthase (F0 F 1 ATPase) is the enzyme complex at the terminus of this chain and serves as a 
reversible coupling device that interconverts the energies of an electrochemical proton gradient 
across the mitochondrial membrane into either the synthesis or hydrolysis of ATP. This gradient is 
produced by other enzymes of the respiratory chain in the course of electron transport from NADH 
to oxygen. When the cell's energy demands are high, electron transport from NADH to oxygen 

20 generates an electrochemical gradient across the mitochondrial membrane. Proton translocation 
from the outer to the inner side of the membrane drives the synthesis of ATP. Under conditions of 
low energy requirements and when there is an excess of ATP present, this electrochemical gradient 
is reversed and ATP synthase hydrolyzes ATP. The energy of hydrolysis is used to pump protons 
out of the mitochondrial matrix. ATP synthase is, therefore, a dual complex, the F0 portion of 

25 which is a transmembrane proton carrier or pump, and the Fl portion of which is catalytic and 
synthesizes or hydrolyzes ATP. Mammalian ATP synthase complex consists of sixteen different 
polypeptides (Walker, J. E. and Collinson, T. R. (1994) FEBS Lett.346: 39-43, which disclosure is 
hereby incorporated by reference in its entirety). Six of these polypeptides (subunits alpha, beta, 
gamma, delta, epsilon, and an ATPase inhibitor protein IF 1) comprise the globular catalytic F 1 

30 ATPase portion of the complex, which lies outside of the mitochondrial membrane. The remaining 
ten polypeptides (subunits a, b, c, d, e, f, g, F6, OSCP, and A6L) comprise the proton-translocating, 
membrane spanning F0 portion of the complex. Like other members of the respiratory chain, all but 
two of the polypeptide subunits of ATP synthase are nuclear gene products that are imported into 
the mitochondria. Enzyme complexes similar to mammalian ATP synthase are found in all cell 

35 types and in chloroplast and bacterial membranes. This universality indicates the central importance 
of this enzyme to ATP metabolism. Transcriptional regulation of these nuclear encoded genes 
appears to be the predominant means for controlling the biogenesis of ATP synthase. Multiple 
mitochondrial pathologies exist because of the essential role of mitochondrial oxidative 
phosphorylation in cellular energy production, in the generation of reactive oxygen species and in 
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the initation of apoptosis (Wallace, Science, 283:1482-1488, 1999, which disclosure is hereby 
incorporated by reference in its entirety). It is now clear that mitochondrial diseases encompass an 
assemblage of clinical problems commonly involving tissues that have high energy requirements 
such as heart, muscle and the renal and endocrine systems. Over the past 1 1 years, a considerable 
5 body of evidence has accumulated implicating defects in the mitochondrial energy-generating 
pathway, oxidative phosphorylation, in a wide variety of degenerative diseases including myopathy 
and cardiomyopathy. Most classes of pathogenic mitochondrial DNA mutations affect the heart, in 
association with a variety of other clinical manifestations that can include skeletal muscle, the 
central nervous system (including eye), the endocrine system, and the renal system. Nuclear 

10 mutations causing mitochondrial disorders have been described. They are often found in highly 
conserved subunits. Mitochondrial disorders with nuclear mutations include : myopathies (PEO, 
MNGIE, congenital muscular dystrophy, carnitine disorders), encephalopathies (Leigh, Infantile, 
Wilson's disease, Deafness-Dystonia syndrome), other systemic disorders and cardiomyopathies. 
The discovery of a new ATP synthase subunit, and polynucleotides encoding it satisfy a 

1 5 need in the art by providing new compositions which are useful for the diagnosis, prevention, and 
treatment of cancer, myopathies, immune disorders, and neurological disorders. 

An object of the present invention relates to compositions and methods of targeting 
heterologous compounds, either polypeptides or polynucleotides to mitochondria by recombinantly 
or chemically fusing a fragment of the protein of the invention to an heterologous polypeptide or 

20 polynucleotide. Preferred fragments are signal peptide, amphiphilic alpha helices and/or any other 
fragments of the protein of the invention, or part thereof, that may contain targeting signals for 
mitochondria including but not limited to matrix targeting signals as defined in Herrman and 
Neupert, Curr. Opinion Microbiol. 3:210-4 (2000); Bhagwat et al. J. Biol. Chem. 274:24014-22 
(1999), Murphy Trends Biotechnol. 15:326-30 (1997); Glaser et al. Plant Mol Biol 38:31 1-38 

25 (1998); Ciminale et al. Oncogene 18:4505-14 (1999), which disclosures are hereby incorporated by 
reference in their entireties. Such heterologous compounds may be used to modulate 
mitochondria's activities. For example, they may be used to induce and/or prevent mitochondrial- 
induced apoptosis or necrosis. In addition, heterologous polynucleotides may be used for 
mitochondrial gene therapy to replace a defective mitochondrial gene and/or to inhibit the 

30 deleterious expression of a mitochondrial gene. 

The invention further relates to methods and compositions using the protein of the invention 
or part thereof to diagnose, prevent and/or treat several disorders in which mitochondrial respiratory 
electron transport chain is impaired, including but not limited to mitochondriocytopathies, necrosis, 
aging, myopathies, cancer and neurodegenerative diseases such as Alzheimer's disease, 

35 Huntington's disease, Parkinson's disease, epilepsy, Down's syndrome, dementia, multiple sclerosis, 
and amyotrophic lateral sclerosis. For diagnostic purposes, the expression of the protein of the 
invention could be investigated using any of the Northern blotting, RT-PCR or immunoblotting 
methods described herein and compared to the expression in control individuals. For prevention 
and/or treatment purposes, the protein of the invention may be used to enhance electron transport 
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and increase energy delivery using any of the gene therapy methods described herein or known to 
those skilled in the art. 

In another embodiment, the invention further relates to methods and compositions using the 
protein of the invention or part thereof to diagnose, prevent and/or treat several disorders in which 
5 mitochondrial respiratory electron transport chain needs to be impaired, including but not limited to 
Sjogren's syndrome, Addison's disease, bronchitis, dermatomyositis, polymyositis, 
glomerulonephritis, diabetes mellitus, emphysema, Graves' disease, atrophic gastritis, lupus 
erythematosus, myasthenia gravis, multiple sclerosis, autoimmune thyroiditis, ulcerative colitis, 
anemia, pancreatitis, scleroderma, rheumatoid and osteoarthritis, asthma, allergic rhinitis, atopic 
10 dermatitis, dermatomyositis, polymyositis, and gout, using any techniques known to those skilled in 
the art including the antisense or triple helices strategies described herein. 

Moreover, antibodies to the protein of the invention or part thereof may be used for 
detection of mitochondria organelles and/or mitochondrial membranes using any techniques known 
to those skilled in the art. 

15 

Oligomerization Protein sequence of SEP ID No. 310 (internal designation Dl 50568) 

The protein of SEQ ID NO : 3 10 encoded by the cDNA of SEQ ID NO: 141 and 435, is 
able to form homo-oligomers. Preferred polypeptides of the invention are polypeptides comprising 
the amino acids of SEQ ID NO:3 10 from positions 1 to 109. Other preferred polypeptides of the 

20 invention are fragments of SEQ ID NO: 310 having any of the biological activities described herein. 

Multivalency is a prerequisite for a variety of macromolecular interactions such as binding 
of antibodies or lectins to specific targets, ligand recognition, activation or inhibition of receptors 
and cell adhesion. Dimerization and oligomerization of proteins are thus general biological control 
mechanisms that contribute to the activation of cell membrane receptors, transcription factors, 

25 vesicle fusion proteins, and other classes of intra- and extracellular proteins. 

Multimerization domains have been shown to be useful tools in several areas of 
biotechnology, especially in protein engineering. For example, Tso et al have used leucine zippers 
for producing bispecific antibody heterodimers (US patent 5,932,448) / Methods of preparing 
soluble oligomeric proteins using leucine zippers have been described by Conrad et al (US patent 

30 5,965,712), Ciardelli et al (US patent 5,837,816), Spriggs et al (WO9410308) / Leucine zipper 

forming sequences have been used by Pelletier et al in protein fragment complementation assays to 
detect biomolecular interactions (WO9834120), which disclosures are hereby incorporated by 
reference in their entireties. Because of their usefulness in biotechnology, it is thus highly 
interesting to isolate new multimerization domains. 

35 The multimerization activity of the protein of the invention or part thereof may be assayed 

using any of the assays known to those skilled in the art including circular dichroism spectrum, gel 
filtration chromatography and thermal melting analyses. 
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In one embodiment, the invention relates to compositions and methods of using the protein 
of the invention or part thereof for preparing soluble multimeric proteins, which consist in 
multimers of fusion proteins containing a multimerization domain fused to a protein of interest, 
using any technique known to those skilled in the art including those described in international 
5 patent WO9410308, which disclosure is hereby incorporated by reference in its entirety. 

In another embodiment, the protein of the invention or part thereof or derivative thereof is 
used for detection and determination of an analyte in a biological liquid using the teachings of US 
patent 5,643,73 1 , which disclosure is hereby incorporated by reference in its entirety. Briefly, a 
first multimerization domain is immobilized on a solid support and the second multimerization 
10 domain is coupled to a specific binding partner for an analyte in a biological fluid. The two 

peptides are then brought into contact thereby immobilizing the binding partner on the solid phase. 
The biological sample is then contacted with the immobilized binding partner and the amount of 
analyte in the sample bound to the binding partner determined. 

In still another embodiment, the protein of the invention or part thereof may be used to 
1 5 construct multimerization devices comprising hybrid molecules with a functional domain fused to a 
multimerization domain in order to yield multimeric complexes with improved pharmacokinetic and 
pharmacological properties as described in WOO 102440, which disclosure is hereby incorporated 
by reference in its entirety. In a preferred embodiment, the protein of the invention or part thereof 
may be used to construct different fusion proteins with different functional domains such as enzyme 
20 moieties or cytotoxic moieties. Vectors encoding these different proteins may then be transfected in 
the same host cell in conditions allowing for multimerization, thus yielding multimeric 
multifunctional complexes. 

Chaperone Protein of SEP ID NO: 303 (internal designation D637548) 

25 The protein of SEQ ID NO : 303 encoded by the cDNA of SEQ ID NO: 134 is a chaperonin. 

Accordingly, the protein of SEQ ID NO:303 plays a role in protein synthesis/folding, cellular 
trafficking , and the cellular stress response. In addition, the protein of SEQ ID No: 303 has 
immunosupressant and growth factor properties. It is able to depress delayed type hypersensitivity 
reactions. It is a product of primary and neoplastic cell proliferation and under these conditions acts 

30 as a growth factor. It is also a product of platelet activation and may play a part in wound healing 
and skin repair. Preferred polypeptides of the invention are polypeptides comprising the amino 
acids of SEQ ID NO:303 from positions 9 to 33, or from positions 7 to 101 . Other preferred 
polypeptides of the invention are fragments of SEQ ID NO: 303 having any of the biological 
activities described herein. The different activities of the protein of the invention or part thereof 

35 may be assayed using any of the assays described in US 6,1 17,421 or any of the assays referred into 
US, 61 17,421, which disclosures are hereby incorporated by reference in their entireties. 

Chaperonins belong to a wider class of molecular chaperones, molecules involved in post- 
translational folding, targeting and assembly of other proteins, but which do not themselves form 
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part of the final assembled structure as discussed by Ellis et al., 1991, Annu. Rev. Biochem. 60 321- 
347, which disclosures are hereby incorporated by reference in their entireties. Most molecular 
chaperones are "heat shock" or "stress" proteins (hsp); i.e. their production is induced or increased 
by a variety of cellular insults (such as metabolic disruption, oxygen radicals, inflammation, 
5 infection and transformation), heat being only one of the better studies stresses as reviewed by 
Lindquist et al., 1988, Annu. Rev. Genet. 22 631-677, which disclosure is hereby incorporated by 
reference in its entirety. As well as these quantitative changes in specific protein levels, stress can 
induce the movement of constitutively produced stress proteins to different cellular compartments 
as referred to in the Lindquist reference mentioned above. The heat shock response is one of the 

10 most highly conserved genetic system known and the various heat shock protein families are among 
the most evolutionarily stable proteins in existence. The major stress proteins accumulate to very 
high levels in stressed cells but occur at low to moderate levels in cells that have not been stressed. 
As well as enabling cells to cope under adverse conditions, members of these families perform 
essential functions in normal cells. 

15 Chaperones are also involved in a number of disorders, especially autoimmune diseases 

such as type 1 diabetes, rheumatoid arthritis, systemic lupus erythematosus, Sjogren syndrome, and 
mixed connective tissue disease (Feige et al. EXS 1996; 77:359-73; Feili-Hariri et al. J Autoimmun 
2000; 14:133-42, which disclosures are hereby incorporated by reference in their entireties). 
Chaperones are also involved in various disorders including tuberculosis and leprosy (Zugel et al. 

20 Clin Microbiol Rev 1999; 12:19-39), neurogenerative disorders such as Alzheimer and Parkinson 
diseases (Yoo et al. J Neural Transm Suppl 1999; 57:3 15-22), and malignant disorders (Csermely et 
al. Pharmacol Ther 1998; 79:129-68), which disclosures are hereby incorporated by reference in 
their entireties. 

In one embodiment, the protein of the invention or part thereof may be used to detect a 
25 potential pregnancy, preferably within 6-24 hours of fertilization using the teaching of Morton et al., 
1976, Proc. R. Soc. B. 193 413-41 and US 6,1 17,421, which disclosures are hereby incorporated by 
reference in their entireties. Detection of the expression or activity of the protein of the invention 
may be performed using any techniques known to those skilled in the art including those described 
elsewhere in the application. 
30 In another embodiment, molecules able to block the expression or activity of the protein of 

the invention, such as antibodies, antisense or triple helix oligonucleotides, dominant negative 
forms of the protein, polypeptides or small molecule inhibitors of the expression or activity of the 
proteins, may be used to induce abortion as described in US patent 6,1 17,421. 

In still another embodiment, the protein of the invention or part thereof may be used to treat 
35 and/or prevent infertility and miscarriage using the simple administration of the protein of the 
invention or part thereof or using any of the gene therapy methods described elsewhere in the 
application and the teaching of the US patent 6,1 1 7,421 . 

In another embodiment, the present invention provide methods of using the present proteins 
to identify specific cell types in vitro and in vivo. For example, as chaperone proteins are often 
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upregulated in response to cellular stress, the detection of cells expressing elevated levels of the 
proteins provides a tool for detecting cells under stress. As cellular stress has been implicated in a 
number of disorders, such as cardiovascular disorders, neurodegenerative disorders, and cancer, the 
ability to detect such stress thus provides a diagnostic or screening tool for such conditions. 
5 In addition, the present polypeptides and polynucleotides can be used to develop diagnostic 

and screening assays for diseases characterized by an abnormal level or activity of the protein of 
SEQ ID NO: 303 such as malignant disorders of various types, and autoimmune diseases including 
type 1 diabetes, rheumatoid arthritis, systemic lupus erythematosus, Sjogren syndrome, Graves 
disease, multiple sclerosis, and mixed connective tissue disease. Such assays can be performed 

10 using any biological sample, such as serum or plasma. 

In another embodiment, various disorders can be treated, attenuated and/or prevented by a 
protein of SEQ ID NO: 303, or part thereof, or any other compound that can affect the level or 
activity of the proteins such as nucleic acids, antibodies, or chemical substances. In a preferred 
embodiment, proteins or other compounds directed to the proteins of the invention can be used to 

1 5 treat or prevent disorders in which the activity or level of the protein of SEQ ID NO: 303 is 
unbalanced. Such diseases include, but are not limited to, infectious diseases, neurogenerative 
disorders as Alzheimer and Parkinson diseases, schizophrenia, alopecia, aging, atherosclerosis, 
malignant disorders of various types, and autoimmune diseases including type 1 diabetes, 
rheumatoid arthritis, systemic lupus erythematosus, Sjogren syndrome, mixed connective tissue 

20 disease, malignant disorders, autoimmune and any other neurodegenerative disorder. In another 
embodiment, the proteins of SEQ ID NO: 303 or part thereof can be used as vaccines for various 
disorders including, but not limited, to cancer (Wang et al. Immunol Invest 2000;29:131-7), 
tuberculosis (Silva et al. Microbes Infect 1999;1 :429-35), diabetes (Int Immunol 1999;1 1 :957-66), 
and atherosclerosis (Xu et al. Arterioscler Thromb 1992;12:789-99), which disclosures are hereby 

25 incorporated by reference in their entireties. 

One embodiment of the present invention relates to methods and compositions using the 
protein of SEQ ID NO:303 or fragments thereof as a stabilizing adjuvant to slow down protein 
degradation, boost the yields of recombinant proteins, prevent the aggregation of proteins or 
regenerate denatured proteins. In a preferred embodiment, the protein of SEQ ID NO:303 of 

30 fragment thereof is mixed with a composition comprising the protein for which it is desired to slow 
down degradation, boost yield, or regenerate denatured proteins under conditions which facilitate 
the desired result. For example, numerous commercial assay kits commonly used by those skilled 
in the arts of molecular biology and biochemistry depend on the biological properties of proteins 
(mostly enzymes) which can be very short-lived in vitro due to the low stability of those proteins. 

35 An example is described in Eur. Patent DE41 24286, the disclosure of which is incorporated herein 
by reference in its entirety, wherein the low intrinsic stability of test solutions used in optical tests is 
increased by addition of chaperone proteins, thus making the test more sensitive. Another example 
is given in US patent 6,013,488, which disclosure is hereby incorporated by reference in its entirety, 
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wherein a heat-labile reverse transcriptase is able to perform cDNA synthesis at high temperature 
levels in the presence of a chaperone. 

The protein of SEQ ID NO:303 may also be used to increase the yield or activity of 
recombinant proteins, preferably secreted proteins. In recombinant DNA technology, a major 
5 unsolved problem is the solubility and biological activity of the recombinantly overexpressed 
protein in a host, especially a bacterial or yeast host. Many eukaryotic proteins, especially the 
secreted ones, require for correct folding a specific cellular machinery which is lacking in bacterial 
hosts such as E. coli or becomes insufficient in mammalian/yeast cells due to high expression of the 
protein. The ability of the protein of SEQ ID NO:303 or fragments thereof to ensure proper folding 

10 of recombinant proteins may be utilized as follows. The protein of SEQ ID NO:303, or fragment 
thereof, may be coexpressed with the recombinant protein in bacterial or eukaryotic hosts to cause 
the hosts to express the heterologous proteins or polypeptides in a form having increased solubility 
and/or biological activity. For example, the protein of SEQ ID NO:303 or fragments thereof may 
be used in the methods described in US patent 5,773,245, the disclosure of which is incorporated 

15 herein by reference in its entirety. Therefore the invention relates to a method for the correct 

folding, deaggregation or prevention of aggregation of a monomeric protein in vivo comprising: (a) 
constructing a host cell transformed with (i) a first DNA encoding a polypeptide having the amino 
acid sequence of a bioactive protein or a precursor thereof, wherein said polypeptide or precursor 
can aggregate within the cell to result in a multimeric, non-bioactive protein or precursor thereof 

20 and (ii) a second DNA which enable the cell to co-express the protein of the invention or part 

thereof with the said polypeptide or precursor, (b) growing said host cell for sufficient time under 
conditions wherein said first DNA and said second DNA express said bioactive protein and said 
protein of the invention, respectively; and (c) obtaining monomeric protein that is a bioactive 
protein. Alternatively the protein of SEQ ID NO:303 or fragments thereof may be exogeneously 

25 added to the cell cultures as described in PCT application WO 00/08135, the disclosure of which is 
incorporated herein by reference in its entirety. 

The protein of SEQ ID NO:303 or fragments thereof, may further be used to regenerate 
denatured proteins. Recombinantly expressed proteins with poor biologival activity are routinely 
denatured with a potent denaturing agent, such as guanidine hydrochloride, followed by refolding 

30 by dilution with a large amount of a diluent to reduce the concentration of the denaturing agent. 
However, this method often results in a poor refolding rate which may be significantly increased by 
addition of a cocktail of chaperone proteins in a fashion similar to that described in Eur. Patent 
EP0650975, the disclosure of which is incorporated herein by reference in its entirety. The 
advantage of using a cocktail of chaperone proteins is to accommodate differences in binding 

35 specificity of the Hsp different families and the different members within each family. 

In another embodiment of the present invention, the protein of SEQ ID NO:303 may be 
used to promote tissue repair and/or increase cell survival in stress conditions such as hypoxy, 
oxidative stress, genotoxic agents and more generally harmful conditions leading to programmed 
cell death. Those conditions include but are not limited to infarction, heart surgery, stroke, 
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neurodegenerative diseases, epilepsy, trauma, atherosclerosis, restenosis after angioplasty, and 
nerve damage. 

In addition, the invention relates to compositions and methods for promoting cell growth 
both in vitro and in vivo using any of the techniques known to those skilled in the art including 
5 those described in the US patent 6,1 1 7,421 . For example, soluble forms of the protein of the 
invention or part thereof may be added to cell culture medium in an amount effective to stimulate 
cell proliferation. Alternatively, any of the gene therapy methods described herein may be used to 
overexpress the protein of the invention or part thereof in vivo. Alternatively, the protein of the 
invention or part thereof may be directly administered in an amount effective to promoting cell 

10 growth in said subject. These applications are particularly important in individuals suffering from 
wounds or tissue damage to enhance tissue repair, in individuals to which organ or skin grafts have 
been applied, in individuals suffering from an inflammatory condition or an allergic disease. 

In addition, the invention relates to compositions and methods for promoting 
immunosupression in a subject using any of the techniques known to those skilled in the art 

1 5 including those described in the US patent 6, 1 1 7,42 1 . For example, the protein of the invention or 
part thereof may be directly administered in an amount effective to achieve immunosupression in 
said subject. Alternatively, any of the gene therapy methods described herein may be used to 
overexpress the protein of the invention or part thereof in vivo. These applications are particularly 
important in cases in which immunosupression is desired such as in individuals suffering from 

20 autoimmune disease including any of the diseases cited above and in individuals that have received 
an heterologous graft they could reject. 

Ion transport Protein of SEP ID NO: 276 (internal designation D538694) 

The protein of SEQ ID NO : 276 encoded by the cDNA of SEQ ID NO: 107 belongs to the 
25 FXDY family of small ion transport regulators or channels (Sweadner and Rael (2000) Genomics 
68:41-56, which disclosure is hereby incorporated by reference in its entirety). The protein of SEQ 
ID NO: 276 or part thereof plays a role in the control of ion transport. Preferred polypeptides of the 
invention are polypeptides comprising the amino acids of SEQ ID NO:276 from positions 9 to 63, 
or from positions 16 to 29. Other preferred polypeptides of the invention are fragments of SEQ ID 
30 NO: 276 having any of the biological activities described herein. The activity of the protein of the 
invention or part thereof may be assayed using any of the assays known to those skilled in the art 
including those described in . 

Osmoregulation occurs in all organisms, though the mechanisms differ according to the 
organism's environment. Fresh water inhabitants need to retain salts, whereas ocean inhabitants 
35 need to retain water. Terrestrial inhabitants need to conserve both water and salts. Organisms must 
balance these needs with a requirement to eliminate metabolic waste, such as nitrogenous waste, 
and generate secreted body fluids, such as saliva for digestion and sweat for thermoregulation. 
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In mammals, sweat glands, salivary glands, and the kidney all produce a primary secretion 
that is essentially isosmotic with blood and extracellular fluids. Modification of this primary 
secretion then occurs as much of the sodium chloride and water are reabsorbed as they pass through 
the excretory ducts of the glands and kidney, whereas potassium and bicarbonate ions are secreted. 
5 This modification of the primary secretion is important in the sweat glands to conserve sodium 
chloride in hot environments, and in the salivary glands to conserve sodium chloride when 
excessive quantities of saliva are lost. This modification is critical in the kidney to maintain proper 
sodium and water balance in the extracellular fluids, a balance which also regulates arterial 
pressure. Loss of this modification activity by the duct cells causes a large loss of sodium and water, 

10 resulting in severe dehydration and low blood volume, and ultimately to circulatory collapse. 

Sodium absorption by the intestines, especially in the colon, is necessary to prevent loss of 
sodium in the stools. The loss of sodium absorption produces a failure to absorb anions and water as 
well. The unabsorbed sodium chloride and water then lead to diarrhea, with further loss of sodium 
chloride from the body. Other body fluids may be under regulation similar to that seen in the 

15 systems described above. For example, cerebrospinal fluid is produced by active sodium ion 
transport from the capillaries across the epithelium of the choroid plexus, which in turn attracts 
chloride ions and water. A counter flow of potassium and bicarbonate ions move out of the 
cerebrospinal fluid into the capillaries. A dysfunction in osmoregulation is associated with several 
disease states, including hyponatremia, renal failure, and hypernatremia. (Strange, K. (1992) J Am. 

20 Soc. Nephrol. 3:12-27, which disclosure is hereby incorporated by reference in its entirety). 

In one embodiment, the protein of the invention may be useful in the diagnosis, prevention 
and/or treatment of osmoregulatory disorders including but not limited to diabetes insipidus, 
diarrhea, peritonitis, chronic renal failure, Addison's disease, syndrome of inappropriate antidiuretic 
hormone (SIADH), hypoaldosteronism, hyponatremia, adrenal insufficiency, hypothyroidism, 

25 hypernatremia, hypokalemia, Barter's syndrome, Cushing's syndrome, metabolic acidosis, metabolic 
alkalosis, encephalopathy, edema, hypotension, and hypertension. For example, any of the gene 
therapy methods described herein may be used to overexpress the protein of the invention or part 
thereof in vivo. Alternatively, the protein of the invention or part thereof may be directly 
administered in an amount effective to promoting cell growth in said subject. For diagnostic 

30 purposes, the expression of the protein of the invention could be investigated using any of the 
Northern blotting, RT-PCR or immunoblotting methods described herein and compared to the 
expression in control individuals. For prevention and/or treatment purposes, the protein of the 
invention may be used to enhance ion transport and prevent or treat osmoregulatory disorders using 
any of the gene therapy methods described herein. 

35 
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Antibodies of the present invention have uses that include, but are not limited to, methods 
known in the art to purify, detect, and target the polypeptides of the present invention including 
both in vitro and in vivo diagnostic and therapeutic methods. An example of such use using 
5 immunoaffinity chromatography is given below. The antibodies of the present invention may be 
used either alone or in combination with other compositions. For example, the antibodies have use 
in immunoassays for qualitatively and quantitatively measuring levels of antigen-bearing substances, 
including the polypeptides of the present invention, in biological samples {See, e.g., Harlow et al., 
1988). (Incorporated by reference in the entirety). The antibodies may also be used in therapeutic 

1 0 compositions for killing cells expressing the protein or reducing the levels of the protein in the body. 
The invention further relates to antibodies that act as agonists or antagonists of the 
polypeptides of the present invention. For example, the present invention includes antibodies that 
disrupt the receptor/1 igand interactions with the polypeptides of the invention either partially or 
fully. Included are both receptor-specific antibodies and ligand-specific antibodies. Included are 

15 receptor-specific antibodies, which do not prevent ligand binding but prevent receptor activation. 
Receptor activation (i.e., signaling) may be determined by techniques described herein or otherwise 
known in the art. Also include are receptor-specific antibodies which both prevent ligand binding 
and receptor activation. Likewise, included are neutralizing antibodies that bind the ligand and 
prevent binding of the ligand to the receptor, as well as antibodies that bind the ligand, thereby 

20 preventing receptor activation, but do not prevent the ligand from binding the receptor. Further 

included are antibodies that activate the receptor. These antibodies may act as agonists for either all 
or less than all of the biological activities affected by ligand-mediated receptor activation. The 
antibodies may be specified as agonists or antagonists for biological activities comprising specific 
activities disclosed herein. The above antibody agonists can be made using methods known in the 

25 art. See e.g., WO 96/40281; US Patent 5,81 1,097; Deng et al. (1998); Chen et ah (1998); Harrop et 
al. (1998); Zhu et al (1998); Yoon et al. (1998); Prat et al. (1998); Pitard et ah (1997); Liautard et 
al (1997); Carlson et al. (1997); Taryman et al. (1995); Muller et al. (1998); Bartunek et al. (1996) 
(said references incorporated by reference in their entireties). 

As discussed above, antibodies of the polypeptides of the invention can, in turn, be utilized 

30 to generate anti-idiotypic antibodies that "mimic" polypeptides of the invention using techniques 
well known to those skilled in the art (See, e.g. Greenspan and Bona (1989) and Nissinoff (1991), 
which disclosures are hereby incorporated by reference in their entireties). For example, antibodies 
which bind to and competitively inhibit polypeptide multimerization or binding of a polypeptide of 
the invention to ligand can be used to generate anti-idiotypes that "mimic" the polypeptide 

35 multimerization or binding domain and, as a consequence, bind to and neutralize polypeptide or its 
ligand. Such neutralization anti-idiotypic antibodies can be used to bind a polypeptide of the 
invention or to bind its ligands/receptors, and thereby block its biological activity. 



181 



WO 02/083898 

Immunoaffinitv Chromatography 



PCT/IB01/00914 



Antibodies prepared as described herein are coupled to a support. Preferably, the antibodies 
are monoclonal antibodies, but polyclonal antibodies may also be used. The support may be any of 
those typically employed in immunoaffinity chromatography, including Sepharose CL-4B 
5 (Pharmacia, Piscataway, NJ), Sepharose CL-2B (Pharmacia, Piscataway, NJ), Affi-gel 10 (Biorad, 
Richmond, CA), or glass beads. 

The antibodies may be coupled to the support using any of the coupling reagents typically 
used in immunoaffinity chromatography, including cyanogen bromide. After coupling the antibody 
to the support, the support is contacted with a sample which contains a target polypeptide whose 
10 isolation, purification or enrichment is desired. The target polypeptide may be a polypeptide 
selected from the group consisting of sequences of SEQ ID NOs: 1 70-338, 456-560, 785-91 8 and 
polypeptides encoded by the clone inserts of the deposited clone pool, variants and fragments 
thereof, or a fusion protein comprising said selected polypeptide or a fragment thereof. 

Preferably, the sample is placed in contact with the support for a sufficient amount of time 
1 5 and under appropriate conditions to allow at least 50% of the target polypeptide to specifically bind 
to the antibody coupled to the support. 

Thereafter, the support is washed with an appropriate wash solution to remove polypeptides 
which have non-specifically adhered to the support. The wash solution may be any of those 
typically employed in immunoaffinity chromatography, including PBS, Tris-lithium chloride buffer 
20 (0.1M lysine base and 0.5M lithium chloride, pH 8.0), Tris-hydrochloride buffer (0.05M Tris- 
hydrochloride, pH 8.0), or Tris/Triton/NaCl buffer (50mM Tris.cl, pH 8.0 or 9.0, 0.1% Triton X- 
100, and 0.5MNaCl). 

After washing, the specifically bound target polypeptide is eluted from the support using 
the high pH or low pH elution solutions typically employed in immunoaffinity chromatography. In 
25 particular, the elution solutions may contain an eluant such as triethanolamine, diethylamine, 
calcium chloride, sodium thiocyanate, potasssium bromide, acetic acid, or glycine. In some 
embodiments, the elution solution may also contain a detergent such as Triton X-100 or octyl-beta- 
D-glucoside. 

Expression of Genset gene products 

30 Spatial expression of the GENSET genes of the invention 

Tissue expression of the cDNAs of the present invention was examined. Tables III and IV 
lists the number of hits for the cDNAs in Genset's libraries of tissues and cell types as well as in 
public databases. The tissues and cell types examined for polynucleotide expression were, for 
Table III: Brain; Fetal brain; Fetal kidney; Fetal liver; Pituitary gland; Liver; Placenta; Prostate; 
35 Salivary gland; Stomach/Intestine; and Testis. For each cDNA referred to by its corresponding 

sequence identification number from the priority application (see Table I for corresponding SEQ ID 
NO in present application), the number of proprietary 5'ESTs (i.e. cDNA fragments) expressed in a 
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particular tissue referred to by its name is indicated in parentheses (second column). In addition, the 
bias in the spatial distribution of the polynucleotide sequences of the present invention was 
examined by comparing the relative proportions of the biological polynucleotides of a given tissue 
using the following statistical analysis. The under- or over-representation of a polynucleotide of a 
5 given cluster in a given tissue was performed using the normal approximation of the binomial 
distribution. When the observed proportion of a polynucleotide of a given tissue in a given 
consensus had less than 1% chance to occur randomly according to the chi2 test, the frequency bias 
was reported as "preferred". The results are given in Table V as follows. For each polynucleotide 
showing a bias in tissue distribution as referred to by its sequence identification number in the first 
10 column, the list of tissues where the polynucleotides are under-represented is given in the second 
column entitled "low expression" and the list of tissues where the polynucleotides are over- 
represented is given in the third column entitled "high expression". 

Evaluation of Expression Levels and Patterns of GEN SET polypeptide-encoding mRNAs 

The spatial and temporal expression patterns of GENSET polypeptide-encoding mRNAs, as 

15 well as their expression levels, may also be further determined as follows. 

Expression levels and patterns of GENSET polypeptide-encoding mRNAs may be analyzed 
by solution hybridization with long probes as described in International Patent Application No. WO 
97/05277, the entire contents of which are hereby incorporated by reference. Briefly, a GENSET 
polynucleotide, or fragment thereof, corresponding to the gene encoding the mRNA to be 

20 characterized is inserted at a cloning site immediately downstream of a bacteriophage (T3, T7 or 
SP6) RNA polymerase promoter to produce antisense RNA. Preferably, the GENSET 
polynucleotide is at least a 100 nucleotides in length. The plasmid is linearized and transcribed in 
the presence of ribonucleotides comprising modified ribonucleotides (i.e. biotin-UTP and DIG- 
UTP). An excess of this doubly labeled RNA is hybridized in solution with mRNA isolated from 

25 cells or tissues of interest. The hybridizations are performed under standard stringent conditions 
(40-50°C for 16 hours in an 80% formamide, 0.4 M NaCl buffer, pH 7-8). The unhybridized probe 
is removed by digestion with ribonucleases specific for single-stranded RNA (i.e. RNases CL3, Tl, 
Phy M, U2 or A). The presence of the biotin-UTP modification enables capture of the hybrid on a 
microtitration plate coated with streptavidin. The presence of the DIG modification enables the 

30 hybrid to be detected and quantified by ELISA using an anti-DIG antibody coupled to alkaline 
phosphatase. 

The GENSET polypeptide-encoding cDNAs, or fragments thereof, may also be tagged with 
nucleotide sequences for the serial analysis of gene expression (SAGE) as disclosed in UK Patent 
Application No. 2 305 241 A, the entire contents of which are incorporated by reference. In this 
35 method, cDNAs are prepared from a cell, tissue, organism or other source of nucleic acid for which 
it is desired to determine gene expression patterns. The resulting cDNAs are separated into two 
pools. The cDNAs in each pool are cleaved with a first restriction endonuclease, called an 
"anchoring enzyme," having a recognition site which is likely to be present at least once in most 
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cDNAs. The fragments which contain the 5' or 3' most region of the cleaved cDNA are isolated by 
binding to a capture medium such as streptavidin coated beads. A first oligonucleotide linker 
having a first sequence for hybridization of an amplification primer and an internal restriction site 
for a "tagging endonuclease" is ligated to the digested cDNAs in the first pool. Digestion with the 
5 second endonuclease produces short "tag" fragments from the cDNAs. A second oligonucleotide 
having a second sequence for hybridization of an amplification primer and an internal restriction 
site is ligated to the digested cDNAs in the second pool. The cDNA fragments in the second pool 
are also digested with the "tagging endonuclease" to generate short "tag" fragments derived from 
the cDNAs in the second pool. The "tags" resulting from digestion of the first and second pools 

1 0 with the anchoring enzyme and the tagging endonuclease are ligated to one another to produce 
"ditags." In some embodiments, the ditags are concatamerized to produce ligation products 
containing from 2 to 200 ditags. The tag sequences are then determined and compared to the 
sequences of the GENSET polypeptide-encoding cDNAs to determine which genes are expressed in 
the cell, tissue, organism, or other source of nucleic acids from which the tags were derived. In this 

1 5 way, the expression pattern of a GENSET polypeptide-encoding gene in the cell, tissue, organism, 
or other source of nucleic acids is obtained. 

Quantitative analysis of GENSET gene expression may also be performed using arrays. 
For example, quantitative analysis of gene expression may be performed with GENSET 
polynucleotides, or fragments thereof in a complementary DNA microarray as described by Schena 

20 et al. (1995 and 1996) which disclosures are hereby incorporated by reference in their entireties. 
GENSET polypeptide-encoding cDNAs or fragments thereof are amplified by PCR and arrayed 
from 96-well microtiter plates onto silylated microscope slides using high-speed robotics. Printed 
arrays are incubated in a humid chamber to allow rehydration of the array elements and rinsed, once 
in 0.2% SDS for 1 min, twice in water for 1 min and once for 5 min in sodium borohydride solution. 

25 The arrays are submerged in water for 2 min at 95°C, transferred into 0.2% SDS for 1 min, rinsed 
twice with water, air dried and stored in the dark at 25°C. Cell or tissue mRNA is isolated or 
commercially obtained and probes are prepared by a single round of reverse transcription. Probes 
are hybridized to 1 cm 2 microarrays under a 14 x 14 mm glass coverslip for 6-12 hours at 60°C. 
Arrays are washed for 5 min at 25°C in low stringency wash buffer (IX SSC/0.2% SDS), then for 

30 10 min at room temperature in high stringency wash buffer (0.1X SSC/0.2% SDS). Arrays are 
scanned in 0.1 X SSC using a fluorescence laser scanning device fitted with a custom filter set. 
Accurate differential expression measurements are obtained by taking the average of the ratios of 
two independent hybridizations. 

Quantitative analysis of the expression of genes may also be performed with GENSET 

35 polypeptide-encoding cDNAs or fragments thereof in complementary DNA arrays as described by 
Pietu et al. (1996), which disclosure is hereby incorporated by reference in its entirety. The 
GENSET polynucleotides of the invention or fragments thereof are PCR amplified and spotted on 
membranes. Then, mRNAs originating from various tissues or cells are labeled with radioactive 
nucleotides. After hybridization and washing in controlled conditions, the hybridized mRNAs are 
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detected by phospho-imaging or autoradiography. Duplicate experiments are performed and a 
quantitative analysis of differentially expressed mRNAs is then performed. 

Alternatively, expression analysis of GENSET genes can be done through high density 
nucleotide arrays as described by Lockhart et ah (1996) and Sosnowski et ah (1997), which 
5 disclosures are hereby incorporated by reference in their entireties. Oligonucleotides of 15-50 
nucleotides corresponding to sequences of a GENSET polynucleotide or fragments thereof are 
synthesized directly on the chip (Lockhart et ah, supra) or synthesized and then addressed to the 
chip (Sosnowski et ah, supra). Preferably, the oligonucleotides are about 20 nucleotides in length. 
cDNA probes labeled with an appropriate compound, such as biotin, digoxigenin or fluorescent dye, 

10 are synthesized from the appropriate mRNA population and then randomly fragmented to an 
average size of 50 to 100 nucleotides. The said probes are then hybridized to the chip. After 
washing as described in Lockhart et ah, (supra) and application of different electric fields 
(Sosnowsky et ah, supra), the dyes or labeling compounds are detected and quantified. Duplicate 
hybridizations are performed. Comparative analysis of the intensity of the signal originating from 

15 cDNA probes on the same target oligonucleotide in different cDNA samples indicates a differential 
expression of the GENSET polypeptide-encoding mRNA. 

Uses of GENSET gene expression data 

Once the expression levels and patterns of a GENSET polypeptide-encoding mRNA has 
been determined using any technique known to those skilled in the art, in particular those described 
20 in the section entitled "Evaluation of Expression Levels and Patterns of GENSET polypeptide- 
encoding mRNAs", or using the instant disclosure, these information may be used to design 
GENSET gene specific markers for detection, identification, screening and diagnosis purposes as 
well as to design DNA constructs with an expression pattern similar to a GENSET gene expression 
pattern. 

25 Detection of GENSET polypeptide expression and/or biological activity 

The invention further relates to methods of detection of GENSET polypeptide expression 
and/or biological activity in a biological sample using the polynucleotide and polypeptide sequences 
described herein. Such method scan be used, for example, as a screen for normal or abnormal 
GENSET polypeptide expression and/or biological activity and, thus, can be used diagnostically. 
30 The biological sample for use in the methods of the present invention includes a suitable sample 
from, for example, a mammal, particularly a human. For example, the sample can be issued from 
tissues or cell lines having the same origin as tissues or cell lines in which the polypeptide is known 
to be expressed, e.g. using data from Tables III, IV, or V. 

Detection of GENSET polypeptides 

35 The invention further relates to methods of detection of GENSET polypeptide or encoding 

polynucleotides in a sample using the sequences described herein and any techniques known to 
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those skilled in the art. For example, a labeled polynucleotide probe having all or a functional 
portion of the nucleotide sequence of a GENSET polypeptide-encoding polynucleotide can be used 
in a method to detect a GENSET polypeptide-encoding polynucleotide in a sample. In one 
embodiment, the sample is treated to render the polynucleotides in the sample available for 
5 hybridization to a polynucleotide probe, which can be DNA or RNA. The resulting treated sample 
is combined with a labeled polynucleotide probe having all or a portion of the nucleotide sequence 
of the GENSET polypeptide-encoding cDNA or genomic sequence, under conditions appropriate 
for hybridization of complementary sequences to occur. Detection of hybridization of 
polynucleotides from the sample with the labeled nucleic probe indicates the presence of GENSET 

10 polypeptide-encoding polynucleotides in a sample. The presence of GENSET polypeptide- 
encoding mRNA is indicative of GENSET polypeptide-encoding gene expression. 

Consequently, the invention comprises methods for detecting the presence of a 
polynucleotide comprising a nucleotide sequence selected from a group consisting of the sequences 
of SEQ IDNOs:l-169, 339-455, 561-784, the sequences of clone inserts of the deposited clone 

1 5 pool, sequences fully complementary thereto, fragments and variants thereof in a sample. In a first 
embodiment, said method comprises the following steps of: 

a) bringing into contact said sample and a nucleic acid probe or a plurality of nucleic acid 
probes which hybridize to said selected nucleotide sequence; and 

b) detecting the hybrid complex formed between said probe or said plurality of probes and 
20 said polynucleotide. 

In a preferred embodiment of the above detection method, said nucleic acid probe or said 
plurality of nucleic acid probes is labeled with a detectable molecule. In another preferred 
embodiment of the above detection method, said nucleic acid probe or said plurality of nucleic acid 
probes has been immobilized on a substrate. In still another preferred embodiment, said nucleic 
25 acid probe or said plurality of nucleic acid probes has a sequence comprised in a sequence 
complementary to said selected sequence. 

In a second embodiment, said method comprises the steps of: 

a) contacting said sample with amplification reaction reagents comprising a pair of 
amplification primers located on either side of the region of said nucleotide sequence to be 

30 amplified; 

b) performing an amplification reaction to synthesize amplification products containing said 
region of said selected nucleotide sequence; and 

c) detecting said amplification products. 

In a preferred embodiment of the above detection method, when the polynucleotide to be 
35 amplified is a RNA molecule, preliminary reverse transcription and synthesis of a second cDNA 
strand are necessary to provide a DNA template to be amplified. In another preferred embodiment 
of the above detection method, the amplification product is detected by hybridization with a labeled 
probe having a sequence which is complementary to the amplified region. In still another preferred 
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embodiment, at least one of said amplification primer has a sequence comprised in said selected 
sequence or in the sequence complementary to said selected sequence. 

Alternatively, a method of detecting GENSET polypeptide expression in a test sample can 
be accomplished using any product which binds to a GENSET olypeptide of the present invention 
5 or a portion of a GENSET polypeptide. Such products may be antibodies, binding fragments of 
antibodies, polypeptides able to bind specifically to GENSET polypeptides or fragments thereof, 
including GENSET polypeptide agonists and antagonists. Detection of specific binding to the 
antibody indicates the presence of a GENSET polypeptide in the sample (e.g., ELISA). 

Consequently, the invention is also directed to a method for detecting specifically the 
10 presence of a GENSET polypeptide according to the invention in a biological sample, said method 
comprising the steps of: 

a) bringing into contact said biological sample with a product able to bind to a polypeptide 
of the invention or fragments thereof; 

b) allowing said product to bind to said polypeptide to form a complex; and 
1 5 b) detecting said complex. 

In a preferred embodiment of the above detection method, the product is an antibody. In a 
more preferred embodiment, said antibody is labeled with a detectable molecule. In another more 
preferred embodiment of the above detection method, said antibody has been immobilized on a 
substrate. 

20 In addition, the invention also relates to methods of determining whether a GENSET gene 

product (e.g. a polynucleotide or polypeptide) is present or absent in a biological sample, said 
methods comprising the steps of: 

a) obtaining said biological sample from a human or non-human animal, preferably a 
mammal; 

25 b) contacting said biological sample with a product able to bind to a GENSET polypeptide 

or encoding polynucleotide of the invention; and 

c) determining the presence or absence of said GENSET polypeptide-encoding gene 
product in said biological sample. 

The present invention also relates to kits that can be used in the detection of GENSET 
30 polypeptide-encoding gene expression products. The kit can comprise a compound that specifically 
binds a GENSET polypeptide (e.g. binding proteins, antibodies or binding fragments thereof (e.g. 
F(ab')2 fragments) or a GENSET polypeptide-encoding mRNA (e.g. a complementary probe or 
primer), for example, disposed within a container means. The kit can further comprise ancillary 
reagents, including buffers and the like. 

35 Detection of GENSET polypeptide biological activity 

The invention further includes methods of detecting specifically a GENSET polypeptide 
biological activity, and to identify compounds capable of modulating the activity of a GENSET 
polypeptide. Assessing the GENSET polypeptide biological activity may be performed by the 
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detection of a change in any cellular property associated with the GENSET polypeptide, using a 
variety of techniques, including those described herein. To identify modulators of the polypeptides, 
a control is preferably used. For example, a control sample includes all of the same reagents but 
lacks the compound or agent being assessed; it is treated in the same manner as the test sample. A 
5 number of potentially assayable biological activities for many of the herein-described proteins are 
described supra, under the heading, "Uses of polypeptides of the invention." 

The present invention also relates to kits that can be used in the detection of GENSET 
polypeptide biological activity. The kit can comprise, e.g. substrates for GENSET polypeptides, 
GENSET-binding compounds, antibodies to GENSET polypeptides, etc., for example, disposed 
10 within a container means. The kit can further comprise ancillary reagents, including buffers and the 
like. 

Identification of a specific context of GENSET polvpeptide-encoding gene expression 

When the expression pattern of a GENSET polypeptide-encoding mRNA shows that a 
GENSET polypeptide-encoding gene is specifically expressed in a given context, probes and 

1 5 primers specific for this gene as well as antibodies binding to the GENSET polypeptide-encoding 
polynucleotide may then be used as markers for the specific context. Examples of specific contexts 
are: specific expression in a given tissue/cell or tissue/cell type (see, e.g., Tables III-V) , expression 
at a given stage of development of a process such as embryo development or disease development, 
or specific expression in a given organelle. Such primers, probes, and antibodies are useful 

20 commercially to identify tissues/cells/organelles of unknown origin, for example, forensic samples, 
differentiated tumor tissue that has metastasized to foreign bodily sites, or to differentiate different 
tissue types in a tissue cross-section using any technique known to those skilled in the art including 
in situ PCR or immunochemistry for example. 

For example, the cDNAs and proteins of the sequence listing and fragments thereof, may be 

25 used to distinguish human tissues/cells from non-human tissues/cells and to distinguish between 
human tissues/cells/organelles that do and do not express the polynucleotides comprising the 
cDNAs. By knowing the expression pattern of a given GENSET polypeptide, either through 
routine experimentation or by using the instant disclosure, the polynucleotides and polypeptides of 
the present invention may be used in methods of determining the identity of an unknown tissue/cell 

30 sample/organelle. As part of determining the identity of an unknown tissue/cell sample/organelle, 
the polynucleotides and polypeptides of the present invention may be used to determine what the 
unknown tissue/cell sample is and what the unknown sample is not. For example, if a cDNA is 
expressed in a particular tissue/cell type/organelle, and the unknown tissue/cell sample/organelle 
does not express the cDNA, it may be inferred that the unknown tissue/cells are either not human or 

35 not the same human tissue/cell type/organelle as that which expresses the cDNA. These methods of 
determining tissue/cell/organelle identity are based on methods which detect the presence or 
absence of the mRNA (or corresponding cDNA) in a tissue/cell sample using methods well know in 
the art (e.g., hybridization, PCR based methods, immunoassays, immunochemistry, ELISA). 
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Examples of such techniques are described in more detail below. Therefore, the invention 
encompasses uses of the polynucleotides and polypeptides of the invention as tissue markers. In a 
preferred embodiment, polynucleotides preferentially expressed in given tissues as indicated in 
Tables III-V and polypeptides encoded by such polynucleotides are used for this purpose. The 
5 invention also encompasses uses of polypeptides of the invention as organelle markers. 

Consequently, the present invention encompasses methods of identification of a tissue/cell 
type/subcellular compartment, wherein said method includes the steps of: 

a) contacting a biological sample which identity is to be assayed with a product able to 
bind a GENSET gene product; and 
10 b) determining whether a GENSET gene product is expressed in said biological 

sample. 

Products that are able to bind specifically to a GENSET gene product, namely a GENSET 
polypeptide or a GENSET polypeptide-encoding mRNA, include GENSET polypeptide binding 
proteins, antibodies or binding fragments thereof (e.g. F(ab')2 fragments), as well as GENSET 
1 5 polynucleotide complementary probes and primers. 

Step b) may be performed using any detection method known to those skilled in the art 
including those disclosed herein, especially in the section entitled "Detection of GENSET 
polypeptide expression and/or biological activity". 

Identification of Tissue Types or Cell Species by Means of Labeled Tissue Specific Antibodies 

20 Identification of specific tissues is accomplished by the visualization of tissue specific 

antigens by means of antibody preparations which are conjugated, directly (e.g., green fluorescent 
protein) or indirectly to a detectable marker. Selected labeled antibody species bind to their specific 
antigen binding partner in tissue sections, cell suspensions, or in extracts of soluble proteins from a 
tissue sample to provide a pattern for qualitative or semi-qualitative interpretation. 

25 Antisera for these procedures must have a potency exceeding that of the native preparation, 

and for that reason, antibodies are concentrated to a mg/ml level by isolation of the gamma globulin 
fraction, for example, by ion-exchange chromatography or by ammonium sulfate fractionation. 
Also, to provide the most specific antisera, unwanted antibodies, for example to common proteins, 
must be removed from the gamma globulin fraction, for example by means of insoluble 

30 immunoabsorbents, before the antibodies are labeled with the marker. Either monoclonal or 
heterologous antisera is suitable for either procedure. 

A. Immunohistochemical Techniques 

Purified, high-titer antibodies, prepared as described above, are conjugated to a detectable 
marker, as described, for example, by Fudenberg, (1980) or Rose et al 9 (1980), which disclosures 
35 are hereby incorporated by reference in their entireties. 

A fluorescent marker, either fluorescein or rhodamine, is preferred, but antibodies can also 
be labeled with an enzyme that supports a color producing reaction with a substrate, such as 
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horseradish peroxidase. Markers can be added to tissue-bound antibody in a second step, as 
described below. Alternatively, the specific anti-tissue antibodies can be labeled with ferritin or 
other electron dense particles, and localization of the ferritin coupled antigen-antibody complexes 
achieved by means of an electron microscope. In yet another approach, the antibodies are 
5 radiolabeled, with, for example 125 I, and detected by overlaying the antibody treated preparation 
with photographic emulsion. Preparations to carry out the procedures can comprise monoclonal or 
polyclonal antibodies to a single protein or peptide identified as specific to a tissue type, for 
example, brain tissue, or antibody preparations to several antigenically distinct tissue specific 
antigens can be used in panels, independently or in mixtures, as required. Tissue sections and cell 

10 suspensions are prepared for immunohistochemical examination according to common histological 
techniques. Multiple cryostat sections (about 4 um, unfixed) of the unknown tissue and known 
control, are mounted and each slide covered with different dilutions of the antibody preparation. 
Sections of known and unknown tissues should also be treated with preparations to provide a 
positive control, a negative control, for example, pre-immune sera, and a control for non-specific 

1 5 staining, for example, buffer. Treated sections are incubated in a humid chamber for 30 min at 
room temperature, rinsed, then washed in buffer for 30-45 min. Excess fluid is blotted away, and 
the marker developed. If the tissue specific antibody was not labeled in the first incubation, it can 
be labeled at this time in a second antibody-antibody reaction, for example, by adding fluorescein- 
or enzyme-conjugated antibody against the immunoglobulin class of the antiserum-producing 

20 species, for example, fluorescein labeled antibody to mouse IgG. Such labeled sera are 

commercially available. The antigen found in the tissues by the above procedure can be quantified 
by measuring the intensity of color or fluorescence on the tissue section, and calibrating that signal 
using appropriate standards. 

B. Identification of Tissue Specific Soluble Proteins 

25 The visualization of tissue specific proteins and identification of unknown tissues from that 

procedure is carried out using the labeled antibody reagents and detection strategy as described for 
immunohistochemistry; however the sample is prepared according to an electrophoretic technique 
to distribute the proteins extracted from the tissue in an orderly array on the basis of molecular 
weight for detection. A tissue sample is homogenized using a Virtis apparatus; cell suspensions are 

30 disrupted by Dounce homogenization or osmotic lysis, using detergents in either case as required to 
disrupt cell membranes, as is the practice in the art. Insoluble cell components such as nuclei, 
microsomes, and membrane fragments are removed by ultracentrifugation, and the soluble protein- 
containing fraction concentrated if necessary and reserved for analysis. A sample of the soluble 
protein solution is resolved into individual protein species by conventional SDS polyacrylamide 

35 electrophoresis as described, for example, by Davis et al. 9 Section 19-2 (1986), using a range of 
amounts of polyacrylamide in a set of gels to resolve the entire molecular weight range of proteins 
to be detected in the sample. A size marker is run in parallel for purposes of estimating molecular 
weights of the constituent proteins. Sample size for analysis is a convenient volume of from 5 to55 
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ul, and containing from about 1 to 100 ug protein. An aliquot of each of the resolved proteins is 
transferred by blotting to a nitrocellulose filter paper, a process that maintains the pattern of 
resolution. Multiple copies are prepared. The procedure, known as Western Blot Analysis, is well 
described in Davis et al. 9 (1986) Section 19-3. One set of nitrocellulose blots is stained with 
5 Coomassie Blue dye to visualize the entire set of proteins for comparison with the antibody bound 
proteins. The remaining nitrocellulose filters are then incubated with a solution of one or more 
specific antisera to tissue specific proteins prepared as described herein. In this procedure, as in 
procedure A above, appropriate positive and negative sample and reagent controls are run. 

In either procedure A or B, a detectable label can be attached to the primary tissue antigen- 

10 primary antibody complex according to various strategies and permutations thereof. In a 

straightforward approach, the primary specific antibody can be labeled; alternatively, the unlabeled 
complex can be bound by a labeled secondary anti-IgG antibody. In other approaches, either the 
primary or secondary antibody is conjugated to a biotin molecule, which can, in a subsequent step, 
bind an avidin conjugated marker. According to yet another strategy, enzyme labeled or radioactive 

1 5 protein A, which has the property of binding to any IgG, is bound in a final step to either the 

primary or secondary antibody. The visualization of tissue specific antigen binding at levels above 
those seen in control tissues to one or more tissue specific antibodies, prepared from the gene 
sequences identified from cDNA sequences, can identify tissues of unknown origin, for example, 
forensic samples, or differentiated tumor tissue that has metastasized to foreign bodily sites. 

20 Screening and diagnosis of abnormal GENSET polypeptide expression and/or biological activity 

Moreover, antibodies and/or primers specific for GENSET polypeptide expression may also 
be used to identify abnormal GENSET polypeptide expression and/or biological activity, and 
subsequently to screen and/or diagnose disorders associated with abnormal GENSET polypeptide 
expression. For example, a particular disease may result from lack of expression, over expression, 

25 or under expression of a GENSET polypeptide-encoding mRNA. By comparing mRNA expression 
patterns and quantities in samples taken from healthy individuals with those from individuals 
suffering from a particular disorder, genes responsible for this disorder may be identified. Primers, 
probes and antibodies specific for this GENSET polypeptide may then be used to elaborate kits of 
screening and diagnosis for a disorder in which the gene of interest is specifically expressed or in 

30 which its expression is specifically dysregulated, i.e. underexpressed or overexpressed. 

Screening/or specific disorders 

The present invention also relates to methods and uses of GENSET polypeptides for 
identifying individuals having elevated or reduced levels of GENSET polypeptides, which 
individuals are likely to benefit from therapies to suppress or enhance GENSET polypeptide- 
35 encoding gene expression, respectively. One example of such methods and uses comprises the 
steps of: 

a) obtaining from a mammal a biological sample; 
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b) detecting the presence in said sample of a GENSET polypeptide-encoding gene product 
(mRNA or protein); 

c) comparing the amount of said GENSET polypeptide-encoding gene product present in 
said sample with that of a control sample; and 

5 d) determing whether said human or non-human mammal has a reduced or elevated level of 

GENSET gene expression compared to the control sample. 

A biological sample from a subject affected by, or at risk of developing, any disease or 
condition associated with a GENSET polypeptide can be screened for the presence of increased or 
decreased levels of GENSET gene product, relative to a normal population (standard or control), 

1 0 with an increased or decreased level of the GENSET polypeptide relative to the normal population 
being indicative of predisposition to or a present indication of the disease or condition, or any 
sympton associated with the disease or condition. Such individuals would be candidates for 
therapies, e.g., treatment with pharmaceutical compositions comprising the GENSET polypeptide, a 
polynucleotide encoding the GENSET polypeptide, or any other compound that affects the 

1 5 expression or activity of the GENSET polypeptide. Generally, the identification of elevated levels 
of the GENSET polypeptide in a patient would be indicative of an individual that would benefit 
from treatment with agents that suppress GENSET polypeptide expression or activity, and the 
identification of low levels of the GENSET polypeptide in a patient would be indicative of an 
individual that would benefit from agents that induce GENSET expression or activity. 

20 Biological samples suitable for use in this method include any biological fluids, including, 

but not limited to, blood, saliva, milk, and urine. Tissue samples (e.g. biopsies) can also be used in 
the method of the invention, including samples derived from any tissue associated with GENSET 
gene expression (see, e.g. Tables III-V). Cell cultures or cell extracts derived, for example, from 
tissue biopsies can also be used. The detection step of the present method can be performed using 

25 standard protocols for protein/mRNA detection. Examples of suitable protocols include Northern 
blot analysis, immunoassays (e.g. RIA, Western blots, immunohistochemical analyses), and PCR. 

Thus, the present invention further relates to methods and uses of GENSET polypeptides 
for identifying individuals or non-human animals at increased risk for developing, or present state 
of having, certain diseases/disorders associated with abnormal GENSET polypeptide expression or 

30 biological activity. One example of such methods comprises the steps of: 

a) obtaining from a human or non-human mammal a biological sample; 

b) detecting the presence in said sample of a GENSET gene product (mRNA or protein); 

c) comparing the amount of said GENSET gene product present in said sample with that of 
a control sample; and 

35 d) determing whether said human or non-human mammal is at increased risk for 

developing, or present state of having, a diseases or disorder. 

In preferred embodiments, the biological sample is taken from animals presenting any 
symptom associated with any disease or condition associated with a GENSET gene product. In 
accordance with this method, the presence in the sample of altered (e.g. increased or decreased) 
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levels of the GENSET product indicates that the subject is predisposed to the disease or condition. 
Biological samples suitable for use in this method include biological fluids including, but not 
limited to, blood, saliva, milk, and urine. Tissue samples (e.g. biopsies) can also be used in the 
method of the invention, including samples derived from any of the tissues listed in Tables HI-V. 
5 Cell cultures or cell extracts derived, for example, from tissue biopsies can also be used. 

The diagnostic methodologies described herein are applicable to both humans and 
non-human mammals. 

Detection of GENSET gene mutations 

The invention also encompasses methods and uses of GENSET polynucleotides to detect 

10 mutations in GENSET polynucleotides of the invention. Such methods may advantageously be 
used to detect mutations occurring in GENSET genes and preferably in their regulatory regions. 
When the mutation was proven to be associated with a disease, the detection of such mutations may 
be used for screening and diagnosis purposes. 

In one embodiment of the oligonucleotide arrays of the invention, an oligonucleotide probe 

15 matrix may advantageously be used to detect mutations occurring in GENSET genes and preferably 
in their regulatory regions. For this particular purpose, probes are specifically designed to have a 
nucleotide sequence allowing their hybridization to the genes that carry known mutations (either by 
deletion, insertion or substitution of one or several nucleotides). By known mutations, it is meant, 
mutations on the GENSET genes that have been identified according, for example to the technique 

20 used by Huang et tf/.(1996) or Samson et #/.(1996), which disclosures are hereby incorporated by 
reference in their entireties. 

Another technique that is used to detect mutations in GENSET genes is the use of a high- 
density DNA array. Each oligonucleotide probe constituting a unit element of the high density 
DNA array is designed to match a specific subsequence of a GENSET genomic DNA or cDNA. 

25 Thus, an array consisting of oligonucleotides complementary to subsequences of the target gene 
sequence is used to determine the identity of the target sequence with the wild gene sequence, 
measure its amount, and detect differences between the target sequence and the reference wild gene 
sequence of the GENSET gene. In one such design, termed 4L tiled array, is implemented a set of 
four probes (A, C, G, T), preferably 15-nucleotide oligomers. In each set of four probes, the perfect 

30 complement will hybridize more strongly than mismatched probes. Consequently, a nucleic acid 
target of length L is scanned for mutations with a tiled array containing 4L probes, the whole probe 
set containing all the possible mutations in the known wild reference sequence. The hybridization 
signals of the 15-mer probe set tiled array are perturbed by a single base change in the target 
sequence. As a consequence, there is a characteristic loss of signal or a "footprint" for the probes 

35 flanking a mutation position. This technique was described by Chee et al. in 1996, which disclosure 
is hereby incorporated by reference in its entirety. 
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Construction of DNA constructs with a GENSET gene expression pattern 



In addition, characterization of the spatial and temporal expression patterns and expression 
levels of GENSET polypeptide-encoding mRNAs is also useful for constructing expression vectors 
capable of producing a desired level of gene product in a desired spatial or temporal manner, as 
5 discussed below. 

DNA Constructs That Direct Temporal And Spatial GENSET Gene Expression In Recombinant Cell 
Hosts And In Transgenic Animals. 

In order to study the physiological and phenotypic consequences of a lack of synthesis of a 
GENSET polypeptide, both at the cellular level and at the multi cellular organism level, the 

10 invention also encompasses DNA constructs and recombinant vectors enabling a conditional 

expression of a specific allele of a GENSET polypeptide-encoding genomic sequence or cDNA and 
also of a copy of this genomic sequence or cDNA harboring substitutions, deletions, or additions of 
one or more bases as regards to a nucleotide sequence selected from the group consisting of 
sequences of SEQ ID NOs: 1-169, 339-455, 561-784 and sequences of clone inserts of the deposited 

1 5 clone pool, or a fragment thereof, these base substitutions, deletions or additions being located 

either in an exon, an intron or a regulatory sequence, but preferably in the 5'-regulatory sequence or 
in an exon of the GENSET polypeptide-encoding genomic sequence or within the GENSET 
polypeptide-encoding cDNA. 

A first preferred DNA construct is based on the tetracycline resistance operon tet from E. 

20 coli transposon TnlO for controlling the GENSET gene expression, such as described by Gossen et 
al. (1992, 1995) and Furth et ah (1994), which disclosures are hereby incorporated by reference in 
their entireties. Such a DNA construct contains seven tet operator sequences from TnlO (tetop) that 
are fused to either a minimal promoter or a 5'-regulatory sequence of the GENSET gene, said 
minimal promoter or said GENSET polynucleotide regulatory sequence being operably linked to a 

25 polynucleotide of interest that codes either for a sense or an antisense oligonucleotide or for a 

polypeptide, including a GENSET polypeptide, or a peptide fragment thereof. This DNA construct 
is functional as a conditional expression system for the nucleotide sequence of interest when the 
same cell also comprises a nucleotide sequence coding for either the wild type (tTA) or the mutant 
(rTA) repressor fused to the activating domain of viral protein VP 16 of herpes simplex virus, placed 

30 under the control of a promoter, such as the HCMVIE1 enhancer/promoter or the MMTV-LTR. 

Indeed, a preferred DNA construct of the invention comprise both the polynucleotide containing the 
tet operator sequences and the polynucleotide containing a sequence coding for the tTA or the rTA 
repressor. In a specific embodiment, the conditional expression DNA construct contains the 
sequence encoding the mutant tetracycline repressor rTA, the expression of the polynucleotide of 

35 interest is silent in the absence of tetracycline and induced in its presence. 

DNA Constructs Allowing Homologous Recombination: Replacement Vectors 
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A second preferred DNA construct will comprise, from 5 '-end to 3 '-end: (a) a first 
nucleotide sequence that is found in the GENSET polypeptide-encoding genomic sequence; (b) a 
nucleotide sequence comprising a positive selection marker, such as the marker for neomycin 
resistance (neo); and (c) a second nucleotide sequence that is found in the GENSET polypeptide- 
5 encoding genomic sequence, and is located on the genome downstream the first GENSET 
polypeptide-encoding nucleotide sequence (a). 

In a preferred embodiment, this DNA construct also comprises a negative selection marker 
located upstream of the nucleotide sequence (a) or downstream from the nucleotide sequence (c). 
Preferably, the negative selection marker comprises the thymidine kinase (tk) gene (Thomas et aL, 
10 1986), the hygromycine beta gene (Te Riele et aL, 1990), the hprt gene ( Van der Lugt et aL, 1991 ; 
Reid et aL, 1990) or the Diphteria toxin A fragment (Dt-A) gene (Nada et aL, 1993; Yagi et 

1990), which disclosures are hereby incorporated by reference in their entireties. Preferably, the 
positive selection marker is located within a GENSET exon sequence so as to interrupt the sequence 
encoding a GENSET polypeptide. These replacement vectors are described, for example, by 
1 5 Thomas et a/.(1986; 1987), Mansour et a/.(1988) and Roller et aL(\992). 

The first and second nucleotide sequences (a) and (c) may be indifferently located within a 
GENSET polypeptide-encoding regulatory sequence, an intronic sequence, an exon sequence or a 
sequence containing both regulatory and/or intronic and/or exon sequences. The size of the 
nucleotide sequences (a) and (c) ranges from 1 to 50 kb, preferably from 1 to 10 kb, more 
20 preferably from 2 to 6 kb and most preferably from 2 to 4 kb. 

DNA Constructs Allowing Homologous Recombination: Cre-LoxP System. 

These new DNA constructs make use of the site specific recombination system of the PI 
phage. The PI phage possesses a recombinase called Cre which interacts specifically with a 34 
base pairs loxP site. The loxP site is composed of two palindromic sequences of 13 bp separated by 

25 a 8 bp conserved sequence (Hoess et aL, 1986), which disclosure is hereby incorporated by 

reference in its entirety. The recombination by the Cre enzyme between two loxP sites having an 
identical orientation leads to the deletion of the DNA fragment. 

The Cre-/oxP system used in combination with a homologous recombination technique has 
been first described by Gu et aL (1993, 1994), which disclosures are hereby incorporated by 

30 reference in their entireties. Briefly, a nucleotide sequence of interest to be inserted in a targeted 
location of the genome harbors at least two loxP sites in the same orientation and located at the 
respective ends of a nucleotide sequence to be excised from the recombinant genome. The excision 
event requires the presence of the recombinase (Cre) enzyme within the nucleus of the recombinant 
cell host. The recombinase enzyme may be brought at the desired time either by (a) incubating the 

35 recombinant cell hosts in a culture medium containing this enzyme, by injecting the Cre enzyme 
directly into the desired cell, such as described by Araki et aL (1995), which disclosure is hereby 
incorporated by reference in its entirety, or by lipofection of the enzyme into the cells, such as 
described by Baubonis et aL (1993), which disclosure is hereby incorporated by reference in its 
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entirety; (b) transfecting the cell host with a vector comprising the Cre coding sequence operably 
linked to a promoter functional in the recombinant cell host, which promoter being optionally 
inducible, said vector being introduced in the recombinant cell host, such as described by Gu et al 
(1993) and Sauer et al (1988), which disclosures are hereby incorporated by reference in their 
5 entireties; (c) introducing in the genome of the cell host a polynucleotide comprising the Cre coding 
sequence operably linked to a promoter functional in the recombinant cell host, which promoter is 
optionally inducible, and said polynucleotide being inserted in the genome of the cell host either by 
a random insertion event or an homologous recombination event, such as described by Gu et 
al (1994). 

10 In a specific embodiment, the vector containing the sequence to be inserted in the GENSET 

gene by homologous recombination is constructed in such a way that selectable markers are flanked 
by loxP sites of the same orientation, it is possible, by treatment by the Cre enzyme, to eliminate the 
selectable markers while leaving the GENSET sequences of interest that have been inserted by an 
homologous recombination event. Again, two selectable markers are needed: a positive selection 

1 5 marker to select for the recombination event and a negative selection marker to select for the 

homologous recombination event. Vectors and methods using the Cre-/axP system are described by 
Zou et al (1994), which disclosure is hereby incorporated by reference in its entirety. 

Thus, a third preferred DNA construct of the invention comprises, from 5'-end to 3'-end: 
(a) a first nucleotide sequence that is comprised in the GENSET genomic sequence; (b) a nucleotide 

20 sequence comprising a polynucleotide encoding a positive selection marker, said nucleotide 

sequence comprising additionally two sequences defining a site recognized by a recombinase, such 
as a loxP site, the two sites being placed in the same orientation; and (c) a second nucleotide 
sequence that is comprised in the GENSET genomic sequence, and is located on the genome 
downstream of the first GENSET nucleotide sequence (a). 

25 The sequences defining a site recognized by a recombinase, such as a loxP site, are 

preferably located within the nucleotide sequence (b) at suitable locations bordering the nucleotide 
sequence for which the conditional excision is sought. In one specific embodiment, two loxP sites 
are located at each side of the positive selection marker sequence, in order to allow its excision at a 
desired time after the occurrence of the homologous recombination event. 

30 In a preferred embodiment of a method using the third DNA construct described above, the 

excision of the polynucleotide fragment bordered by the two sites recognized by a recombinase, 
preferably two loxP sites, is performed at a desired time, due to the presence within the genome of 
the recombinant host cell of a sequence encoding the Cre enzyme operably linked to a promoter 
sequence, preferably an inducible promoter, more preferably a tissue-specific promoter sequence 

35 and most preferably a promoter sequence which is both inducible and tissue-specific, such as 
described by Gu et al (1994). 

The presence of the Cre enzyme within the genome of the recombinant cell host may result 
from the breeding of two transgenic animals, the first transgenic animal bearing the GENSET- 
derived sequence of interest containing the loxP sites as described above and the second transgenic 
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animal bearing the Cre coding sequence operably linked to a suitable promoter sequence, such as 
described by Gu et al (1994). 

Spatio-temporal control of the Cre enzyme expression may also be achieved with an 
adenovirus based vector that contains the Cre gene thus allowing infection of cells, or in vivo 
5 infection of organs, for delivery of the Cre enzyme, such as described by Anton and Graham (1995) 
and Kanegae et al (1995), which disclosures are hereby incorporated by reference in their 
entireties. 

The DNA constructs described above may be used to introduce a desired nucleotide 
sequence of the invention, preferably a GENSET genomic sequence or a GEN SET cDNA sequence, 
10 and most preferably an altered copy of a GENSET genomic or cDNA sequence, within a 

predetermined location of the targeted genome, leading either to the generation of an altered copy of 
a targeted gene (knock-out homologous recombination) or to the replacement of a copy of the 
targeted gene by another copy sufficiently homologous to allow an homologous recombination 
event to occur (knock-in homologous recombination). 

15 Modifying Genset polypoptide expression and/or biological activity 

Modifying endogenous GENSET expression and/or biological activity is expressly 
contemplated by the present invention. 

Screening for compounds that modulate GENSET expression and/or biological activity 

The present invention further relates to compounds able to modulate GENSET expression 
20 and/or biological activity and methods to use these compounds. Such compounds may interact with 
the regulatory sequences of GENSET genes or they may interact with GENSET polypeptides 
directly or indirectly. 

Compounds Interacting With GENSET Regulatory Sequences 

The present invention also concerns a method for screening substances or molecules that 
25 are able to interact with the regulatory sequences of a GENSET gene, such as for example promoter 
or enhancer sequences in untranscribed regions of the genomic DNA, as determined using any 
techniques known to those skilled in the art including those described in the section entitled 
"Identification of Promoters in Cloned Upstream Sequences, or such as regulatory sequences 
located in untranslated regions of GENSET mRNA. 
30 Sequences within untranscribed or untranslated regions of polynucleotides of the invention 

may be identified by comparison to databases containing known regulatory sequence such as 
transcription start sites, transcription factor binding sites, promoter sequences, enhancer sequences, 
5'UTR and 3'UTR elements (Pesole et al, 2000; http://igs-server.cnrs- 

mrs.fr/~gauthere/UTR/index.html). Alternatively, the regulatory sequences of interest may be 
35 identified through conventional mutagenesis or deletion analyses of reporter plasmids using, for 
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instance, techniques described in the section entitled "Identification of Promoters in Cloned 
Upstream Sequences". 

Following the identification of potential GENSET regulatory sequences, proteins which 
interact with these regulatory sequences may be identified as described below. 
5 Gel retardation assays may be performed independently in order to screen candidate 

molecules that are able to interact with the regulatory sequences of the GENSET gene, such as 
described by Fried and Crothers (1981), Garner and Revzin (1981) and Dent and Latchman (1993), 
the teachings of these publications being herein incorporated by reference. These techniques are 
based on the principle according to which a DNA or mRNA fragment which is bound to a protein 

1 0 migrates slower than the same unbound DNA or mRNA fragment. Briefly, the target nucleotide 
sequence is labeled. Then the labeled target nucleotide sequence is brought into contact with either 
a total nuclear extract from cells containing regulation factors, or with different candidate molecules 
to be tested. The interaction between the target regulatory sequence of the GENSET gene and the 
candidate molecule or the regulation factor is detected after gel or capillary electrophoresis through 

15 a retardation in the migration. 

Nucleic acids encoding proteins which are able to interact with the promoter sequence of 
the GENSET gene, more particularly a nucleotide sequence selected from the group consisting of 
the polynucleotides of the 5' and 3' regulatory region or a fragment or variant thereof, may be 
identified by using a one-hybrid system, such as that described in the booklet enclosed in the 

20 Matchmaker One-Hybrid System kit from Clontech (Catalog Ref. no. K1603-1, the technical 

teachings of which are herein incorporated by reference). Briefly, the target nucleotide sequence is 
cloned upstream of a selectable reporter sequence and the resulting polynucleotide construct is 
integrated in the yeast genome (Saccharomyces cerevisiae). Preferably, multiple copies of the 
target sequences are inserted into the reporter plasmid in tandem. The yeast cells containing the 

25 reporter sequence in their genome are then transformed with a library comprising fusion molecules 
between cDNAs encoding candidate proteins for binding onto the regulatory sequences of the 
GENSET gene and sequences encoding the activator domain of a yeast transcription factor such as 
GAL4. The recombinant yeast cells are plated in a culture broth for selecting cells expressing the 
reporter sequence. The recombinant yeast cells thus selected contain a fusion protein that is able to 

30 bind onto the target regulatory sequence of the GENSET gene. Then, the cDNAs encoding the 
fusion proteins are sequenced and may be cloned into expression or transcription vectors in vitro. 
The binding of the encoded polypeptides to the target regulatory sequences of the GENSET gene 
may be confirmed by techniques familiar to the one skilled in the art, such as gel retardation assays 
or DNAse protection assays. 

35 Ligands interacting with GENSET polypeptides 

For the purpose of the present invention, a ligand means a molecule, such as a protein, a 
peptide, an antibody or any synthetic chemical compound capable of binding to a GENSET protein 
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or one of its fragments or variants or to modulate the expression of the polynucleotide coding for 
GENSET or a fragment or variant thereof. 

In the ligand screening method according to the present invention, a biological sample or a 
defined molecule to be tested as a putative ligand of a GENSET protein is brought into contact with 
5 the corresponding purified GENSET protein, for example the corresponding purified recombinant 
GENSET protein produced by a recombinant cell host as described herein, in order to form a 
complex between this protein and the putative ligand molecule to be tested. 

As an illustrative example, to study the interaction of a GENSET protein, or a fragment 
comprising a contiguous span of at least 6 amino acids, preferably at least 8 to 10 amino acids, more 

10 preferably at least 12, 15, 20, 25, 30, 40, 50, or 100 amino acids of a polypeptide selected from the 
group consisting of sequences of SEQ ID NOs: 1 70-338, 456-560, 785-91 8 and polypeptides 
encoded by the clone inserts of the deposited clone pool, with drugs or small molecules, such as 
molecules generated through combinatorial chemistry approaches, the microdialysis coupled to 
HPLC method described by Wang et al, (1997) or the affinity capillary electrophoresis method 

15 described by Bush et al. (1997), the disclosures of which are incorporated by reference, can be used. 
In further methods, peptides, drugs, fatty acids, lipoproteins, or small molecules which 
interact with a GENSET protein, or a fragment comprising a contiguous span of at least 6 amino 
acids, preferably at least 8 to 10 amino acids, more preferably at least 12, 15, 20, 25, 30, 40, 50, or 
100 amino acids of a polypeptide selected from the group consisting of sequences of SEQ ID 

20 NOs: 1 70-338, 456-560, 785-91 8 and polypeptides encoded by the clone inserts of the deposited 
clone pool may be identified using assays such as the following. The molecule to be tested for 
binding is labeled with a detectable label, such as a fluorescent, radioactive, or enzymatic tag and 
placed in contact with immobilized GENSET protein, or a fragment thereof under conditions which 
permit specific binding to occur. After removal of non-specifically bound molecules, bound 

25 molecules are detected using appropriate means. 

Various candidate substances or molecules can be assayed for interaction with a GENSET 
polypeptide. These substances or molecules include, without being limited to, natural or synthetic 
organic compounds or molecules of biological origin such as polypeptides. When the candidate 
substance or molecule comprises a polypeptide, this polypeptide may be the resulting expression 

30 product of a phage clone belonging to a phage-based random peptide library, or alternatively the 
polypeptide may be the resulting expression product of a cDNA library cloned in a vector suitable 
for performing a two-hybrid screening assay. 

A. Candidate ligands obtained from random peptide libraries 

In a particular embodiment of the screening method, the putative ligand is the expression 
35 product of a DNA insert contained in a phage vector (Parmley and Smith, 1988). Specifically, 
random peptide phages libraries are used. The random DNA inserts encode for peptides of 8 to 20 
amino acids in length (Oldenburg et al., 1992; Valadon etal., 1996; Lucas, 1994; Westerink, 1995; 
Felici et ah, 1991), which disclosures are hereby incorporated by reference in their entireties. 
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According to this particular embodiment, the recombinant phages expressing a protein that binds to 
an immobilized GENSET protein is retained and the complex formed between the GENSET protein 
and the recombinant phage may be subsequently immunoprecipitated by a polyclonal or a 
monoclonal antibody directed against the GENSET protein. 
5 Once the ligand library in recombinant phages has been constructed, the phage population 

is brought into contact with the immobilized GENSET protein. Then the preparation of complexes 
is washed in order to remove the non-specifically bound recombinant phages. The phages that bind 
specifically to the GENSET protein are then eluted by a buffer (acid pH) or immunoprecipitated by 
the monoclonal antibody produced by the hybridoma anti-GENSET, and this phage population is 

10 subsequently amplified by an over-infection of bacteria (for example E. coli). The selection step 
may be repeated several times, preferably 2-4 times, in order to select the more specific 
recombinant phage clones. The last step comprises characterizing the peptide produced by the 
selected recombinant phage clones either by expression in infected bacteria and isolation, 
expressing the phage insert in another host-vector system, or sequencing the insert contained in the 

15 selected recombinant phages. 

B. Candidate ligands obtained by competition experiments. 

Alternatively, peptides, drugs or small molecules which bind to a GENSET protein or 
fragment thereof comprising a contiguous span of at least 6 amino acids, preferably at least 8 to 10 
amino acids, more preferably at least 12, 15, 20, 25, 30, 40, 50, or 100 amino acids of a polypeptide 

20 selected from the group consisting of sequences of SEQ ID NOs: 170-338, 456-560, 785-918 and 
polypeptides encoded by the clone inserts of the deposited clone pool, may be identified in 
competition experiments. In such assays, the GENSET protein, or a fragment thereof, is 
immobilized to a surface, such as a plastic plate. Increasing amounts of the peptides, drugs or small 
molecules are placed in contact with the immobilized GENSET protein, or a fragment thereof, in 

25 the presence of a detectable labeled known GENSET protein ligand. For example, the GENSET 
ligand may be detectably labeled with a fluorescent, radioactive, or enzymatic tag. The ability of 
the test molecule to bind the GENSET protein, or a fragment thereof, is determined by measuring 
the amount of detectably labeled known ligand bound in the presence of the test molecule. A 
decrease in the amount of known ligand bound to the GENSET protein, or a fragment thereof, when 

30 the test molecule is present indicated that the test molecule is able to bind to the GENSET protein, 
or a fragment thereof. 

C. Candidate ligands obtained by affinity chromatography. 

Proteins or other molecules interacting with a GENSET protein, or a fragment thereof 
comprising a contiguous span of at least 6 amino acids, preferably at least 8 to 10 amino acids, more 
35 preferably at least 12, 15, 20, 25, 30, 40, 50, or 100 amino acids of a polypeptide selected from the 
group consisting of sequences of SEQ ID NOs: 1 70-338, 456-560, 785-91 8 and polypeptides 
encoded by the clone inserts of the deposited clone pool, can also be found using affinity columns 
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which contain the GENSET protein, or a fragment thereof. The GENSET protein, or a fragment 
thereof, may be attached to the column using conventional techniques including chemical coupling 
to a suitable column matrix such as agarose, AfFi Gel® , or other matrices familiar to those of skill 
in art. In some embodiments of this method, the affinity column contains chimeric proteins in 
5 which the GENSET protein, or a fragment thereof, is fused to glutathion S transferase (GST). A 
mixture of cellular proteins or pool of expressed proteins as described above is applied to the 
affinity column. Proteins or other molecules interacting with the GENSET protein, or a fragment 
thereof, attached to the column can then be isolated and analyzed on 2-D electrophoresis gel as 
described in Ramunsen et al. (1997), the disclosure of which is incorporated by reference. 
10 Alternatively, the proteins retained on the affinity column can be purified by electrophoresis based 
methods and sequenced. The same method can be used to isolate antibodies, to screen phage 
display products, or to screen phage display human antibodies. 

D. Candidate ligands obtained by optical biosensor methods 

Proteins interacting with a GENSET protein, or a fragment comprising a contiguous span of 

1 5 at least 6 amino acids, preferably at least 8 to 10 amino acids, more preferably at least 12, 15, 20, 
25, 30, 40, 50, or 100 amino acids of a polypeptide selected from the group consisting of sequences 
of SEQ ID NOs: 170-338, 456-560, 785-918 and polypeptides encoded by the clone inserts of the 
deposited clone pool, can also be screened by using an Optical Biosensor as described in Edwards 
and Leatherbarrow (1997) and also in Szabo et al. (1995), the disclosures of which are incorporated 

20 by reference. This technique permits the detection of interactions between molecules in real time, 
without the need of labeled molecules. This technique is based on the surface plasmon resonance 
(SPR) phenomenon. Briefly, the candidate ligand molecule to be tested is attached to a surface 
(such as a carboxymethyl dextran matrix). A light beam is directed towards the side of the surface 
that does not contain the sample to be tested and is reflected by said surface. The SPR phenomenon 

25 causes a decrease in the intensity of the reflected light with a specific association of angle and 
wavelength. The binding of candidate ligand molecules cause a change in the refraction index on 
the surface, which change is detected as a change in the SPR signal. For screening of candidate 
ligand molecules or substances that are able to interact with the GENSET protein, or a fragment 
thereof, the GENSET protein, or a fragment thereof, is immobilized onto a surface. This surface 

30 comprises one side of a cell through which flows the candidate molecule to be assayed. The 

binding of the candidate molecule on the GENSET protein, or a fragment thereof, is detected as a 
change of the SPR signal. The candidate molecules tested may be proteins, peptides, carbohydrates, 
lipids, or small molecules generated by combinatorial chemistry. This technique may also be 
performed by immobilizing eukaryotic or prokaryotic cells or lipid vesicles exhibiting an 

35 endogenous or a recombinantly expressed GENSET protein at their surface. 

The main advantage of the method is that it allows the determination of the association rate 
between the GENSET protein and molecules interacting with the GENSET protein. It is thus 
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possible to select specifically ligand molecules interacting with the GENSET protein, or a fragment 
thereof, through strong or conversely weak association constants. 

E. Candidate ligands obtained through a two-hybrid screening assay. 

The yeast two-hybrid system is designed to study protein-protein interactions in vivo (Fields 
5 and Song, 1989), which disclosure is hereby incorporated by reference in its entirety, and relies 
upon the fusion of a bait protein to the DNA binding domain of the yeast Gal4 protein. This 
technique is also described in the US Patent No. US 5,667,973 and the US Patent No. 5,283,1 73, the 
technical teachings of both patents being herein incorporated by reference. 

The general procedure of library screening by the two-hybrid assay may be performed as 
10 described by Harper et ah (1993) or as described by Cho et aL (1998) or also Fromont-Racine et ah 
(1997), which disclosures are hereby incorporated by reference in their entireties. 

The bait protein or polypeptide comprises, consists essentially of, or consists of a GENSET 
polypeptide or a fragment thereof comprising a contiguous span of at least 6 amino acids, preferably 
at least 8 to 10 amino acids, more preferably at least 12, 15, 20, 25, 30, 40, 50, or 100 amino acids 
15 of a polypeptide selected from the group consisting of sequences of SEQ ID NOs: 1 70-338, 456- 
560, 785-91 8 and polypeptides encoded by the clone inserts of the deposited clone pool. 

More precisely, the nucleotide sequence encoding the GENSET polypeptide or a fragment 
or variant thereof is fused to a polynucleotide encoding the DNA binding domain of the GAL4 
protein, the fused nucleotide sequence being inserted in a suitable expression vector, for example 
20 pAS2orpM3. 

Then, a human cDNA library is constructed in a specially designed vector, such that the 
human cDNA insert is fused to a nucleotide sequence in the vector that encodes the transcriptional 
domain of the GAL4 protein. Preferably, the vector used is the pACT vector. The polypeptides 
encoded by the nucleotide inserts of the human cDNA library are termed "prey" polypeptides. 

25 A third vector contains a detectable marker gene, such as beta galactosidase gene or CAT 

gene that is placed under the control of a regulation sequence that is responsive to the binding of a 
complete Gal4 protein containing both the transcriptional activation domain and the DNA binding 
domain. For example, the vector pG5EC may be used. 

Two different yeast strains are also used. As an illustrative but non limiting example the 

30 two different yeast strains may be the followings : 

- Y190, the phenotype of which is (MATa, Leu2-3, 1 12 ura3-12, trpl-901, his3-D200, 
ade2-101, gal4Dgall80D URA3 GAL-LacZ, LYS G^Z-HIS3, cyh r ); 

- Y187, the phenotype of which is (MATa gal4 gal80 his3 trpl-901 ade2-101 ura3-52 leu2- 
3, -1 12 URA3 GAL-lacZmef ), which is the opposite mating type of Y 190. 

35 Briefly, 20 \ig of pAS2/GENSET and 20 jig of pACT-cDNA library are co-transformed 

into yeast strain Y190. The transformants are selected for growth on minimal media lacking 
histidine, leucine and tryptophan, but containing the histidine synthesis inhibitor 3-AT (50 mM). 
Positive colonies are screened for beta galactosidase by filter lift assay. The double positive 
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colonies (His + , beta-gal + ) are then grown on plates lacking histidine, leucine, but containing 
tryptophan and cycloheximide (10 mg/ml) to select for loss of pAS2/GENSET plasmids but 
retention of pACT-cDNA library plasmids. The resulting Y190 strains are mated with Y187 strains 
expressing GENSET or non-related control proteins; such as cyclophilin B, lamin, or SNF1, as Gal4 
5 fusions as described by Harper et ah (1993) and by Bram et ah (1993), which disclosures are hereby 
incorporated by reference in their entireties, and screened for beta galactosidase by filter lift assay. 
Yeast clones that are beta gal- after mating with the control Gal4 fusions are considered false 
positives. 

In another embodiment of the two-hybrid method according to the invention, interaction 
10 between the GENSET or a fragment or variant thereof with cellular proteins may be assessed using 
the Matchmaker Two Hybrid System 2 (Catalog No. Kl 604-1, Clontech). As described in the 
manual accompanying the kit, the disclosure of which is incorporated herein by reference, nucleic 
acids encoding the GENSET protein or a portion thereof, are inserted into an expression vector such 
that they are in frame with DNA encoding the DNA binding domain of the yeast transcriptional 
15 activator GAL4. A desired cDNA, preferably human cDNA, is inserted into a second expression 
vector such that they are in frame with DNA encoding the activation domain of GAL4. The two 
expression plasmids are transformed into yeast and the yeast are plated on selection medium which 
selects for expression of selectable markers on each of the expression vectors as well as GAL4 
dependent expression of the HIS3 gene. Transformants capable of growing on medium lacking 
20 histidine are screened for GAL4 dependent lacZ expression. Those cells which are positive in both the 
histidine selection and the lacZ assay contain interaction between GENSET and the protein or peptide 
encoded by the initially selected cDNA insert. 

Compounds Modulating GENSET biological activity 

Another method of screening for compounds that modulate GENSET expression and/or 
25 biological activity is by measuring the effects of test compounds on specific biological activity, e.g. 
a GENSET biological activity in a host cell. In one embodiment, the present invention relates to a 
method of identifying an agent which alters GENSET biological activity, wherein a nucleic acid 
construct comprising a nucleic acid which encodes a mammalian GENSET polypeptide is 
introduced into a host cell. The host cells produced are maintained under conditions appropriate for 
30 expression of the encoded mammalian GENSET polypeptides, whereby the nucleic acid is 

expressed. The host cells are then contacted with a compound to be assessed (an "agent," or "test 
agent"), and the properties of the cells is assessed. Detection of a change in any GENSET 
polypeptide-associated property in the presence of the agent indicates that the agent alters GENSET 
activity. In a particular embodiment, the invention relates to a method of identifying an agent 
35 which is an activator of GENSET activity, wherein detection of an increase of any GENSET 
polypeptide-associated property in the presence of the agent indicates that the agent activates 
GENSET activity. In another particular embodiment, the invention relates to a method of 
identifying an agent which is an inhibitor of GENSET activity, wherein detection of a decrease of 
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any GENSET polypeptide-associated property in the presence of the agent indicates that the agent 
inhibits GENSET activity. 

In a particular embodiment, a high throughput screen can be used to identify agents that 
activate (enhance) or inhibit GENSET activity (See e.g., PCT publication WO 98/45438, which 
5 disclosure is hereby incorporated by reference in its entirety). For example, the method of 
identifying an agent which alters GENSET activity can be performed as follows. A nucleic acid 
construct comprising a polynucleotide which encodes a mammalian GENSET polypeptide is 
introduced into a host cell to produce recombinant host cells. The recombinant host cells are then 
maintained under conditions appropriate for expression of the encoded mammalian GENSET 

10 polypeptide, whereby the nucleic acid is expressed. The compound to be assessed is added to the 
recombinant host cells; the resulting combination is referred to as a test sample. A detectable, 
GENSET polypeptide-associated property of the cells is detected. A control can be used in the 
methods of detecting agents which alter GENSET activity. For example, the control sample 
includes the same reagents but lacks the compound or agent being assessed; it is treated in the same 

1 5 manner as the test sample. 

Methods of Screening for Compounds Modulating GENSET Expression and/or Activity 

The present invention also relates to methods of screening compounds for their ability to 
modulate (e.g. increase or inhibit) the activity or expression of GENSET. More specifically, the 
present invention relates to methods of testing compounds for their ability either to increase or to 
20 decrease expression or activity of GENSET. The assays are performed in vitro or in vivo. 

In vitro methods 

In vitro, cells expressing GENSET polypeptides are incubated in the presence and absence 
of the test compound. By determining the level of GENSET expression in the presence of the test 
compound or the level of GENSET activity in the presence of the test compound, compounds can 

25 be identified that suppress or enhance GENSET expression or activity. Alternatively, constructs 
comprising a GENSET regulatory sequence operably linked to a reporter gene (e.g. luciferase, 
chloramphenicol acetyl transferase, LacZ, green fluorescent protein, etc.) can be introduced into 
host cells and the effect of the test compounds on expression of the reporter gene detected. Cells 
suitable for use in the foregoing assays include, but are not limited to, cells having the same origin 

30 as tissues or cell lines in which the polypeptide is known to be expressed using the data from Tables 
III, IV, orV. 

Consequently, the present invention encompasses a method for screening molecules that 
modulate the expression of a GENSET gene, said screening method comprising the steps of: 

a) cultivating a prokaryotic or an eukaryotic cell that has been transfected with a nucleotide 
35 sequence encoding a GENSET protein or a variant or a fragment thereof, placed under the control 

of its own promoter; 

b) bringing into contact said cultivated cell with a molecule to be tested; 
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c) quantifying the expression of said GENSET protein or a variant or a fragment thereof in 

the presence of said molecule. 

Using DNA recombination techniques well known by the one skill in the art, the GENSET 

protein encoding DNA sequence is inserted into an expression vector, downstream from its 
5 promoter sequence. As an illustrative example, the promoter sequence of the GENSET gene is 

contained in the 5 5 untranscribed region of the GENSET genomic DNA. 

The quantification of the expression of a GENSET protein may be realized either at the 

mRNA level (using for example Northen blots, RT-PCR, preferably quantitative RT-PCR with 

primers and probes specific for the GENSET mRNA of interest) or at the protein level (using 
10 polyclonal or monoclonal antibodies in immunoassays such as ELISA or RIA assays, Western blots, 

or immunochemistry). 

The present invention also concerns a method for screening substances or molecules that 

are able to increase, or in contrast to decrease, the level of expression of a GENSET gene. Such a 

method may allow the one skilled in the art to select substances exerting a regulating effect on the 
1 5 expression level of a GENSET gene and which may be useful as active ingredients included in 

pharmaceutical compositions for treating patients suffering from disorders associated with abnormal 

levels of GENSET products. 

Thus, another part of the present invention is a method for screening a candidate molecule 

that modulates the expression of a GENSET gene, this method comprises the following steps: 
20 a) providing a recombinant cell host containing a nucleic acid, wherein said nucleic acid 

comprises a GENSET 5' regulatory region or a regulatory active fragment or variant thereof, 

operably linked to a polynucleotide encoding a detectable protein; 

b) obtaining a candidate molecule; and 

c) determining the ability of said candidate molecule to modulate the expression levels of 
25 said polynucleotide encoding the detectable protein. 

In a further embodiment, said nucleic acid comprising a GENSET 5' regulatory region or a 
regulatory active fragment or variant thereof, includes the 5'UTR region of a GENSET cDNA 
selected from the group comprising of the 5'UTRs of the sequences of SEQ ID NOs: 1-169, 339- 
455, 561-784, sequences of clones inserts of the deposited clone pool, regulatory active fragments 
30 and variants thereof. In a more preferred embodiment of the above screening method, said nucleic 
acid includes a promoter sequence which is endogenous with respect to the GENSET 5'UTR 
sequence. In another more preferred embodiment of the above screening method, said nucleic acid 
includes a promoter sequence which is exogenous with respect to the GENSET 5'UTR sequence 
defined therein. 

35 Preferred polynucleotides encoding a detectable protein are polynucleotides encoding beta 

galactosidase, green fluorescent protein (GFP) and chloramphenicol acetyl transferase (CAT). 

The invention further relates to a method for the production of a pharmaceutical 
composition comprising a method of screening a candidate molecule that modulates the expression 
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of a GENSET gene and furthermore mixing the identified molecule with a pharmaceutically 
acceptable carrier. 

The invention also pertains to kits for the screening of a candidate substance modulating the 
expression of a GENSET gene. Preferably, such kits comprise a recombinant vector that allows the 
5 expression of a GENSET 5' regulatory region or a regulatory active fragment or a variant thereof, 
operably linked to a polynucleotide encoding a detectable protein or a GENSET protein or a 
fragment or a variant thereof. More preferably, such kits include a recombinant vector that 
comprises a nucleic acid including the 5'UTR region of a GENSET cDNA selected from the group 
comprising the 5'UTRs of the sequences of SEQ ID NOs:l-169, 339-455, 561-784, sequences of 
1 0 clone inserts of the deposited clone pool, regulatory active fragments and variants thereof, being 
operably linked to a polynucleotide encoding a detectable protein. 

For the design of suitable recombinant vectors useful for performing the screening methods 
described above, it will be referred to the section of the present specification wherein the preferred 
recombinant vectors of the invention are detailed. 
1 5 Another object of the present invention comprises methods and kits for the screening of 

candidate substances that interact with a GENSET polypeptide, fragments or variants thereof. By 
their capacity to bind covalently or non-covalently to a GENSET protein, fragments or variants 
thereof, these substances or molecules may be advantageously used both in vitro and in vivo. 

In vitro, said interacting molecules may be used as detection means in order to identify the 
20 presence of a GENSET protein in a sample, preferably a biological sample. 

A method for the screening of a candidate substance that interact with a GENSET 
polypeptide, fragments or variants thereof, said methods comprising the following steps: 

a) providing a polypeptide comprising, consisting essentially of, or consisting of a 
GENSET protein or a fragment comprising a contiguous span of at least 6 amino acids, preferably 

25 at least 8 to 10 amino acids, more preferably at least 12, 15, 20, 25, 30, 40, 50, or 100 amino acids 
of a polypeptide selected from the group consisting of sequences of SEQ ID NOs: 170-338, 456- 
560, 785-918 and polypeptides encoded by the clone inserts of the deposited clone pool; 

b) obtaining a candidate substance; 

c) bringing into contact said polypeptide with said candidate substance; 

30 d) detecting the complexes formed between said polypeptide and said candidate substance. 

The invention further relates to a method for the production of a pharmaceutical 
composition comprising a method for the screening of a candidate substance that interact with a 
GENSET polypeptide, fragments or variants thereof and furthermore mixing the identified 
substance with a pharmaceutically acceptable carrier. 

35 The invention further concerns a kit for the screening of a candidate substance interacting 

with the GENSET polypeptide, wherein said kit comprises: 

a) a polypeptide comprising, consisting essentially of, or consisting of a GENSET protein 
or a fragment comprising a contiguous span of at least 6 amino acids, preferably at least 8 to 10 
amino acids, more preferably at least 12, 15, 20, 25, 30, 40, 50, or 100 amino acids of a polypeptide 
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selected from the group consisting of sequences of SEQ ID NOs: 170-338, 456-560, 785-91 8 and 
polypeptides encoded by the clone inserts of the deposited clone pool; and 

b) optionally means useful to detect the complex formed between said polypeptide or a 
variant thereof and the candidate substance. 
5 In a preferred embodiment of the kit described above, the detection means comprises a 

monoclonal or polyclonal antibody binding to said GENSET protein or fragment or variant thereof. 

In vivo methods 

Compounds that suppress or enhance GENSET expression can also be identified using in 
vivo screens. In these assays, the test compound is administered (e.g. IV, IP, IM, orally, or 

1 0 otherwise), to the animal, for example, at a variety of dose levels. The effect of the compound on 
GENSET expression is determined by comparing GENSET levels, for example in tissues known to 
express the gene of interest using, for example the data obtained in Tables III, IV, or V, and using 
Northern blots, immunoassays, PCR, etc., as described above. Suitable test animals include, but are 
not limited to, rodents (e.g., mice and rats), primates, and rabbits. Humanized mice can also be 

1 5 used as test animals, that is mice in which the endogenous mouse protein is ablated (knocked out) 
and the homologous human protein added back by standard transgenic approaches. Such mice 
express only the human form of a protein. Humanized mice expressing only the human GENSET 
can be used to study in vivo responses to potential agents regulating GENSET protein or mRNA 
levels. As an example, transgenic mice have been produced carrying the human apoE4 gene. They 

20 are then bred with a mouse line that lacks endogenous apoE, to produce an animal model carrying 
human proteins believed to be instrumental in development of Alzheimer's pathology. Such 
transgenic animals are useful for dissecting the biochemical and physiological steps of disease, and 
for development of therapies for disease intervention (Loring, et al, 1996) (incorporated herein by 
reference in its entirety). 

25 Uses for compounds modulating GENSET expression and/or biological activity 

Using in vivo (or in vitro) systems, it may be possible to identify compounds that exert a 
tissue specific effect, for example, that increase GENSET expression or activity only in tissues of 
interest, such as the adrenal gland, bone marrow, brain, cerebellum, colon, fetal brain, fetal kidney, 
fetal liver, heart, hypertrophic prostate, kidney, liver, lung, lymph ganglia, lymphocytes, muscle, 

30 ovary, pancreas, pituitary gland, placenta, prostate, salivary gland, spinal cord, spleen, stomach, 
intestine, substantia nigra, testis, thyroid, umbilical cord, and uterus. Screening procedures such as 
those described above are also useful for identifying agents for their potential use in 
pharmacological intervention strategies. Agents that enhance GENSET gene expression or 
stimulate its activity may thus be used to induce any phenotype associated with a GENSET gene, or 

35 to treat disorders resulting from a deficiency of a GENSET polypeptide activity or expression. 
Compounds that suppress GENSET polypeptide expression or inhibit its activity can be used to 
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treat any disease or condition associated with increased or deleterious GENSET polypeptide activity 
or expression. 

Also encompassed by the present invention is an agent which interacts with a GENSET 
gene or polypeptide directly or indirectly, and inhibits or enhances GENSET polypeptide expression 
5 and/or function. In one embodiment, the agent is an inhibitor which interferes with a GENSET 
polypeptide directly (e.g., by binding the GENSET polypeptide) or indirectly (e.g., by blocking the 
ability of the GENSET polypeptide to have a GENSET biological activity). In a particular 
embodiment, an inhibitor of a GENSET protein is an antibody specific for the GENSET protein or a 
functional portion of the GENSET protein; that is, the antibody binds a GENSET polypeptide. For 

10 example, the antibody can be specific for a polypeptide encoded by one of the nucleic acid 

sequences of human GENSETs (SEQ ID NOs: 1-169, 339-455, 561-784), a mammalian GENSET 
nucleic acid, or portions thereof. Alternatively, the inhibitor can be an agent other than an antibody 
(e.g., small organic molecule, protein or peptide) which binds the GENSET polypeptide and blocks 
its activity. For example, the inhibitor can be an agent which mimics the GENSET polypeptide 

15 structurally, but lacks its function. Alternatively, it can be an agent which binds to or interacts with 
a molecule which the GENSET polypeptide normally binds to or interacts with, thus blocking the 
GENSET polypepetide from doing so and preventing it from exerting the effects it would normally 
exert. 

In another embodiment, the agent is an enhancer (activator) of a GENSET polypeptide 

20 which increases the activity of the GENSET polypeptide (increases the effect of a given amount or 
level of GENSET), increases the length of time it is effective (by preventing its degradation or 
otherwise prolonging the time during which it is active) or both either directly or indirectly. For 
example, GENSET polynucleotides and polypeptides can be used to identify drugs which increase 
or decrease the ability of GENSET polypeptides to induce GENSET biological activity, which 

25 drugs are useful for the treatment or prevention of any disease or condition associated with a 
GENSET biological activity. 

The GENSET sequences of the present invention can also be used to generate nonhuman 
gene knockout animals, such as mice, which lack a GENSET gene or transgenically overexpress a 
GENSET gene. For example, such GENSET gene knockout mice can be generated and used to 

30 obtain further insight into the function of the GENSET gene as well as assess the specificity of 
GENSET activators and inhibitors. Also, over expression of the GENSET gene (e.g., a human 
GENSET gene) in transgenic mice can be used as a means of creating a test system for GENSET 
activators and inhibitors (e.g., against a human GENSET polypeptide). In addition, the GENSET 
gene can be used to clone the GENSET promoter/enhancer in order to identify regulators of 

35 GENSET gene transcription. GENSET gene knockout animals include animals which completely 
or partially lack the GENSET gene and/or GENSET activity or function. Thus the present 
invention relates to a method of inhibiting (partially or completely) a GENSET biological activity in 
a mammal (e.g., a human), the method comprising administering to the mammal an effective 
amount of an inhibitor of a GENSET polypeptide or polynucleotide. The invention also relates to a 
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method of enhancing a GENSET biological activity in a mammal, the method comprising 
administering to the mammal an effective amount of an enhancer of a GENSET polypeptide or 
polynucleotide. 

Inhibiting GENSET gene expression 

5 Therapeutic compositions according to the present invention may comprise advantageously 

one or several GENSET oligonucleotide fragments as an antisense tool or a triple helix tool that 
inhibits the expression of the corresponding GENSET gene. 

Antisense Approach 

In antisense approaches, nucleic acid sequences complementary to an mRNA are 

1 0 hybridized to the mRNA intracellularly, thereby blocking the expression of the protein encoded by 
the mRNA. The antisense nucleic acid molecules to be used in gene therapy may be either DNA or 
RNA sequences. Preferred methods using antisense polynucleotide according to the present 
invention are the procedures described by Sczakiel et a/.(1995), which disclosure is hereby 
incorporated by reference in its entirety. 

1 5 Preferably, the antisense tools are chosen among the polynucleotides ( 1 5-200 bp long) that 

are complementary to GENSET mRNA, more preferably to the 5 'end of the GENSET mRNA. In 
another embodiment, a combination of different antisense polynucleotides complementary to 
different parts of the desired targeted gene are used. 

Other preferred antisense polynucleotides according to the present invention are sequences 

20 complementary to either a sequence of GENSET mRNAs comprising the translation initiation 

codon ATG or a sequence of GENSET genomic DNA containing a splicing donor or acceptor site. 

Preferably, the antisense polynucleotides of the invention have a 3' polyadenylation signal 
that has been replaced with a self-cleaving ribozyme sequence, such that RNA polymerase II 
transcripts are produced without poly(A) at their 3' ends, these antisense polynucleotides being 

25 incapable of export from the nucleus, such as described by Liu et a/.(1994), which disclosure is 
hereby incorporated by reference in its entirety. In a preferred embodiment, these GENSET 
antisense polynucleotides also comprise, within the ribozyme cassette, a histone stem-loop structure 
to stabilize cleaved transcripts against 3'-5' exonucleolytic degradation, such as the structure 
described by Eckner et tf/.(1991), which disclosure is hereby incorporated by reference in its 

30 entirety. 

The antisense nucleic acids should have a length and melting temperature sufficient to 
permit formation of an intracellular duplex having sufficient stability to inhibit the expression of the 
GENSET mRNA in the duplex. Strategies for designing antisense nucleic acids suitable for use in 
gene therapy are disclosed in Green et al. 9 (1986) and Izant and Weintraub, (1984), the disclosures 
35 of which are incorporated herein by reference. 

In some strategies, antisense molecules are obtained by reversing the orientation of the 
GENSET coding region with respect to a promoter so as to transcribe the opposite strand from that 
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which is normally transcribed in the cell. The antisense molecules may be transcribed using in vitro 
transcription systems such as those which employ T7 or SP6 polymerase to generate the transcript. 
Another approach involves transcription of GENSET antisense nucleic acids in vivo by operably 
linking DNA containing the antisense sequence to a promoter in a suitable expression vector. 
5 Alternatively, oligonucleotides which are complementary to the strand normally transcribed 

in the cell may be synthesized in vitro. Thus, the antisense nucleic acids are complementary to the 
corresponding mRNA and are capable of hybridizing to the mRNA to create a duplex. In some 
embodiments, the antisense sequences may contain modified sugar phosphate backbones to increase 
stability and make them less sensitive to RNase activity. Examples of modifications suitable for use 

10 in antisense strategies include 2' O-methyl RNA oligonucleotides and Protein-nucleic acid (PNA) 
oligonucleotides. Further examples are described by Rossi et al, (1991), which disclosure is hereby 
incorporated by reference in its entirety. 

Various types of antisense oligonucleotides complementary to the sequence of the GENSET 
cDNA or genomic DNA may be used. In one preferred embodiment, stable and semi-stable 

15 antisense oligonucleotides described in International Application No. PCT WO94/23026, hereby 
incorporated by reference, are used. In these molecules, the 3' end or both the 3' and 5' ends are 
engaged in intramolecular hydrogen bonding between complementary base pairs. These molecules 
are better able to withstand exonuclease attacks and exhibit increased stability compared to 
conventional antisense oligonucleotides. 

20 In another preferred embodiment, the antisense oligodeoxynucleotides against herpes 

simplex virus types 1 and 2 described in International Application No. WO 95/04141, hereby 
incorporated by reference, are used. 

In yet another preferred embodiment, the covalently cross-linked antisense oligonucleotides 
described in International Application No. WO 96/31523, hereby incorporated by reference, are 

25 used. These double- or single-stranded oligonucleotides comprise one or more, respectively, inter- 
or intra-oligonucleotide covalent cross-linkages, wherein the linkage consists of an amide bond 
between a primary amine group of one strand and a carboxyl group of the other strand or of the 
same strand, respectively, the primary amine group being directly substituted in the T position of 
the strand nucleotide monosaccharide ring, and the carboxyl group being carried by an aliphatic 

30 spacer group substituted on a nucleotide or nucleotide analog of the other strand or the same strand, 
respectively. 

The antisense oligodeoxynucleotides and oligonucleotides disclosed in International 
Application No. WO 92/1 8522, incorporated by reference, may also be used. These molecules are 
stable to degradation and contain at least one transcription control recognition sequence which binds 
35 to control proteins and are effective as decoys therefor. These molecules may contain "hairpin" 
structures, "dumbbell" structures, "modified dumbbell" structures, "cross-linked" decoy structures 
and "loop" structures. 

In another preferred embodiment, the cyclic double-stranded oligonucleotides described in 
European Patent Application No. 0 572 287 A2, hereby incorporated by reference are used. These 
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ligated oligonucleotide "dumbbells" contain the binding site for a transcription factor and inhibit 
expression of the gene under control of the transcription factor by sequestering the factor. 

Use of the closed antisense oligonucleotides disclosed in International Application No. WO 
92/19732, hereby incorporated by reference, is also contemplated. Because these molecules have 
5 no free ends, they are more resistant to degradation by exonucleases than are conventional 

oligonucleotides. These oligonucleotides may be multifunctional, interacting with several regions 
which are not adjacent to the target mRNA. 

The appropriate level of antisense nucleic acids required to inhibit gene expression may be 
determined using in vitro expression analysis. The antisense molecule may be introduced into the 

10 cells by diffusion, injection, infection or transfection using procedures known in the art. For 
example, the antisense nucleic acids can be introduced into the body as a bare or naked 
oligonucleotide, oligonucleotide encapsulated in lipid, oligonucleotide sequence encapsidated by 
viral protein, or as an oligonucleotide operably linked to a promoter contained in an expression 
vector. The expression vector may be any of a variety of expression vectors known in the art, 

15 including retroviral or viral vectors, vectors capable of extrachromosomal replication, or integrating 
vectors. The vectors may be DNA or RNA. 

The antisense molecules are introduced onto cell samples at a number of different 
concentrations preferably between lxl0" 10 M to lxlO^M. Once the minimum concentration that can 
adequately control gene expression is identified, the optimized dose is translated into a dosage 

20 suitable for use in vivo. For example, an inhibiting concentration in culture of lxlO" 7 translates into 
a dose of approximately 0.6 mg/kg bodyweight. Levels of oligonucleotide approaching 100 mg/kg 
bodyweight or higher may be possible after testing the toxicity of the oligonucleotide in laboratory 
animals. It is additionally contemplated that cells from the vertebrate are removed, treated with the 
antisense oligonucleotide, and reintroduced into the vertebrate. 

25 In a preferred application of this invention, the polypeptide encoded by the gene is first 

identified, so that the effectiveness of antisense inhibition on translation can be monitored using 
techniques that include but are not limited to antibody-mediated tests such as RIAs and ELISA, 
functional assays, or radiolabeling. 

An alternative to the antisense technology that is used according to the present invention 

30 comprises using ribozymes that will bind to a target sequence via their complementary 

polynucleotide tail and that will cleave the corresponding RNA by hydrolyzing its target site 
(namely "hammerhead ribozymes"). Briefly, the simplified cycle of a hammerhead ribozyme 
comprises (1) sequence specific binding to the target RNA via complementary antisense sequences; 
(2) site-specific hydrolysis of the cleavable motif of the target strand; and (3) release of cleavage 

35 products, which gives rise to another catalytic cycle. Indeed, the use of long-chain antisense 

polynucleotide (at least 30 bases long) or ribozymes with long antisense arms are advantageous. A 
preferred delivery system for antisense ribozyme is achieved by covalently linking these antisense 
ribozymes to lipophilic groups or to use liposomes as a convenient vector. Preferred antisense 
ribozymes according to the present invention are prepared as described by Rossi et al 9 (1991) and 
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Sczakiel et al. (1995), the specific preparation procedures being referred to in said articles being 
herein incorporated by reference. 

Triple Helix Approach 

The GENSET genomic DNA may also be used to inhibit the expression of the GENSET 
5 gene based on intracellular triple helix formation. 

Triple helix oligonucleotides are used to inhibit transcription from a genome. They are 
particularly useful for studying alterations in cell activity when it is associated with a particular 
gene. The GENSET cDNAs or genomic DNAs of the present invention or, more preferably, a 
fragment of those sequences, can be used to inhibit gene expression in individuals having diseases 
10 associated with expression of a particular gene. Similarly, a portion of the GENSET genomic DNA 
can be used to study the effect of inhibiting GENSET gene transcription within a cell. 
Traditionally, homopurine sequences were considered the most useful for triple helix strategies. 
However, homopyrimidine sequences can also inhibit gene expression. Such homopyrimidine 
oligonucleotides bind to the major groove at homopurine:homopyrimidine sequences. Thus, both 
1 5 types of sequences from the GENSET genomic DNA are contemplated within the scope of this 
invention. 

To carry out gene therapy strategies using the triple helix approach, the sequences of the 
GENSET genomic DNA are first scanned to identify 10-mer to 20-mer homopyrimidine or 
homopurine stretches which could be used in triple-helix based strategies for inhibiting GENSET 
20 expression. Following identification of candidate homopyrimidine or homopurine stretches, their 
efficiency in inhibiting GENSET expression is assessed by introducing varying amounts of 
oligonucleotides containing the candidate sequences into tissue culture cells which express the 
GENSET gene. 

The oligonucleotides can be introduced into the cells using a variety of methods known to 
25 those skilled in the art, including but not limited to calcium phosphate precipitation, DEAE- 
Dextran, electroporation, liposome-mediated transfection or native uptake. 

Treated cells are monitored for altered cell function or reduced GENSET expression using 
techniques such as Northern blotting, RNase protection assays, or PCR based strategies to monitor 
the transcription levels of the GENSET gene in cells which have been treated with the 
30 oligonucleotide. The cell functions to be monitored are predicted based upon the homologies of the 
target gene corresponding to the cDNA from which the oligonucleotide was derived with known 
gene sequences that have been associated with a particular function. The cell functions can also be 
predicted based on the presence of abnormal physiology within cells derived from individuals with 
a particular inherited disease, particularly when the cDNA is associated with the disease using 
35 techniques described in the section entitled "Identification of genes associated with hereditary 
diseases or drug response". 
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The oligonucleotides which are effective in inhibiting gene expression in tissue culture cells 

may then be introduced in vivo using the techniques and at a dosage calculated based on the in vitro 

results, as described in the section entitled "Antisense Approach". 

In some embodiments, the natural (beta) anomers of the oligonucleotide units can be 
5 replaced with alpha anomers to render the oligonucleotide more resistant to nucleases. Further, an 

intercalating agent such as ethidium bromide, or the like, can be attached to the 3' end of the alpha 

oligonucleotide to stabilize the triple helix. For information on the generation of oligonucleotides 

suitable for triple helix formation see Griffin et ah (1989), which is hereby incorporated by this 

reference. 

1 0 Treating GENSET gene-related disorders 

The present invention further relates to methods, uses of GENSET polypeptides and 
polynucleotides, and uses of modulators of GENSET polypeptides and polynucleotides, for treating 
diseases/disorders associated with GENSET genes by increasing or decreasing GENSET gene 
activity and/or expression. These methodologies can be effected using compounds selected using 

1 5 screening protocols such as those described herein and/or by using the gene therapy and antisense 
approaches described in the art and herein. Gene therapy can be used to effect targeted expression 
of GENSET genes in any tissue, e.g. a tissue associated with the disease or condition to be treated. 
The GENSET coding sequence can be cloned into an appropriate expression vector and targeted to 
a particular cell type(s) to achieve efficient, high level expression. Introduction of the GENSET 

20 coding sequence into target cells can be achieved, for example, using particle mediated DNA 

delivery, (Haynes, 1996 and Maurer, 1999), direct injection of naked DNA, (Levy et aL, 1996; and 
Feigner, 1996), or viral vector mediated transport (Smith et al, 1996, Stone et al, 2000; Wu and 
Atai, 2000), each of which disclosures are hereby incorporated by reference in their entireties. 
Tissue specific effects can be achieved, for example, in the case of virus mediated transport by 

25 using viral vectors that are tissue specific, or by the use of promoters that are tissue specific. For 
instance, any tissue-specific promoter may be used to achieve specific expression, for example 
albumin promoters (liver specific; Pinkert et al., 1987 Genes Dev. 1:268-277), lymphoid specific 
promoters (Calame et al., 1988 Adv. Immunol. 43:235-275), promoters of T-cell receptors (Winoto 
et al., 1989 EMBO J. 8:729-733) and immunoglobulins (Banerji et al., 1983 Cell 33:729-740; 

30 Queen and Baltimore 1983 Cell 33:741-748), neuron-specific promoters (e.g. the neurofilament 
promoter; Byrne et al., 1989 Proc. Natl. Acad. Sci. USA 86:5473-5477), pancreas-specific 
promoters (Edlunch et al., 1985 Science 230:912-916) or mammary gland-specific promoters (milk 
whey promoter, U.S. Pat. No. 4,873,316 and European Application Publication No. 264, 166). 
Developmentally-regulated promoters can also be used, such as the murine homeobox promoters 

35 (Kessel et al., 1990 Science 249:374-379) or the alpha-fetoprotein promoter (Campes et al., 1989 
Genes Dev. 3:537-546). 

Combinatorial approaches can also be used to ensure that the GENSET coding sequence is 
activated in the target tissue (Butt and Karathanasis, 1995; Miller and Whelan, 1997), which 
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disclosures are hereby incorporated by reference in their entireties. Antisense oligonucleotides 
complementary to GENSET mRNA can be used to selectively diminish or ablate the expression of 
the protein, for example, at sites of inflammation. More specifically, antisense constructs or 
antisense oligonucleotides can be used to inhibit the production of GENSET in high expressing 
5 cells such as those cited in the third column of Table V. Antisense mRNA can be produced by 
transfecting into target cells an expression vector with the GENSET gene sequence, or a portion 
thereof, oriented in an antisense direction relative to the direction of transcription. Appropriate 
vectors include viral vectors, including retroviral, adenoviral, and adeno-associated viral vectors, as 
well as nonviral vectors. Tissue specific promoters can be used, as described supra. Alternatively, 

10 antisense oligonucleotides can be introduced directly into target cells to achieve the same goal. 
(See also other delivery methodologies described herein in connection with gene therapy.). 
Oligonucleotides can be selected/designed to achieve a high level of specificity (Wagner et al. 9 
1996), which disclosure is hereby incorporated by reference in its entirety. The therapeutic 
methodologies described herein are applicable to both human and non-human mammals (including 

1 5 cats and dogs). 

Pharmaceutical and physiologically acceptable compositions 

The present invention also relates to pharmaceutical or physiologically acceptable 
compositions comprising, as active agent, the polypeptides, nucleic acids or antibodies of the 
invention. The invention also relates to compositions comprising, as active agent, compounds 

20 selected using the above-described screening protocols. Such compositions include the active agent 
in combination with a pharmaceutical or physiologically acceptable carriers. In the case of naked 
DNA, the "carrier" may be gold particles. The amount of active agent in the composition can vary 
with the agent, the patient and the effect sought. Likewise, the dosing regimen can vary depending 
on the composition and the disease/disorder to be treated. 

25 Therefore, the invention related to methods for the production of pharmaceutical 

composition comprising a method for selecting an active agent, compound, substance or molecule 
using any of the screening method described herein and furthermore mixing the identified active 
agent, compound, substance or molecule with a pharmaceutical ly acceptable carrier. 

The pharmaceutical compositions utilized in this invention may be administered by any 

30 number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, 

intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, 
enteral, topical, sublingual, or rectal means. In addition to the active ingredients, these 
pharmaceutical compositions may contain suitable pharmaceutically acceptable carriers comprising 
excipients and auxiliaries which facilitate processing of the active compounds into preparations 

35 which can be used pharmaceutically. Further details on techniques for formulation and 

administration may be found in the latest edition of Remington's Pharmaceutical Sciences (Maack 
Publishing Co. Easton, Pa). 
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Pharmaceutical compositions for oral administration can be formulated using 
pharmaceutical^ acceptable carriers well known in the art in dosages suitable for oral 
administration. Such carriers enable the pharmaceutical compositions to be formulated as tablets, 
pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, and the like, for ingestion by the 
5 patient. 

Pharmaceutical preparations for oral use can be obtained through a combination of active 
compounds with solid excipient, suiting mixture is optionally grinding, and processing the mixture 
of granules, after adding suitable auxiliaries, if desired, to obtain tablets or dragee cores. Suitable 
excipients are carbohydrate or protein fillers, such as sugars, including lactose, sucrose, mannitol, or 

10 sorbitol; starch from corn, wheat, rice, potato, or other plants; cellulose, such as methyl cellulose, 
hydroxypropylmethyl-cellulose, or sodium carboxymethylcellulose; gums including arabic and 
tragacanth; and proteins such as gelatin and collagen. If desired, disintegrating or solubilizing 
agents may be added, such as the cross-linked polyvinyl pyrrolidone, agar, alginic acid, or a salt 
thereof, such as sodium alginate. 

1 5 Dragee cores may be used in conjunction with suitable coatings, such as concentrated sugar 

solutions, which may also contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel, 
polyethylene glycol, and/or titaniumdioxide, lacquer solutions, and suitable organic solvents or 
solvent mixtures. Dyestuffs or pigments may be added to the tablets or dragee coatings for product 
identification or to characterize the quantity of active compound, i.e., dosage. 

20 Pharmaceutical preparations which can be used orally include push-fit capsules made of 

gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as glycerol or sorbitol. 
Push-fit capsules can contain active ingredients mixed with a filler or binders, such as lactose or 
starches, lubricants, such as talc or magnesium stearate, and, optionally, stabilizers. In soft 
capsules, the active compounds may be dissolved or suspended in suitable liquids, such as fatty oils, 

25 liquid, or liquidpolyethylene glycol with or without stabilizers. 

Pharmaceutical formulations suitable for parenteral administration may be formulated in 
aqueous solutions, preferably in physiologically compatible buffers such as Hanks solution, 
Ringer's solution, or physiologically buffered saline. Aqueous injection suspensions may contain 
substances which increase the viscosity of the suspension, such as sodium carboxymethylcellulose, 

30 sorbitol, or dextran. Additionally, suspensions of the active compounds may be prepared as 
appropriate oily injection suspensions. Suitable lipophilic solvents or vehicles include fatty oils 
such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate or triglycerides, or liposomes. 
Optionally, the suspension may also contain suitable stabilizers or agents which increase the 
solubility of the compounds to allow for the preparation of highly concentrated solutions. 

35 For topical or nasal administration, penetrants appropriate to the particular barrier to be 

permeated are used in the formulation. Such penetrants are generally known in the art. 

The pharmaceutical compositions of the present invention may be manufactured in a 
manner that is known in the art, e.g., by means of conventional mixing, dissolving, granulating, 
dragee-making, levigating, emulsifying, encapsulating, entrapping, or lyophilizing processes. 
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The pharmaceutical composition may be provided as a salt and can be formed with many 
acids, including but not limited to, hydrochloric, sulfuric, acetic, lactic, tartaric, malic, succinic, etc. 
Salts tend to be more soluble in aqueous or other protonic solvents than are the corresponding free 
base forms. In other cases, the preferred preparation may be a lyophilized powder which may 
5 contain any or all of the following: 1-50 mM histidine, 0.1%-2% sucrose, and 2-7% mannitol, at a 
pH range of 4.5 to 5.5, that is combined with buffer prior to use. 

After pharmaceutical compositions have been prepared, they can be placed in an 
appropriate container and labeled for treatment of an indicated condition. For administration of a 
GENSET polypeptide, such labeling would include amount, frequency, and method of 
10 administration. 

Pharmaceutical compositions suitable for use in the invention include compositions wherein 
the active ingredients are contained in an effective amount to achieve the intended purpose. The 
determination of an effective dose is well within the capability of those skilled in the art. 

For any compound, the therapeutically effective dose can be estimated initially either in cell 

15 culture assays, e.g., of neoplastic cells, or in animal models, usually mice, rabbits, dogs, or pigs. 
The animal model may also be used to determine the appropriate concentration range and route of 
administration. Such information can then be used to determine useful doses and routes for 
administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example a 

20 GENSET polypeptide or fragments thereof, antibodies specific to GENSET polypeptides, agonists, 
antagonists or inhibitors of GENSET polypeptides, which ameliorates the symptoms or condition. 
Therapeutic efficacy and toxicity may be determined by standard pharmaceutical procedures in cell 
cultures or experimental animals, e.g., ED50 (the dose therapeutically effective in 50% of the 
population) and LD50 (the dose lethal to 50% of the population). The dose ratio between 

25 therapeutic and toxic effects is the therapeutic index, and it can be expressed as the ratio, 

LD50/ED50. Pharmaceutical compositions which exhibit large therapeutic indices are preferred. 
The data obtained from cell culture assays and animal studies is used in formulating a range of 
dosage for human use. The dosage contained in such compositions is preferably within a range of 
circulating concentrations that include the ED50 with little or no toxicity. The dosage varies within 

30 this range depending upon the dosage form employed, sensitivity of the patient, and the route of 
administration. 

The exact dosage will be determined by the practitioner, in light of factors related to the 
subject that requires treatment. Dosage and administration are adjusted to provide sufficient levels 
of the active moiety or to maintain the desired effect. Factors which may be taken into account 
35 include the severity of the disease state, general health of the subject, age, weight, and gender of the 
subject, diet, time and frequency of administration, drug combination(s), reaction sensitivities, and 
tolerance/response to therapy. Long-acting pharmaceutical compositions maybe administered every 
3 to 4 days, every week, or once every two weeks depending on half-life and clearance rate of the 
particular formulation. 
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Normal dosage amounts may vary from 0.1 to 100,000 micrograms, up to a total dose of 
about 1 g, depending upon the route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art. 
Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
5 inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 
conditions, locations, etc. 



Uses of Genset Sequences: Computer-Rela ted Embodiments 

As used herein the term "cDNA codes of SEQ ID NOs:l-169, 339-455, 561-784" 

10 encompasses the nucleotide sequences of SEQ ID NOs:l-169, 339-455, 561-784 and of clones 
inserts of the deposited clone pool, fragments thereof, nucleotide sequences homologous thereto, 
and sequences complementary to all of the preceding sequences. The fragments include fragments 
of SEQ ID NOs: 1-1 69, 339-455,561-784 comprising at least 8, 10, 12, 15, 18, 20,25, 28, 30,35, 
40, 50, 75, 100, 150, 200, 300, 400, 500, 1000 or 2000 consecutive nucleotides of SEQ ID NOs: 1- 

15 169, 339-455, 561-784. Preferably the fragments include polynucleotides described herein as 

encoding polypeptides having a biological activity. Homologous sequences and fragments of SEQ 
ID NOs: 1-169, 339-455, 561-784 refer to a sequence having at least 99%, 98%, 97%, 96%, 95%, 
90%, 85%, 80%, or 75% identity to these sequences. Identity may be determined using any of the 
computer programs and parameters described herein, including BLAST2N with the default 

20 parameters or with any modified parameters. Homologous sequences also include RNA sequences 
in which uridines replace the thymines in the cDNA codes of SEQ ID NOs: 1 -1 69, 339-455, 561- 
784. The homologous sequences may be obtained using any of the procedures described herein or 
may result from the correction of a sequencing error as described above. It will be appreciated that 
the cDNA codes of SEQ ID NOs: 1-1 69, 339-455, 561-784 can be represented in the traditional 

25 single character format (see, e.g. the inside back cover of Stryer, 1995) or in any other format which 
records the identity of the nucleotides in a sequence. 

As used herein the term "polypeptide codes of SEQ ID NOs: 170-338, 456-560, 785-918" 
encompasses the polypeptide sequences of SEQ ID NOs: 170-338, 456-560, 785-918 which are 
encoded by the cDNAs of SEQ ID NOs: 1-169, 339-455, 561-784, the polypeptide sequences 

30 encoded by the clone inserts of the deposited clone pool, polypeptide sequences homologous 

thereto, or fragments of any of the preceding sequences. Homologous polypeptide sequences refer 
to a polypeptide sequence having at least 99%, 98%, 97%, 96%, 95%, 90%, 85%, 80%, 75% 
identity to one of the polypeptide sequences of SEQ ID NOs: 170-33 8, 456-560, 785-918. Identity 
may be determined using any of the computer programs and parameters described herein, including 

35 FASTA with the default parameters or with any modified parameters. The homologous sequences 
may be obtained using any of the procedures described herein or may result from the correction of a 
sequencing error as described above. The polypeptide fragments comprise at least 5, 6, 8, 10, 12, 
1 5, 20, 25, 30, 35, 40, 50, 60, 75, 100, 1 50 or 200 consecutive amino acids of the polypeptides of 
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SEQ ID NOs: 1 70-338, 456-560, 785-918. Preferably, the fragments include polypeptides described 
herein as having a biological activity, or fragments comprising at least 5, 10, 15, 20, 25, 30, 35, 40, 
50, 75, 100, or 150 consecutive amino acids of polypeptides described herein as having a biological 
activity. It will be appreciated that the polypeptide codes of the SEQ ID NOs: 170-338, 456-560, 
5 785-918 can be represented in the traditional single character format or three letter format (see, the 
inside back cover of Stryer, 1995) or in any other format which relates the identity of the 
polypeptides in a sequence. 

It will be appreciated by those skilled in the art that the nucleic acid codes of the invention 
and polypeptide codes of the invention can be stored, recorded, and manipulated on any medium 

10 which can be read and accessed by a computer. As used herein, the words "recorded" and "stored" 
refer to a process for storing information on a computer medium. A skilled artisan can readily 
adopt any of the presently known methods for recording information on a computer readable 
medium to generate manufactures comprising one or more of the nucleic acid codes of the 
invention, or one or more of the polypeptide codes of the invention. Another aspect of the present 

15 invention is a computer readable medium having recorded thereon at least 2, 5, 10, 15, 20, 25, 30, or 
50 nucleic acid codes of the invention. Another aspect of the present invention is a computer 
readable medium having recorded thereon at least 2, 5, 10, 15, 20, 25, 30, or 50 polypeptide codes 
of the invention. 

Computer readable media include magnetically readable media, optically readable media, 
20 electronically readable media and magnetic/optical media. For example, the computer readable 
media may be a hard disk, a floppy disk, a magnetic tape, CD-ROM, Digital Versatile Disk (DVD), 
Random Access Memory (RAM), or Read Only Memory (ROM) as well as other types of other 
media known to those skilled in the art. 

Embodiments of the present invention include systems, particularly computer systems 
25 which store and manipulate the sequence information described herein. One example of a computer 
system 100 is illustrated in block diagram form in Figure 1. As used herein, "a computer system" 
refers to the hardware components, software components, and data storage components used to 
analyze the nucleotide sequences of the nucleic acid codes of the invention or the amino acid 
sequences of the polypeptide codes of the invention. In one embodiment, the computer system 1 00 
30 is a Sun Enterprise 1000 server (Sun Microsystems, Palo Alto, CA). The computer system 100 
preferably includes a processor for processing, accessing and manipulating the sequence data. The 
processor 105 can be any well-known type of central processing unit, such as the Pentium III from 
Intel Corporation, or similar processor from Sun, Motorola, Compaq or International Business 
Machines. 

35 Preferably, the computer system 100 is a general purpose system that comprises the 

processor 105 and one or more internal data storage components 1 10 for storing data, and one or 
more data retrieving devices for retrieving the data stored on the data storage components. A 
skilled artisan can readily appreciate that any one of the currently available computer systems are 
suitable. 
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In one particular embodiment, the computer system 100 includes a processor 105 connected 
to a bus which is connected to a main memory 1 1 5 (preferably implemented as RAM) and one or 
more internal data storage devices 1 10, such as a hard drive and/or other computer readable media 
having data recorded thereon. In some embodiments, the computer system 100 further includes one 
5 or more data retrieving device 1 1 8 for reading the data stored on the internal data storage devices 
110. 

The data retrieving device 1 18 may represent, for example, a floppy disk drive, a compact 
disk drive, a magnetic tape drive, etc. In some embodiments, the internal data storage device 1 10 is 
a removable computer readable medium such as a floppy disk, a compact disk, a magnetic tape, etc. 

10 containing control logic and/or data recorded thereon. The computer system 100 may 

advantageously include or be programmed by appropriate software for reading the control logic 
and/or the data from the data storage component once inserted in the data retrieving device. 

The computer system 100 includes a display 120 which is used to display output to a 
computer user. It should also be noted that the computer system 100 can be linked to other 

15 computer systems 125a-c in a network or wide area network to provide centralized access to the 
computer system 1 00. 

Software for accessing and processing the nucleotide sequences of the nucleic acid codes of 
the invention or the amino acid sequences of the polypeptide codes of the invention (such as search 
tools, compare tools, and modeling tools etc.) may reside in main memory 115 during execution. 

20 In some embodiments, the computer system 100 may further comprise a sequence comparer 

for comparing the above-described nucleic acid codes of the invention or the polypeptide codes of 
the invention stored on a computer readable medium to reference nucleotide or polypeptide 
sequences stored on a computer readable medium. A "sequence comparer" refers to one or more 
programs which are implemented on the computer system 100 to compare a nucleotide or 

25 polypeptide sequence with other nucleotide or polypeptide sequences and/or compounds including 
but not limited to peptides, peptidomimetics, and chemicals stored within the data storage means. 
For example, the sequence comparer may compare the nucleotide sequences of nucleic acid codes 
of the invention or the amino acid sequences of the polypeptide codes of the invention stored on a 
computer readable medium to reference sequences stored on a computer readable medium to 

30 identify homologies, motifs implicated in biological function, or structural motifs. The various 
sequence comparer programs identified elsewhere in this patent specification are particularly 
contemplated for use in this aspect of the invention. 

Figure 2 is a flow diagram illustrating one embodiment of a process 200 for comparing a 
new nucleotide or protein sequence with a database of sequences in order to determine the 

35 homology levels between the new sequence and the sequences in the database. The database of 
sequences can be a private database stored within the computer system 100, or a public database 
such as GENBANK, PIR OR SWISSPROT that is available through the Internet. 
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The process 200 begins at a start state 201 and then moves to a state 202 wherein the new 
sequence to be compared is stored to a memory in a computer system 100. As discussed above, the 
memory could be any type of memory, including RAM or an internal storage device. 

The process 200 then moves to a state 204 wherein a database of sequences is opened for 
5 analysis and comparison. The process 200 then moves to a state 206 wherein the first sequence 
stored in the database is read into a memory on the computer. A comparison is then performed at a 
state 210 to determine if the first sequence is the same as the second sequence. It is important to 
note that this step is not limited to performing an exact comparison between the new sequence and 
the first sequence in the database. Well-known methods are known to those of skill in the art for 

10 comparing two nucleotide or protein sequences, even if they are not identical. For example, gaps 
can be introduced into one sequence in order to raise the homology level between the two tested 
sequences. The parameters that control whether gaps or other features are introduced into a 
sequence during comparison are normally entered by the user of the computer system. 

Once a comparison of the two sequences has been performed at the state 210, a 

1 5 determination is made at a decision state 210 whether the two sequences are the same. Of course, 
the term "same" is not limited to sequences that are absolutely identical. Sequences that are within 
the homology parameters entered by the user will be marked as "same" in the process 200. 

If a determination is made that the two sequences are the same, the process 200 moves to a 
state 214 wherein the name of the sequence from the database is displayed to the user. This state 

20 notifies the user that the sequence with the displayed name fulfills the homology constraints that 
were entered. Once the name of the stored sequence is displayed to the user, the process 200 moves 
to a decision state 218 wherein a determination is made whether more sequences exist in the 
database. If no more sequences exist in the database, then the process 200 terminates at an end state 
220. However, if more sequences do exist in the database, then the process 200 moves to a state 

25 224 wherein a pointer is moved to the next sequence in the database so that it can be compared to 
the new sequence. In this manner, the new sequence is aligned and compared with every sequence 
in the database. 

It should be noted that if a determination had been made at the decision state 212 that the 
sequences were not homologous, then the process 200 would move immediately to the decision 
30 state 218 in order to determine if any other sequences were available in the database for 
comparison. 

Accordingly, one aspect of the present invention is a computer system comprising a 
processor, a data storage device having stored thereon a nucleic acid code of the invention or a 
polypeptide code of the invention, a data storage device having retrievably stored thereon reference 
35 nucleotide sequences or polypeptide sequences to be compared to the nucleic acid code of the 
invention or polypeptide code of the invention and a sequence comparer for conducting the 
comparison. The sequence comparer may indicate a homology level between the sequences 
compared or identify motifs implicated in biological function and structural motifs in the nucleic 
acid code of the invention and polypeptide codes of the invention or it may identify structural 
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motifs in sequences which are compared to these nucleic acid codes and polypeptide codes. In 
some embodiments, the data storage device may have stored thereon the sequences of at least 2, 5, 
10, 15, 20, 25, 30, or 50 of the nucleic acid codes of the invention or polypeptide codes of the 
invention. 

5 Another aspect of the present invention is a method for determining the level of homology 

between a nucleic acid code of the invention and a reference nucleotide sequence, comprising the 
steps of reading the nucleic acid code and the reference nucleotide sequence through the use of a 
computer program which determines homology levels and determining homology between the 
nucleic acid code and the reference nucleotide sequence with the computer program. The computer 

10 program may be any of a number of computer programs for determining homology levels, including 
those specifically enumerated herein, including BLAST2N with the default parameters or with any 
modified parameters. The method may be implemented using the computer systems described 
above. The method may also be performed by reading 2, 5, 10, 15, 20, 25, 30, or 50 of the above 
described nucleic acid codes of the invention through the use of the computer program and 

15 determining homology between the nucleic acid codes and reference nucleotide sequences. 

Figure 3 is a flow diagram illustrating one embodiment of a process 250 in a computer for 
determining whether two sequences are homologous. The process 250 begins at a start state 252 and 
then moves to a state 254 wherein a first sequence to be compared is stored to a memory. The 
second sequence to be compared is then stored to a memory at a state 256. The process 250 then 

20 moves to a state 260 wherein the first character in the first sequence is read and then to a state 262 
wherein the first character of the second sequence is read. It should be understood that if the 
sequence is a nucleotide sequence, then the character would normally be either A, T, C, G or U. If 
the sequence is a protein sequence, then it should be in the single letter amino acid code so that the 
first and sequence sequences can be easily compared. 

25 A determination is then made at a decision state 264 whether the two characters are the 

same. If they are the same, then the process 250 moves to a state 268 wherein the next characters in 
the first and second sequences are read. A determination is then made whether the next characters 
are the same. If they are, then the process 250 continues this loop until two characters are not the 
same. If a determination is made that the next two characters are not the same, the process 250 

30 moves to a decision state 274 to determine whether there are any more characters either sequence to 
read. 

If there are no more characters to read, then the process 250 moves to a state 276 wherein 
the level of homology between the first and second sequences is displayed to the user. The level of 
homology is determined by calculating the proportion of characters between the sequences that 
35 were the same out of the total number of sequences in the first sequence. Thus, if every character in 
a first 100 nucleotide sequence aligned with a every character in a second sequence, the homology 
level would be 100%. 

Alternatively, the computer program may be a computer program which compares the 
nucleotide sequences of the nucleic acid codes of the present invention, to reference nucleotide 
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sequences in order to determine whether the nucleic acid code of the invention differs from a 
reference nucleic acid sequence at one or more positions. Optionally such a program records the 
length and identity of inserted, deleted or substituted nucleotides with respect to the sequence of 
either the reference polynucleotide or the nucleic acid code of the invention. In one embodiment, 
5 the computer program may be a program which determines whether the nucleotide sequences of the 
nucleic acid codes of the invention contain one or more single nucleotide polymorphisms (SNP) 
with respect to a reference nucleotide sequence. These single nucleotide polymorphisms may each 
comprise a single base substitution, insertion, or deletion. 

Another aspect of the present invention is a method for determining the level of homology 

10 between a polypeptide code of the invention and a reference polypeptide sequence, comprising the 
steps of reading the polypeptide code of the invention and the reference polypeptide sequence 
through use of a computer program which determines homology levels and determining homology 
between the polypeptide code and the reference polypeptide sequence using the computer program. 
Accordingly, another aspect of the present invention is a method for determining whether a 

15 nucleic acid code of the invention differs at one or more nucleotides from a reference nucleotide 
sequence comprising the steps of reading the nucleic acid code and the reference nucleotide 
sequence through use of a computer program which identifies differences between nucleic acid 
sequences and identifying differences between the nucleic acid code and the reference nucleotide 
sequence with the computer program. In some embodiments, the computer program is a program 

20 which identifies single nucleotide polymorphisms. The method may be implemented by the 

computer systems described above and the method illustrated in Figure 3. The method may also be 
performed by reading at least 2, 5, 10, 1 5, 20, 25, 30, or 50 of the nucleic acid codes of the 
invention and the reference nucleotide sequences through the use of the computer program and 
identifying differences between the nucleic acid codes and the reference nucleotide sequences with 

25 the computer program. 

In other embodiments the computer based system may further comprise an identifier for 
identifying features within the nucleotide sequences of the nucleic acid codes of the invention or the 
amino acid sequences of the polypeptide codes of the invention. An "identifier" refers to one or 
more programs which identifies certain features within the above-described nucleotide sequences of 

30 the nucleic acid codes of the invention or the amino acid sequences of the polypeptide codes of the 
invention. In one embodiment, the identifier may comprise a program which identifies an open 
reading frame in the cDNAs codes of the invention. 

Figure 4 is a flow diagram illustrating one embodiment of an identifier process 300 for 
detecting the presence of a feature in a sequence. The process 300 begins at a start state 302 and 

35 then moves to a state 304 wherein a first sequence that is to be checked for features is stored to a 
memory 1 15 in the computer system 100. The process 300 then moves to a state 306 wherein a 
database of sequence features is opened. Such a database would include a list of each feature's 
attributes along with the name of the feature. For example, a feature name could be "Initiation 
Codon" and the attribute would be "ATG". Another example would be the feature name 
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"TAATAA Box" and the feature attribute would be "TAATAA". An example of such a database is 
produced by the University of Wisconsin Genetics Computer Group (www.gcg.com). 

Once the database of features is opened at the state 306, the process 300 moves to a state 
308 wherein the first feature is read from the database. A comparison of the attribute of the first 
5 feature with the first sequence is then made at a state 310. A determination is then made at a 
decision state 3 1 6 whether the attribute of the feature was found in the first sequence. If the 
attribute was found, then the process 300 moves to a state 3 1 8 wherein the name of the found 
feature is displayed to the user. 

The process 300 then moves to a decision state 320 wherein a determination is made 

1 0 whether move features exist in the database. If no more features do exist, then the process 300 
terminates at an end state 324. However, if more features do exist in the database, then the process 
300 reads the next sequence feature at a state 326 and loops back to the state 3 10 wherein the 
attribute of the next feature is compared against the first sequence. 

It should be noted, that if the feature attribute is not found in the first sequence at the 

1 5 decision state 3 1 6, the process 300 moves directly to the decision state 320 in order to determine if 
any more features exist in the database. 

In another embodiment, the identifier may comprise a molecular modeling program which 
determines the 3-dimensional structure of the polypeptides codes of the invention. Such programs 
may use any methods known to those skilled in the art including methods based on homology- 

20 modeling, fold recognition and ab initio methods as described in Sternberg et al, 1999, which 

disclosure is hereby incorporated by reference in its entirety. In some embodiments, the molecular 
modeling program identifies target sequences that are most compatible with profiles representing 
the structural environments of the residues in known three-dimensional protein structures. (See, 
e.g., Eisenberg et al., U.S. Patent No. 5,436,850 issued July 25, 1995, which disclosure is hereby 

25 incorporated by reference in its entirety). In another technique, the known three-dimensional 
structures of proteins in a given family are superimposed to define the structurally conserved 
regions in that family. This protein modeling technique also uses the known three-dimensional 
structure of a homologous protein to approximate the structure of the polypeptide codes of the 
invention. (See e.g., Srinivasan, et al., U.S. Patent No. 5,557,535 issued September 17, 1996, which 

30 disclosure is hereby incorporated by reference in its entirety). Conventional homology modeling 
techniques have been used routinely to build models of proteases and antibodies. (Sowdhamini et 
al., (1997)). Comparative approaches can also be used to develop three-dimensional protein models 
when the protein of interest has poor sequence identity to template proteins. In some cases, proteins 
fold into similar three-dimensional structures despite having very weak sequence identities. For 

35 example, the three-dimensional structures of a number of helical cytokines fold in similar three- 
dimensional topology in spite of weak sequence homology. 

The recent development of threading methods now enables the identification of likely 
folding patterns in a number of situations where the structural relatedness between target and 
template(s) is not detectable at the sequence level. Hybrid methods, in which fold recognition is 
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performed using Multiple Sequence Threading (MST), structural equivalencies are deduced from 
the threading output using a distance geometry program DRAGON to construct a low resolution 
model, and a full-atom representation is constructed using a molecular modeling package such as 
QUANTA. 

5 According to this 3-step approach, candidate templates are first identified by using the 

novel fold recognition algorithm MST, which is capable of performing simultaneous threading of 
multiple aligned sequences onto one or more 3-D structures. In a second step, the structural 
equivalencies obtained from the MST output are converted into interresidue distance restraints and 
fed into the distance geometry program DRAGON, together with auxiliary information obtained 

10 from secondary structure predictions. The program combines the restraints in an unbiased manner 
and rapidly generates a large number of low resolution model confirmations. In a third step, these 
low resolution model confirmations are converted into full-atom models and subjected to energy 
minimization using the molecular modeling package QUANTA. (See e.g., Aszodi et al., (1997)). 
The results of the molecular modeling analysis may then be used in rational drug design 

1 5 techniques to identify agents which modulate the activity of the polypeptide codes of the invention. 

Accordingly, another aspect of the present invention is a method of identifying a feature 
within the nucleic acid codes of the invention or the polypeptide codes of the invention comprising 
reading the nucleic acid code(s) or the polypeptide code(s) through the use of a computer program 
which identifies features therein and identifying features within the nucleic acid code(s) or 

20 polypeptide code(s) with the computer program. In one embodiment, computer program comprises 
a computer program which identifies open reading frames. In a further embodiment, the computer 
program identifies linear or structural motifs in a polypeptide sequence. In another embodiment, 
the computer program comprises a molecular modeling program. The method may be performed by 
reading a single sequence or at least 2, 5, 10, 15, 20, 25, 30, or 50 of the nucleic acid codes of the 

25 invention or the polypeptide codes of the invention through the use of the computer program and 
identifying features within the nucleic acid codes or polypeptide codes with the computer program. 

The nucleic acid codes of the invention or the polypeptide codes of the invention may be 
stored and manipulated in a variety of data processor programs in a variety of formats. For 
example, they may be stored as text in a word processing file, such as Microsoft WORD or 

30 WORDPERFECT or as an ASCII file in a variety of database programs familiar to those of skill in 
the art, such as DB2, SYBASE, or ORACLE. In addition, many computer programs and databases 
may be used as sequence comparers, identifiers, or sources of reference nucleotide or polypeptide 
sequences to be compared to the nucleic acid codes of the invention or the polypeptide codes of the 
invention. The following list is intended not to limit the invention but to provide guidance to 

35 programs and databases which are useful with the nucleic acid codes of the invention or the 

polypeptide codes of the invention. The programs and databases which may be used include, but 
are not limited to: MacPattern (EMBL), DiscoveryBase (Molecular Applications Group), GeneMine 
(Molecular Applications Group), Look (Molecular Applications Group), MacLook (Molecular 
Applications Group), BLAST and BLAST2 (NCBI), BLASTN and BLASTX (Altschul et al, 1990), 
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FASTA (Pearson and Lipman, 1988), FASTDB (Brutlag et aL, 1990), Catalyst (Molecular 
Simulations Inc.), Catalyst/SHAPE (Molecular Simulations Inc.), Cerius2.DB Access (Molecular 
Simulations Inc.), HypoGen (Molecular Simulations Inc.), Insight II, (Molecular Simulations Inc.), 
Discover (Molecular Simulations Inc.), CHARMm (Molecular Simulations Inc.), Felix (Molecular 
5 Simulations Inc.), DelPhi, (Molecular Simulations Inc.), QuanteMM, (Molecular Simulations Inc.), 
Homology (Molecular Simulations Inc.), Modeler (Molecular Simulations Inc.), ISIS (Molecular 
Simulations Inc.), Quanta/Protein Design (Molecular Simulations Inc.), WebLab (Molecular 
Simulations Inc.), WebLab Diversity Explorer (Molecular Simulations Inc.), Gene Explorer 
(Molecular Simulations Inc.), SeqFold (Molecular Simulations Inc.), the EMBL/Swissprotein 

10 database, the MDL Available Chemicals Directory database, the MDL Drug Data Report data base, 
the Comprehensive Medicinal Chemistry database, Derwents's World Drug Index database, the 
BioByteMasterFile database, the Genbank database, and the Genseqn database. Many other 
programs and data bases would be apparent to one of skill in the art given the present disclosure. 
Motifs which may be detected using the above programs include sequences encoding 

15 leucine zippers, helix-turn-helix motifs, glycosylation sites, ubiquitination sites, alpha helices, and 
beta sheets, signal sequences encoding signal peptides which direct the secretion of the encoded 
proteins, sequences implicated in transcription regulation such as homeoboxes, acidic stretches, 
enzymatic active sites, substrate binding sites, and enzymatic cleavage sites. 

Conclusion 

20 As discussed above, the GENSET polynucleotides and polypeptides of the present 

invention or fragments thereof can be used for various purposes. The polynucleotides can be used 
to express recombinant protein for analysis, characterization or therapeutic use; as markers for 
tissues in which the corresponding protein is preferentially expressed (either constitutively or at a 
particular stage of tissue differentiation or development or in disease states); as molecular weight 

25 markers on Southern gels; as chromosome markers or tags (when labeled) to identify chromosomes 
or to map related gene positions; as a reagent (including a labeled reagent) in assays designed to 
quantitatively determine levels of GENSET expression in biological samples; to compare with 
endogenous DNA sequences in patients to identify potential genetic disorders; as probes to 
hybridize and thus discover novel, related DNA sequences; as a source of information to derive 

30 PCR primers for genetic fingerprinting; for selecting and making oligomers for attachment to a 

"gene chip M or other support, including for examination for expression patterns; to raise anti-protein 
antibodies using DNA immunization techniques; and as an antigen to raise anti-DNA antibodies or 
elicit another immune response. Where the polynucleotide encodes a protein which binds or 
potentially binds to another protein (such as, for example, in a receptor-ligand interaction), the 

35 polynucleotide can also be used in interaction trap assays (such as, for example, that described in 
Gyuris et aL, (1993) to identify polynucleotides encoding the other protein with which binding 
occurs or to identify inhibitors of the binding interaction. 
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The proteins or polypeptides provided by the present invention can similarly be used in 
assays to determine biological activity, including in a panel of multiple proteins for high-throughput 
screening; to raise antibodies or to elicit another immune response; as a reagent (including the 
labeled reagent) in assays designed to quantitatively determine levels of the protein (or its receptor) 
5 in biological fluids; as markers for tissues in which the corresponding protein is preferentially 

expressed (either constitutively or at a particular stage of tissue differentiation or development or in 
a disease state); and, of course, to isolate correlative receptors or ligands. Where the protein binds 
or potentially binds to another protein (such as, for example, in a receptor-ligand interaction), the 
protein can be used to identify the other protein with which binding occurs or to identify inhibitors 
10 of the binding interaction. Proteins involved in these binding interactions can also be used to screen 
for peptide or small molecule inhibitors or agonists of the binding interaction. 

Any or all of these research utilities are capable of being developed into reagent grade or kit 
format for commercialization as research products. 

Methods for performing the uses listed above are well known to those skilled in the art. 
15 References disclosing such methods include without limitation "Molecular Cloning; A Laboratory 
Manual", 2d ed., Cole Spring Harbor Laboratory Press, Sambrook, J., E.F. Fritsch and T. Maniatis 
eds., 1989, and "Methods in Enzymology; Guide to Molecular Cloning Techniques", Academic 
Press, Berger and Kimmel eds., 1 987, which disclosures are hereby incorporated by reference in 
their entireties. 

20 Polynucleotides and proteins of the present invention can also be used as nutritional sources 

or supplements. Such uses include without limitation use as a protein or amino acid supplement, 
use as a carbon source, use as a nitrogen source and use as a source of carbohydrate. In such cases 
the protein or polynucleotide of the invention can be added to the feed of a particular organism or 
can be administered as a separate solid or liquid preparation, such as in the form of powder, pills, 

25 solutions, suspensions or capsules. In the case of microorganisms, the protein or polynucleotide of 
the invention can be added to the medium in or on which the microorganism is cultured. 

Although this invention has been described in terms of certain preferred embodiments, 
other embodiments which will be apparent to those of ordinary skill in the art in view of the 
disclosure herein are also within the scope of this invention. Accordingly, the scope of the 

30 invention is intended to be defined only by reference to the appended claims. 

Examples 

Preparation of Antibody Compositions to the GENSET protein 

Substantially pure protein or polypeptide is isolated from transfected or transformed cells 
35 containing an expression vector encoding the GENSET protein or a portion thereof. The 

concentration of protein in the final preparation is adjusted, for example, by concentration on an 
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Amicon filter device, to the level of a few micrograms/ml. Monoclonal or polyclonal antibody to the 
protein can then be prepared as follows: 

A. Monoclonal Antibody Production by Hybridoma Fusion 

Monoclonal antibody to epitopes in the GENSET protein or a portion thereof can be 
5 prepared from murine hybridomas according to the classical method of Kohler and Milstein, (1975) 
or derivative methods thereof. Also see Harlow and Lane. (1988). 

Briefly, a mouse is repetitively inoculated with a few micrograms of the GENSET protein 
or a portion thereof over a period of a few weeks. The mouse is then sacrificed, and the antibody 
producing cells of the spleen isolated. The spleen cells are fused by means of polyethylene glycol 

10 with mouse myeloma cells, and the excess unfused cells destroyed by growth of the system on 

selective media comprising aminopterin (HAT media). The successfully fused cells are diluted and 
aliquots of the dilution placed in wells of a microtiter plate where growth of the culture is 
continued. Antibody-producing clones are identified by detection of antibody in the supernatant 
fluid of the wells by immunoassay procedures, such as ELISA, as originally described by Engvall, 

15 (1980), which disclosure is hereby incorporated by reference in its entirety, and derivative methods 
thereof. Selected positive clones can be expanded and their monoclonal antibody product harvested 
for use. Detailed procedures for monoclonal antibody production are described in Davis, et aL 
(1986) Section 21-2. 

20 B. Polyclonal Antibody Production by Immunization 

Polyclonal antiserum containing antibodies to heterogeneous epitopes in the GENSET 
protein or a portion thereof can be prepared by immunizing suitable non-human animal with the 
GENSET protein or a portion thereof, which can be unmodified or modified to enhance 
immunogenicity. A suitable non-human animal is preferably a non-human mammal is selected, 

25 usually a mouse, rat, rabbit, goat, or horse. Alternatively, a crude preparation which has been 

enriched for GENSET concentration can be used to generate antibodies. Such proteins, fragments 
or preparations are introduced into the non-human mammal in the presence of an appropriate 
adjuvant (e.g. aluminum hydroxide, R1BI, etc.) which is known in the art. In addition the protein, 
fragment or preparation can be pretreated with an agent which will increase antigenicity, such 

30 agents are known in the art and include, for example, methylated bovine serum albumin (mBSA), 
bovine serum albumin (BSA), Hepatitis B surface antigen, and keyhole limpet hemocyanin (KLH). 
Serum from the immunized animal is collected, treated and tested according to known procedures. 
If the serum contains polyclonal antibodies to undesired epitopes, the polyclonal antibodies can be 
purified by immunoaffinity chromatography. 

35 Effective polyclonal antibody production is affected by many factors related both to the 

antigen and the host species. Also, host animals vary in response to site of inoculations and dose, 
with both inadequate or excessive doses of antigen resulting in low titer antisera. Small doses (ng 
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level) of antigen administered at multiple intradermal sites appears to be most reliable. Techniques 
for producing and processing polyclonal antisera are known in the art. An effective immunization 
protocol for rabbits can be found in Vaitukaitis et al. (1971), which disclosure is hereby 
incorporated by reference in its entirety. 
5 Booster injections can be given at regular intervals, and antiserum harvested when antibody 

titer thereof, as determined semi-quantitatively, for example, by double immunodiffusion in agar 
against known concentrations of the antigen, begins to fall. See, for example, Ouchterlony et al., 
(1973), which disclosure is hereby incorporated by reference in its entirety. Plateau concentration 
of antibody is usually in the range of 0.1 to 0.2 mg/ml of serum (about 12 uM). Affinity of the 

1 0 antisera for the antigen is determined by preparing competitive binding curves, as described, for 
example, by Fisher (1980), which disclosure is hereby incorporated by reference in its entirety. 

Antibody preparations prepared according to either the monoclonal or the polyclonal 
protocol are useful in quantitative immunoassays which determine concentrations of antigen- 
bearing substances in biological samples; they are also used semi-quantitatively or qualitatively to 

1 5 identify the presence of antigen in a biological sample. The antibodies may also be used in 

therapeutic compositions for killing cells expressing the protein or reducing the levels of the protein 
in the body. 
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188208 


29673 
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NUC715 


813496 


30218 
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241681;242553;589203 


30446 
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105288 


30477 






NUC718 
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30583 
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1 17932;194613;225013;331614 
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NUC404 


1000867870 


46679 
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47171 
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48024 
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PRT337 


NUC454 


113448 


48548 
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48603 






NUC753 
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48671 
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48823 
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48901 
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49018 
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49034 
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49133 
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49140 
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49261 
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186091 
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NUC764 


530211 
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313150;313151 


49493 
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49640 
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50241 
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646118 
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523002 
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51796 
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51954 






NUC783 


1 4 1 97 1 ; 1 83 1 1 7;205855;50224 1 


52076 






NUC784 


625028 



TABLE II 



SEO ID NO. 
in priority 
application 


Chromosomal location 


r 51 


3q26.2 
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Chromosomal location 


in priority 
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SFO ID NO 


rhrnmosnmal location 


in priority 
application 




31422 


llql3 


31627 


11 


32962 


9q32-33 


33712 


3p21.3, 3p21.31 


35185 


2 


38631 


15 


48823 


13q34-qter, 17 


49018 


7q22 


49133 


lq32 


49387 


17q21 


50015 


5q31 


51380 


Xq28 



TABLE HI 



SEO ID NO. in 


Tissue Distribution 


37 


1:11 


51 


A:55 B:4 C:l E:l F:30 G:19 H:9 


179 


F:16K:4 


180 


F:5 K:l 


183 


H:6 


201 


C:l 


326 


1:3 


362 


K:2 


440 


A:7 


452 


F:l G:5 


483 


G:4 


500 


1:1 


505 


A:2 


528 


K:2 


573 


F:3 K:l 


574 


A:l 


587 


H:l 1:3 


588 


F:6 K:2 


593 


A:2C:15F:2G:14 


599 


A:8 F:7 K:3 


603 


K:l 


621 


C:l 


628 


F:l K:3 


653 


B:6F:3 G:l 1:1 K:7 


670 


K:ll 


678 


F:l K:2 


693 


B:7G:7 


703 


F:9 K:4 


746 


A:9H:1 K:2 


770 


G:4 


775 


F:l K:13 


796 


G:l 


812 


A:13F:1 H:l 


940 


F:8 


988 


F:8 K:l 


996 


K:4 
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SEO ID NO. in 


Tissue Distribution 


1036 


A:13 1:11 


1064 


A:12 B:130 C:16 D:7 F:l G:16 


1151 


A:9 


1 190 


B:4 C:l H:7 1:6 


1458 


J:55 


1590 


A:l 


1853 


A:l G:l H:l 


1904 


H:l 


2028 


A:8 


2173 


B:l C:3 D:l G:l 


2368 
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B:7 D:5 H:39 
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D:l 
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H:l 
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2843 
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F:l 


2932 


D:l 
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H:2 
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F:4 K:3 


3326 


H:2 


3387 


H:7 
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1:5 


3501 


A:10 C:l 


3633 


A:6 H:3 


3678 


K:2 


3714 


F:4 K:l 


3796 


F:l K:l 


3801 


K:2 


3804 


B:l C:10 D:l G:2 1:11 


3892 


A:9 


3985 


B:l C:5 D:l H:l J:6 


4005 


F:6 1:2 K:2 


4063 


A:2 B:10 C:4 


4088 


K:6 


41 1 1 


A: 11 


4126 


F:3 K:l 


4172 


F:3 K:2 


4261 


H:2 


4340 


A:3 G:3 1:1 


4436 


F:3 


4609 


D:10 


4647 


D:3 


4660 


D:l 


A Z"ZT A 

4664 


B:48 C:2 H:2 1:3 


4678 


C:5 D:17 G:4 


4682 


G:l 


4687 


D:5 F:2 H:l 


4690 


D:3 F:l 1:1 K:l 


4694 


1:J 
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1:1 


4733 


D:l 


4807 


B:ll H:l 


4809 


1:1 


4830 


K:l 
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TicciiP Oi<;t riHiitinn 

I 1/1311 lUUUUII 


4855 


H:l 


4900 


A:l 


4908 


A:16 


4943 


F:6 K:2 


4947 


K:3 


4976 


J:2 


5000 


A:2 B:14 C:17 D:5 F:l G:9 H:3 1:3 


5002 


B:2 


5005 


C 31 Fl J*2 K*2 


501 1 


H:5 1:2 


5040 


B l C*2 D ll Gl J*l 


5058 


K:l 


5071 


F:l K:l 


5089 


1:1 


51 17 


H:7 


5141 


F:2 K:l 


5162 


A:l B:8 


5167 


1:1 


5178 


G:l 


5192 


F:14 K:3 


5214 


A10 B19 C*5 


5230 


A 3 B*6 C*9 D l F l G*4 H"3 1*2 


5240 


A:l 


5250 


A 3 B 39 C'29 D*3 H*3 1*1 Kl 


5262 


A 3 B'39 C*34 D*3 H*3 1*1 K l 


5270 


B:8 


5278 


D:l 


5358 


G:l 


5453 


A:2 C:6 


5494 


A:l D:l 


5499 


A:3 


5533 


A:2 B:7 C:l 


5563 


D:l F:l 


5609 


G:3 


5657 


G:l 


5691 


K:l 


5748 


A:3 B: 1 C:4 D: 1 G:30 H: 1 


5806 


A*2 B*7 C I F l 1*3 K*3 


5816 


B:5 


5824 


H:3 


5861 


B:16 C:27 D:13 F:7 H:21 1:12 K:4 


5885 


A:l 


5913 


K:l 


5947 


D:2 


5966 


G:ll 


5970 


C:4 


5974 


D:12 


5983 


A:5 


5985 


B:2 C:5 H:l 1:1 K:2 


601 1 


B:l 


6080 


B:l 1:1 


6081 


1:9 


6108 


F:5 


6159 


B:l 


6231 


A:l 
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SEO ID NO. in 


Tissue Distribution 
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SEO ID NO. in 


Tissue Distribution 


31554 


K:l 


31627 


C:l 


31726 


C:l 


31744 


F:l 


31790 


A:25 


32102 


G:l 


32473 


B:l 


32475 


B:l 


32962 


G:l 


33130 


K:3 


33712 


B:l C:l 


35005 


A:l 


35185 


C:l 


35258 


K:l 


35326 


B:l F:3 G:l H:l 1:1 K:3 


35597 


F:l K:l 


35912 


A:l 


35984 


F:l 


36122 


K:l 


37337 


F:l K:2 


38112 


K:l 


38220 


A:l 


38311 


A:l 


38631 


K:l 
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H:l 
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40163 


K:2 
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B:8D:1 H:39 
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B:8 D:l H:39 
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H:l 
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SEO ID NO. in 


Tissue Distribution 
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K:3 
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1:6 j 
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F:3 H:l 
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F:l K:l 


50763 
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50982 


A:2 
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51346 
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TABLE IV 



SEQ ID NO. in 

priority 

application 


Tissue source 


37 


adenocarcinoma^), carcinoid(2), testis(l), tonsil(l) 


51 


adipose tissue, white(4), cerebellum(3), cochlea(l), colon tumor rer+(2), dorsal root 
ganglion(l), hippocampus( 1 ), kidney(l), liver(l), malignant melanoma, metastatic to lymph 
node(l), muscle(2), normal leg muscle(l), parathyroid tumor(l), pectoral muscle (after 
mastectomy)(l), placenta(l), substantia nigra(l), total brain(l) 


201 


pancreas(l) 


326 


ovarian tumor( 1 ), uterus( 1 ) 


440 


brain cortex(l), carcinoid tumor(l) 


483 


pbl(l), adenocarcinoma(l), astrocytoma( 1 ), ovarian tumor(l), schizophrenic brain s-11 
frontal lobe(l) 


500 


colon(4), colon tumor rer+(l), pooled germ cell tumors(l) 


505 


total brain( 1 ) 


528 


brain(2), cerebellum(l), colon(l), ovarian tumor(6) 


573 


melanoma (mewo cell line)(l) 


587 


germinal center b cell(l), lymphoma(l), parathyroid tumor(l) 


593 


ovarian tumor( 1 ), placenta( 1 ) 


621 


placenta.(2) 


653 


2 pooled tumors (clear cell type)(2), anaplastic oligodendroglioma^), glioblastoma 
(pooled)(2) 


678 


ovarian tumor(l), prostate(l) 


693 


brain(l), placenta(l) 


703 


small cell carcinoma(2) 


746 


small cell carcinoma(l) 


770 


colon(l), frontal lobe(l), human pancreatic islets(l), normal leg muscle(l), ovarian tumor(l), 
pancreatic islet(2), senescent fibroblast 1), total brain(l) 


775 


anaplastic oligodendroglioma(l), frontal lobe(l) 


796 


adrenal adenoma(l), adrenal gland(l), breast tumor(3), placenta(2) 


812 


heart(l), brain(l), frontal lobe(3), neuroepithelial cells(l), retina(l), small cell carcinoma(l), 
total brain(l) 


988 


germinal center b cell(l) 


1064 


2 pooled tumors (clear cell type)(l), ewing's sarcoma(2), adenocarcinoma(5), anaplastic 
oligodendroglioma^), breast tumor(l), carcinoid(4), colon(3), colon tumor(l), colon tumor 
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rer+(2), frontal lobe(4), germinal center b cell(7), kidney tumor(l), lung tumor(l), metastatic 
prostate bone lesion(2), ovarian tumor(6), parathyroid tumor(7), pectoral muscle (after 
mastectomy)(12), placenta(l), pooled germ cell tumors(4), senescent fibroblast(2), squamous 
cell carcinoma from base of tongue(l), tumor, 5 pooled (see description)( 1 ) 


1151 


anaplastic oligodendroglioma^), carcinoid(l), colon tumor rer+(l), medulloblastoma(l), 
normal prostate(l), parathyroid tumor(l), tumor(l) 


1190 


colon(l) 


1853 


2 pooled high-grade transitional cell tumors(l), 2 pooled tumors (clear cell type)(l) 


2173 


2 pooled tumors (clear cell type)(2), b-cell, chronic lymphotic leukemia(2), cd34+, cd38- 
from normal bone marrow donor(6), adenocarcinoma(l), alveolar rhabdomyosarcoma(l), 
anaplastic oligodendroglioma(3), breast(l), carcinoid(l), cerebellum(2), cochlea(3), 
colon(ll), colon tumor rer+(2), dorsal root ganglion(l), early stage papillary serous 
carcinoma(l), epithelium (cell line)(l), follicular lymphoma(l), frontal lobe(15), germinal 
center b-cells(3), invasive adenocarcinoma^), kidney(l), kidney tumor(2), larynx(l), 
liver(l), lung carcinoma( 1 ), lymphoma(l), meningioma(l), moderately differentiated 
adenocarcinoma(2), moderately-differentiated adenocarcinoma( 1 ), muscle( 1 ), normal 
prostate(6), normal prostatic epithelial cells(l), oligodendroglioma(3), ovarian tumor(4), 
papillary serous carcinoma(l), pectoral muscle (after mastectomy)(23), pooled germ cell 
tumors(2), prostate(l), stem cell 34+/38+(2), thyroid(l), tumor, 5 pooled (see description)(4), 
two pooled squamous cell carcinomas(2) 


2553 


brain(2), tumor(l) 


2556 


brain(2), tumor(l) 


2755 


frontal lobe(l) 


2843 


b-cell, chronic lymphotic leukemia(2), anaplastic oligodendroglioma(5), carcinoid(l), 
placenta(l) 


2852 


frontal lobe(3) 


2932 


bone marrow(14), brain(l), hematopoietic from ami patient(l), liver(2), normal cortical 
stroma(l) 


3078 


frontal lobe(2) 


3280 


b-cell, chronic lymphotic leukemia(l), germinal center b cell(l), leiomyosarcoma(l), pooled 
germ cell tumors(l) 


3326 


muscle(3), normal leg muscle(l), parathyroid tumor(l) 


3387 


carcinoid( 1 ), pooled germ cell tumors(4) 


3439 


2 pooled tumors (clear cell type)(l), b-cell, chronic lymphotic leukemia(3), ewing's 
sarcoma(l), anaplastic oligodendroglioma(l), colon(2), glioblastoma (poo!ed)(l), mantle cell 
lymphoma(l), parathyroid tumor(l), senescent fibroblast 1 ), squamous cell carcinoma(l) 


3501 


anaplastic oligodendroglioma(l), breast(l), carcinoid(2), colon(l), colon tumor rer+(l), 
epithelium (cell line)(l), kidney tumor(l), ovarian tumor(l), parathyroid tumor(2), pooled 
germ cell tumors(4), senescent fibroblast 1 ), squamous cell carcinoma(l), testis(l), tumor(2) 


3633 


cerebellum(l), muscle(l), retina(4), small cell carcinoma(l), total brain(3) 


3714 


2 pooled tumors (clear cell type)(l), colon(l) 


3804 


heart(l), anaplastic oligodendroglioma(l), carcinoid(l), colon tumor(2), germinal center b 
cell(2), kidney tumor(l), lung tumor(3), lymphoid(2), moderately differentiated 
adenocarcinoma(l), muscle(l), ovarian tumor(3), pectoral muscle (after mastectomy)(l), 
squamous cell carcinoma(3) 




dioou^i ), total orain^zj 




anaplastic oligodendroglioma(l), breast(l) 


4005 


2 pooled tumors (clear cell type)(l), brain(2), germinal center b cell(l), muscle(l) 


4063 


cerebral cortex(l), brain(l), carcinoid(2), cerebellum(l), schizophrenic brain s-11 frontal 
lobe(l), senescent fibroblast 1 ), total brain(2) 


4111 


anaplastic oligodendroglioma(l) ) cerebellum(l), colon(l) 


4172 


retina(3) 


4340 


heart(l), blood(l), brain(2), colon(6), frontal lobe(9), invasive tumor (cell line)(l), 
melanocyte(l), neuroepithelial cells(l), senescent fibroblast 1 ), small cell carcinoma(2), total 
brain(3) 


4436 


b-cell, chronic lymphotic leukemia(l), bulk tumor(l), colon(24), early stage papillary serous 
carcinoma(3), lung carcinoma(l), pancreatic cancer(l), pooled germ cell tumors(l) 


4609 


blood(l) 


4647 


colon(l), liver(l) 
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4660 


adenocarcinoma( 1 ) 


4664 


2 pooled tumors (clear cell type)(5), adenocarcinoma^), brain(l), breast(l), colon(4), colon 
tumor rer+(l), frontal lobe(5), liver(l), neuroepithelial cells(l), normal prostate(l), ovarian 
tumor(l), ovary(l), total brain(l), tumor(l) 


4678 


colon tumor, rer+(2) 


4682 


2 pooled tumors (clear cell type)(4), anaplastic oligodendroglioma^), breast(3), carcinoid(l), 
glioblastoma (pooledXl), pooled germ cell tumors(l) 


4687 


2 pooled tumors (clear cell type)(l), carcinoid(3), colon(3), normal prostate(l), pooled germ 
cell tumors(l) 


4690 


carcinoid( 1 ), colon(3), normal prostate(l), pooled germ cell tumors(l) 


4694 


colon tumor, rer+(l) 


4733 


colon(3), liver(14), pancreatic islet(l) 


4807 


2 pooled tumors (clear cell type)(l), alveolar rhabdomyosarcoma(l), carcinoid(3), colon(2), 
normal prostate(2), normal prostatic epithelial cells(l), ovarian tumor(23), prostate(2), serous 
adenocarcinoma(l), total brain(2), tumor(l), tumor, 5 pooled (see description)(4) 


4809 


2 pooled tumors (clear cell type)(l), alveolar rhabdomyosarcoma(l), carcinoid(3), colon(2), 
normal prostate(2), normal prostatic epithelial cells(l), ovarian tumor(28), prostate(2), serous 
adenocarcinoma(l), total brain(2), tumor(l), tumor, 5 pooled (see description)(5) 


4855 


b-cell, chronic lymphotic leukemia(l), carcinoid(l) 


4900 


2 pooled tumors (clear cell type)(l), b-cell, chronic lymphotic leukemia(l), colon(l), liver(l), 
low-grade prostatic neoplasia(l), normal prostate(l), pooled germ cell tumors(l) 


4908 


alveolar rhabdomyosarcoma(l), colon(l), hemopoietic system(l), liver(l), malignant ascitic 
efiusion( 1 ), moderately-differentiated adenocarcinoma( 1 ), neuroepithelial cells( 1 ), 
parathyroid tumor(l), synovial membrane(l), uterus(l) 


4947 


parathyroid tumor(l), testis(2) 


5000 


2 pooled tumors (clear cell type)(3), b-cell, chronic lymphotic leukemia(2), anaplastic 
oligodendroglioma^), breast(4), carcinoid(2), colon tumor, rer+(l), germ cell tumor(l), 
germinal center b cell(6), invasive prostate tumor(l), lung tumor(l), metastatic prostate bone 
lesion(2), normal prostate(l), parathyroid tumor(2), pectoral muscle (after mastectomy )(3), 
pooled germ cell tumors(l), senescent fibroblast(4), stroma(l), thyroid(l), tumor, 5 pooled 
(see description)(2) 


5002 


pectoral muscle (after mastectomy)(l), senescent fibroblast 1 ) 


5005 


anaplastic ohgodendroglioma(l), germinal center b cell(l), lobullar carcinoma in situ(l), 
lymphoma(l), oligodendroglioma(l), pooled germ cell tumors(8) 


5011 


breast cancer(l), normal prostate(5), seminal vesicles(l) 


5040 


bone marrow(18), brain(l), hematopoietic from ami patient(l), liver(2), normal cortical 
stroma(l) 


5089 


parotid gland(l) 


5117 


adenocarcinoma^), breast(2), carcinoid(l), colon(3), colon tumor rer+(l), epithelium (cell 
line)(3), moderately differentiated adenocarcinoma(l), moderately-differentiated 
adenocarcinoma^), normal prostate(3), normal prostatic epithelial cells(2), placenta(l), 
prostate(3), small cell carcinoma(l), squamous cell carcinoma(2), squamous cell carcinoma 
from base of tongue( 1 ) 


5162 


hippocampus(l), schizophrenic brain s-1 1 frontal lobe(l) 


5167 


colon(3), colon tumor rer+(2), pooled germ cell tumors(l) 


5214 


kidney(l), ovarian tumor(l), pituitar gland(l), total brain(2) 


5230 


2 pooled tumors (clear cell type)(5), placenta(l), breast(l), breast tumor(l), carcinoid(2), 
colon(l), colon carcinoma(l), colon mucosa(l), colon tumor rer+(4), germinal center b 
cell(l), kidney tumor(l), normal prostate(5), papillary serous carcinoma(l), parathyroid 
tumor(3), pectoral muscle (after mastectomy)(3), pooled germ cell tumors(2), senescent 
fibroblast 1), squamous cell carcinoma from base oftongue(l) 


5240 


bone(l), colon(2), frontal lobe(2), glioblastoma (pooled)(l), liver(l), ovarian tumor(l), 
parathyroid tumor(l), tumor, 5 pooled (see description)( 1 ) 


OCA 

5250 


2 pooled tumors (clear cell type)(l), anaplastic oligodendroglioma^), brain(5), carcinoid(l), 
colon(2), colon tumor rer+(l), pooled germ cell tumors(l) 


5262 


2 pooled tumors (clear cell type)(l), anaplastic oligodendroglioma^), brain(5), carcinoid(l), 
colon(2), colon tumor rer+(l), pooled germ cell tumors(l) 


5270 


ovarian tumor(l) 


5278 


2 pooled tumors (clear cell type)(7), anaplastic oligodendroglioma^), breast(l), breast 
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tumor(4), carcinoid(3), colon(l), colon tumor rer+(2), glioblastoma (pooled)(l), liver(3), 
pooled germ cell tumors(6), thyroid(2) 


5358 


2 pooled tumors (clear cell type)(l), anaplastic oligodendroglioma(2), brain(2), colon(2), 
germinal center b eel 1(4), melanocyte(l), normal prostate(3), parathyroid tumor(2), pectoral 
muscle (after mastectomy)(3), pooled germ cell tumors(4), senescent fibroblast(2) 


5494 


2 pooled tumors (clear cell type)(2), cd34+, cd38- from normal bone marrow donor(2), 
anaplastic oligodendroglioma^), brain(2), colon(2), colon tumor rer+(l), early stage 
papillary serous carcinoma( 1 ), germinal center b cell(3), glioblastoma (poo!ed)(3), 
moderately-differentiated adenocarcinoma(l), normal prostate(l), normal prostatic epithelial 
cells(l), omentum(l), ovarian tumor(4), ovary(l), parathyroid tumor(3), pectoral muscle 
(after mastectomy)(l), pooled germ cell tumors(l), senescent fibroblast(2), stem cell 
34+/38+(l), synovial membrane(l), synovial sarcoma(l), tumor(l) 


5499 


2 pooled tumors (clear cell type)(l), brain(2), pancreatic islet(2) 


5533 


2 pooled tumors (clear cell type)(2), anaplastic oligodendroglioma^), breast(l), carcinoid(l), 
germinal center b cell(2), glioblastoma (pooled)(2), pooled germ cell tumors(2), senescent 
fibroblast(l), small cell carcinoma(2) 


5563 


colon(3), kidney(l), liver(l), neuroepithelial cells(l), normal prostatic epithelial cells(l), 
ovarian tumor(l), senescent fibroblast 1), total brain(l) 


5691 


2 pooled high-grade transitional cell tumors(l), 2 pooled tumors (clear cell type)(3), 
adenocarcinoma^), anaplastic oligodendroglioma^), carcinoid(3), colon(2), germinal center 
b cell(l), glioblastoma (pooled)(3), medulloblastoma(l), ovarian tumor(l), parathyroid 
tumor(2), pectoral muscle (after mastectomy )(2), prostate(l), three pooled meningiomas(l), 
total brain(l) 


5748 


bone(2), anaplastic oligodendroglioma(5), breast(l), carcinoid(l), colon tumor rer+(l), 
frontal lobe(2), germinal center b cell(l), glioblastoma (pooled)(l), ovarian tumor(4), 
parathyroid tumor(2), pooled germ cell tumors(l), senescent fibroblast 1 ), tumor, 5 pooled 
(see description)(l) 


5806 


2 pooled tumors (clear cell type)(4), female, 19 years old, normal leg muscle(2), alveolar 
rhabdomyosarcoma(l), anaplastic oligodendroglioma^), breast(l), carcinoid(3), germinal 
center b cell(l), glioblastoma (pooled)(2), normal prostate(l) 


5816 


anaplastic oligodendroglioma(l), frontal Iobe(4) 


5824 


2 pooled tumors (clear cell type)(2), b-cell, chronic lymphotic leukemia(2), 
adenocarcinoma(l), anaplastic oligodendroglioma(3), blood(l), breast(l), carcinoid(9), 
cerebellum(l), colon(3), fibrotheoma( 1 ), follicular lymphoma(l), germinal center b eel 1(2), 
glioblastoma (pooled)(2), kidney tumor(l), low-grade prostatic neoplasia(2), normal prostatic 
epithelial cells(l), ovarian tumor(2), parathyroid tumor(7), pooled germ cell tumors(l), 
senescent fibroblast(6), thyroid(l) 


5861 


adipose tissue, white(2), bone marrow from femur(l), carcinoid(l), epithelium(l), normal 
prostatic epithelial cells(l), pectoral muscle (after mastectomy)(l) 


5885 


2 pooled tumors (clear cell type)(l), female, 19 years old, normal leg muscle(l), anaplastic 
oligodendroglioma^), bone marrow stroma(l), colon tumor rer+(l), germinal center b 
cell(5), glioblastoma (pooled)(l), kidney tumor(l), melanocyte(2), moderately differentiated 
adenocarcinoma(l), moderately-differentiated adenocarcinoma(l), normal prostate(l), 
normal prostatic epithelial cells(2), ovarian tumor(2), parathyroid tumor(l), senescent 
fibroblast 1), three pooled meningiomas(l), tumor(l) 


5947 


2 pooled tumors (clear cell type)(4), adenocarcmoma(l), colon tumor(3), mtestine(l), liver(l) 


5970 


b-cell, chronic lymphotic leukemia(l), carcinoid(l), germinal center b cell(2), pooled germ 
cell tumors(3) 


5974 


blood(5), bone marrow(l), liver(4), reticulocyte( 1 ) 


5983 


2 pooled tumors (clear cell type)(l), adenocarcinoma(l), carcinoid( 1 ), germinal center b 
cell(4), medulloblastoma(l), pooled germ cell tumors(l), tumor, 5 pooled (see description)(l) 


5985 


2 pooled tumors (clear cell type)(3), b-cell, chronic lymphotic leukemia(2), anaplastic 
oligodendroglioma^), breast(4), carcinoid(2), colon tumor, rer+(l), germ cell tumor(l), 
germinal center b cell(6), invasive prostate tumor(l), lung tumor(l), metastatic prostate bone 
lesion(2), normal prostate(l), parathyroid tumor(2), pectoral muscle (after mastectomy)(3), 
pooled germ cell tumors(l), senescent fibroblast(4), stroma(l), thyroid(l), tumor, 5 pooled 
(see description)(2) 


6011 


brain(5), carcinoid(l), frontal lobe(l), lung carcinoma(l), retina(l) 


6080 


2 pooled tumors (clear cell type)(8), ewing's sarcoma(2), heart(l), adenocarcinoma(l), 
alveolar rhabdomyosarcoma(6), anaplastic oligodendroglioma^), aorta(l), breast(4), bulk 
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germ cell seminoma(2), colon(3), germinal center b cell(2), glioblastoma (pooled)(l), 
kidney(2), lung carcinoma(l), metastatic prostate bone lesion(4), normal prostate(l), normal 
prostatic epithelial cells(l), oligodendroglioma(l), ovary(2), parathyroid tumor(5), pectoral 
muscle (after mastectomy)(19), pooled germ cell tumors(l), prostate(l), senescent 
fibroblast(4), tumor, 5 pooled (see description)(l) 


6081 


brain(l), germinal center b cell(l), testis(l) 


6108 


2 pooled tumors (clear cell type)(l), carcinoid(l), germinal center b cell(l), parathyroid 
tumor(l) 


6159 


2 pooled tumors (clear cell type)(2), ewing's sarcoma(l), schwannoma tumor(l), adipose 
tissue, white(l), adrenal adenoma(3), amygdala(l), anaplastic oligodendroglioma(l), 
astrocytoma(2), bone marrow stroma(3), borderline ovarian carcinoma(l), cochlea(3), colon 
tumor(l), epithelium (cell line)(4), frontal lobe(3), germinal center b cell(l), human 
pancreatic islets(l), kidney tumor(3), larynx(l), liver(l), lung carcinoma(l), lung tumor(l), 
normal leg muscle(l), oligodendroglioma(l), ovarian tumor(8), parathyroid tumor(l), 
pectoral muscle (after mastectomy)(8), prostate tumor(l), senescent fibroblast^), small cell 
carcinoma(4) 


6231 


adenocarcinoma(2), breast(3), colon(4), colon tumor(2), endometrioid ovarian metastasis( 1 ), 
epithelium (cell line)(22), frontal lobe(l), germ cell tumor(3), invasive tumor (cell line)(13), 
moderately differentiated adenocarcinoma(l), normal prostatic epithelial cells(l), ovarian 
tumor(4), ovary(l), pancreatic islet(l), placenta(trophoblast)( 1 ), pooled germ cell tumors(2), 
squamous cell carcinoma( 1 ), synovial sarcoma(l), tumor(3), tumor, 5 pooled (see 
description)(2), two pooled squamous cell carcinomas(2) 


6238 


bone(l), colon(2), ovarian tumor(2), small cell carcinoma(2), total brain(l) 


6252 


2 pooled tumors (clear cell type)(l), ewing's sarcoma(6), adenocarcinoma(l), alveolar 
rhabdomyosarcoma(3), anaplastic oligodendroglioma(2), brain(l), breast(2), breast tumor(l), 
carcinoid(2), cochlea(2), germ cell tumor(l), glioblastoma (pooled)(l), kidney(l), kidney 
tumor(6), liposarcoma(l), lung carcinoma(l), lymphoma(l), metastatic prostate bone 
lesion(6), muscle(l), normal prostate(l), normal prostatic epithelial cells(l), ovarian 
tumor(2), ovary(4), parathyroid tumor(l), pectoral muscle (after mastectomy)(5), pooled 
germ cell tumors(4), prostate(l), renal cell tumor(l), senescent fibroblast(3), stem cells(l), 
tumor, 5 pooled (see description)(4) 


6290 


2 pooled tumors (clear cell type)(l), b-cell, chronic lymphotic leukemia(l), heart(3), 
lymphoma(l), adenocarcinoma(l), adipose tissue, white(l), brain(3), carcinoid(5), 
cerebellum(5), colon(8), epithelium (cell line)(l), frontal lobe(l), germinal center b-cells(l), 
kidney tumor(l), medulloblastoma(l), melanoma (mewo cell line)(l), normal prostate(l), 
omentum(l), ovarian tumor(l), pancreatic islet(l), placenta(3), pooled frontal lobe(l), 
retina(l), retinal fovaea(l), schizophrenic brain s-11 frontal lobe(l), senescent fibroblast 1), 
small cell carcinoma(3), synovial membrane(l), total brain(2) 


6322 


b-cell, chronic lymphotic leukemia(l), heart(4), lymphoma(l), adipose tissue, white(l), 
brain(3), carcinoid(4), cerebellum(5), colon(8), epithelium (cell line)(l), frontal lobe(l), 
germinal center b-cells(l), kidney tumor(l), melanoma (mewo cell line)(l), omentum(l), 
ovarian tumor(l), pancreatic islet(l), placenta(3), pooled frontal lobe(l), retinal fovaea(l), 
schizophrenic brain s-11 frontal lobe(l), senescent fibroblast 1), small cell carcinoma(3), 
synovial membrane(l), total brain(2) 


6329 


b-cell, chronic lymphotic leukemia(3), heart(3), lymphoma(2), adenocarcinoma(6), adipose 
tissue, white(l), adrenal adenoma(2), anaplastic oligodendroglioma(2), bone marrow 
stroma(l), brain(2), breast(l), carcinoid(7), cerebellum(5), colon(8), epithelium (cell line)(l), 
frontal lobe(l), germ cell tumor(2), germinal center b-cells(l), kidney tumor(l), larynx(l), 
lung tumor(l), medulloblastoma(l), melanoma (mewo cell line)(l), metastatic melanoma to 
bowel(l), moderately differentiated adenocarcinoma(l), normal prostate(2), omentum(l), 
ovarian tumor(l), pancreatic islet(2), papillary serous ovarian metastasis(l), parathyroid 
tumor(l), pectoral muscle (after mastectomy)(l), placenta(2), pooled frontal lobe(l), pooled 
germ cell tumors(3), prostate(l), retinal fovaea(l), schizophrenic brain s-11 frontal lobe(l), 
senescent fibroblast^ ), small cell carcinoma(5), squamous cell carcinoma(l), synovial 
membrane(l), total brain(2), tumor(l), tumor, 5 pooled (see description)(l), two pooled 
squamous cell carcinomas(l) 


6334 


2 pooled tumors (clear cell type)(3), b-cell, chronic lymphotic leukemia(2), anaplastic 
oligodendroglioma^), glioblastoma (pooled)(3), kidney(l), normal prostate(3), ovarian 
tumor(l), senescent fibroblast(2) 


6345 


testis(3) 
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6350 


2 pooled tumors (clear cell type)(2), b-cell, chronic lymphotic leukemia(l), bone(l), 
adenocarcinoma(l), anaplastic oligodendroglioma(lO), breast(l), breast tumor(l), colon(2), 
glioblastoma (pooled)(6), kidney(l), lung carcinoma(l), medulloblastoma(l), metastatic 
prostate bone lesion(l), normal prostate(l), pectoral muscle (after mastectomy)(4), pooled 
germ cell tumors(2), senescent fibroblast(2), squamous cell carcinoma(l), testis(l), tumor(l), 
tumor, 5 pooled_(see description)(l) 


6358 


2 pooled tumors (clear cell type)(2), anaplastic oligodendroglioma^), brain(4), carcinoid(2), 
colon(3), colon tumor rer+(l), germinal center b cell(4), glioblastoma (pooled)(2), normal 
prostate(l), normal prostatic epithelial cells(l), parathyroid tumor(2), pectoral muscle (after 
mastectomy)(l), senescent flbroblast(3) 


6384 


epithelium(l), meningioma(l), parathyroid tumor(2), senescent fibroblast 1) 


6400 


blood(2), brain(2), colon(6), parathyroid tumor(l), tumor(l) 


6431 


adenocarcinoma( 1 ) 


6453 


pooled germ cell tumors(3) 


6636 


2 pooled tumors (clear cell type)(l), ewing's sarcoma(2), adenocarcinoma(3), anaplastic 
oligodendroglioma^), breast tumor(l), carcinoid(4), colon(2), colon tumor(l), colon tumor 
rer+(2), frontal lobe(3), germinal center b cell(6), kidney tumor(l), ovarian tumor(4), 
parathyroid tumor(4), pectoral muscle (after mastectomy)( 1 0), placenta(l), pooled germ cell 
tumors(2), senescent fibroblast 1 ), squamous cell carcinoma from base of tongue(l), tumor, 5 
pooled (see description)( 1 ) 


6688 


b-cell, chronic lymphotic leukemia(3), alveolar rhabdomyosarcoma(l), anaplastic 
oligodendroglioma(l), breast(2), carcinoid(l), colon(l), colon tumor(l), four pooled pituitary 
adenomas(l), germinal center b cell(2), glioblastoma (pooled)(l), kidney tumor(2), 
moderately-differentiated adenocarcinoma(l), muscle(2), pectoral muscle (after 
mastectomy)(l), pooled germ cell tumors(3), senescent fibroblast(3), synovial sarcoma(l), 
testis(l), tumor, 5 pooled (see description)(3) 


6727 


frontal lobe(l), schizophrenic brain s-1 1 frontal lobe(l) 


6865 


frontal lobe(4), germinal center b cell(l), muscle(l), ovarian tumor(2), pectoral muscle (after 
mastectomy)(3), senescent fibroblast 1 ), small cell carcinoma(l), thyroid(l) 


6892 


2 pooled tumors (clear cell type)(5), adenocarcinoma(l), anaplastic oligodendroglioma(5), 
brain(5), breast(3), breast tumor(l), carcinoid(5), cerebellum(l), colon(3), colon tumor 
rer+(2), frontal lobe(5), germinal center b cell(3), glioblastoma (pooled)(2), moderately- 
differentiated adenocarcinoma(l), normal prostate(3), ovary(2), parathyroid tumor(3), 
pectoral muscle (after mastectomy)(l), placenta(l), pooled germ cell tumors(5), senescent 
fibroblast(3), tumor(l), tumor, 5 pooled (see description)(l) 


7000 


anaplastic oligodendroglioma^), frontal lobe(2) 


7041 


germinal center b cell(l), total brain(2) 


7533 


2 pooled tumors (clear cell type)(l), b-cell, chronic lymphotic leukemia(l), schwannoma 
tumor(l), anaplastic oligodendroglioma(l), astrocytoma(l), breast(l), cochlea(l), colon 
tumor rer+(2), germ cell tumor(l), germinal center b cell(2), glioblastoma (pooled)(3), 
hepatoma(4), medulloblastoma(l), metastatic prostate bone lesion(l), moderately- 
differentiated adenocarcinoma(l), ovarian tumor(l), prostate tumor(l), senescent 
fibroblast 1 ), squamous cell carcinoma(l), three pooled meningiomas(l) 


7535 


b-cell, chronic lymphotic leukemia(l), schwannoma tumor(l), anaplastic 
oligodendroglioma(l), astrocytoma( 1 ), breast 1), cochlea(l), colon tumor rer+(2), germ cell 
tumor(l), germinal center b cell(2), glioblastoma (pooled)(3), hepatoma(4), 
medulloblastoma(l), metastatic prostate bone lesion(l), moderately-differentiated 
adenocarcmoma(l), ovarian tumor(l), senescent fibroblast 1 ), squamous cell carcmoma(l) 


7697 


colon(l), invasive adenocarcinoma(3), liver(l), moderately-differentiated adenocarcinoma(l) 


8009 


alveolar rhabdomyosarcoma(2), anaplastic oligodendroglioma(l), carcinoid(9), colon(l), 
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colon tumor rer-i-(l), germinal center b cell(2), glioblastoma (pooled)(l), normal prostate(l), 
ovary(2), pooled germ cell tumors(2), thyroid(2), tumor(l) 


8078 


frontal lobe(2) 


8079 


b-cell, chronic lymphotic leukemia(2), bone(l), anaplastic oligodendroglioma(l), colon(l), 
pooled germ cell tumors(3) 


8097 


colon(l) 


8166 


2 pooled tumors (clear cell type)(5), adenocarcinoma(6), anaplastic oligodendroglioma(3), 
breast tumor(l), carcinoid(l), colon(l), epithelium (cell line)(l), glioblastoma (pooled)(2), 
lung tumor(2), metastatic melanoma to bowel(l), moderately-differentiated 
adenocarcinoma(l), normal prostate(l), ovary(l), papillary serous ovarian metastasis(l), 
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parathyroid tumor(l), pooled germ cell tumors(4), renal cell tumor(l), squamous cell 
carcinoma(5), synovial sarcoma(l), tumor(3), tumor, 5 pooled (see description)(3) 


8341 


2 pooled tumors (clear cell type)(l), anaplastic oligodendroglioma^), germinal center b 
cell(3), glioblastoma (pooled)(l), normal prostate(l), oligodendroglioma(l), pectoral muscle 
(after mastectomy)(2), tumor( 1 ) 


8534 


germinal center b cell(4) 


8666 


2 pooled high-grade transitional cell tumors(l), 2 pooled tumors (clear cell type)(5), b-cell, 
chronic lymphotic leukemia(2), adenocarcinoma(3), alveolar rhabdomyosarcoma^), 
amygdala(l), anaplastic oligodendroglioma(3), blood(2), breast(l), carcinoid(3), colon(3), 
germinal center b cell(4), muscle(l), normal prostate(l), ovarian tumor(6), parathyroid 
tumor(2), pectoral muscle (after mastectomy)(4), pheochromocytoma(l), placenta(l), 
senescent fibroblast 1 1), two pooled squamous cell carcinomas( 1 ) 


8671 


2 pooled high-grade transitional cell tumors(l), 2 pooled tumors (clear cell type)(3), b-cell, 
chronic lymphotic leukemia(l), adenocarcinoma^), alveolar rhabdomyosarcoma^), 
amygdala(2), anaplastic oligodendroglioma^), blood(2), breast(2), carcinoid(l), colon(2), 
germinal center b cell(5), liver cancer(l), metastatic prostate bone lesion(l), muscle(l), 
normal prostate(l), ovarian tumor(5), parathyroid tumor(2), pectoral muscle (after 
mastectomy)(5), placenta(2), pooled germ cell tumors(l), renal cell tumor(l), senescent 
fibroblast^) 


8968 


b-cell, chronic lymphotic leukemia(l), neuroepithelial cells(l), total brain(l) 


9406 


brain(2), colon(l), tumor(l) 


9407 


brain(2), colon(l), tumor(l) 


9668 


2 pooled tumors (clear cell type)(8), adenocarcinoma(l), anaplastic oligodendroglioma(l), 
breast carcinoma in situ(3), cerebellum(l), oligodendroglioma(l), papillary serous 
carcinoma(2), parathyroid tumor(26), pooled germ cell tumors(l) 


9679 


ovarian tumor(3), parathyroid tumor(l), uterus(l) 


9755 


2 pooled tumors (clear cell type)(l), adenocarcinoma(l), carcinoid(l), ovarian tumor(l), 
pooled germ cell tumors(3), senescent fibroblast(2), total brain(2) 


9868 


brain(l), senescent fibroblast 1) 


10044 


2 pooled high-grade transitional cell tumors(l), 2 pooled tumors (clear cell type)(3), b-cell, 
chronic lymphotic leukemia(l), adenocarcinoma^), alveolar rhabdomyosarcoma^), 
amygdala(2), anaplastic oligodendroglioma(2), blood(l), breast(2), carcinoid(l), colon(2), 
germinal center b cell(5), liver cancer(l), metastatic prostate bone lesion(l), normal 
prostate(l), osteosarcoma(l), ovarian tumor(5), parathyroid tumor(2), pectoral muscle (after 
mastectomy)(5), placenta(2), pooled germ cell tumors(l), renal cell tumor(l), senescent 
fibroblast^) 


10322 


2 pooled tumors (clear cell type)(9), b-cell, chronic lymphotic leukemia(2), anaplastic 
oligodendroglioma^), breast(l), carcinoid(3), colon(3), colon tumor rer+(l), germinal center 
b cell(l), glioblastoma (pooled)(4), kidney tumor(l), low-grade prostatic neoplasia(l), 
metastatic melanoma to bowel(l), normal prostate(2), ovary bulk tumor(l), parathyroid 
tumor(3), pectoral muscle (after mastectomy )(3), pooled germ cell tumors(2), senescent 
fibroblast(2), small cell carcinoma(2), synovial sarcoma(2), two pooled squamous cell 
carcinomas(3) 


10526 


2 pooled tumors (clear cell type)(2), ewing's sarcoma(4), adenocarcinoma(l), adipose tissue, 
white(l), alveolar rhabdomyosarcoma(3), anaplastic oligodendroglioma(5), brain(l), 
breast(4), breast tumor(l), carcinoid(2), colon(10), epithelium (cell line)(l), frontal lobe(7), 
germ cell tumor(l), glioblastoma (pooled)(l), invasive prostate tumor(l), kidney tumor(8), 
lymphoma(4), metastatic prostate bone lesion(2), muscle(3), normal prostate(2), normal 
prostatic epithelial cells(3), ovarian tumor(l), ovary(2), parathyroid tumor(4), pectoral 
muscle (after mastectomy)(26), placenta(l), pooled germ cell tumors(6), prostate(9), 
senescent fibroblast(6), small cell carcinoma(l), tumor, 5 pooled (see description)(2) 


10584 


2 pooled tumors (clear cell type)(2), b-cell, chronic lymphotic leukemia(4), anaplastic 
oligodendroglioma(6), carcinoid(2), colon(3), colon tumor rer+(4), colon tumor, rer+(2), 
ovarian tumor(4) 


10650 


pooled germ cell tumors(2) 


10739 


2 pooled tumors (clear cell type)(5), ewing's sarcoma(l), liver(l), alveolar 
rhabdomyosarcoma(l), anaplastic oligodendroglioma^), breast(3), breast tumor(l), 
carcinoid(9), colon(5), colon tumor rer+(3), frontal lobe(8), germinal center b cell(3), 
glioblastoma (pooled)(l 1), invasive prostate tumor(l), kidney(l), metastatic prostate bone 
lesion(l), moderately-differentiated adenocarcinoma(l), normal prostate(2), normal prostatic 
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epithelial cells(l), parathyroid tumor(4), pectoral muscle (after mastectomy)(6), placenta(l), 
pooled germ cell tumors(l), tumor(l), tumor, 5 pooled (see description)(l) 


10743 


2 pooled tumors (clear cell type)(5), ewing's sarcoma(l), liver(l), schwannoma tumor(l), 
alveolar rhabdomyosarcoma(l), anaplastic oligodendroglioma^), breast(3), breast tumor(l), 
carcinoid(lO), colon(5), colon tumor rer+(3), germinal center b cell(3), glioblastoma 
(pooled)(l 1), invasive prostate tumor(l), kidney(l), liver(l), metastatic prostate bone 
lesion( 1 ), moderately-differentiated adenocarcinoma( 1 ), muscle( 1 ), normal prostate(2), 
normal prostatic epithelial cells(l), ovarian tumor(4), parathyroid tumor(4), pectoral muscle 
(after mastectomy)(6), placenta(l), pooled germ cell tumors(2), thyroid(l), tumor(l), tumor, 
5 pooled (see description)(l) 


10744 


heart(2), four pooled pituitary adenomas(l), frontal lobe(3), glioblastoma (pooled)(l), 
liver(l), moderately-differentiated adenocarcinoma( 1 ), peripheral blood(l), retina(l) 


10880 


2 pooled tumors (clear cell type)(3), ewing's sarcoma(l), adenocarcinoma^), brain(l), 
carcinoid(8), germinal center b cell(l), glioblastoma (pooled)(5), kidney tumor(l), normal 
prostate(3), oligodendroglioma(l), parathyroid tumor(ll), senescent fibroblast(4), three 
pooled meningiomas(l), total brain(3) 


10942 


2 pooled tumors (clear cell type)(8), adenocarcinoma(l), anaplastic oligodendroglioma( 1 ), 
breast carcinoma in situ(3), cerebellum(l), oligodendroglioma(l), papillary serous 
carcinoma(2), parathyroid tumor(26), pooled germ cell tumors(l) 


11019 


epidermis(l), ewing's sarcoma(2), heart(l), schwannoma tumor(l), adenocarcinoma^), 
alveolar rhabdomyosarcoma^), amygdala(l), brain(4), carcinoid(l), colon mucosa(l), 
endometrioid ovarian metastasis( 1 ), epithelium (cell line)(2), frontal lobe(l), germ cell 
tumor(3), heart(l), invasive prostate tumor(2), kidney(2), kidney tumor(2), liposarcoma(l), 
liver(24), mantle cell lymphoma(3), medulloblastoma(l), metastatic prostate bone lesion(6), 
moderately-differentiated adenocarcinoma(l), muscle(l), normal prostatic epithelial cells(5), 
ovary(l), papillary serous ovarian metastasis(l), parathyroid tumor(l), placenta(2), pooled 
germ cell tumors(2), prostate(l), thyroid(l), tumor(l), tumor, 5 pooled (see description)(l), 
uterus(4) 


11278 


2 pooled tumors (clear cell type)(4), bone(4), heart(2), anaplastic oligodendroglioma^), 
carcinoid(3), colon tumor rer+(l), epithelium (cell Iine)(l), frontal lobe(12), kidney(2), 
liposarcoma(l), liver(6), lung carcinoma(l), muscle(6), normal prostate(2), ovary(2), 
parathyroid tumor(6), pectoral muscle (after mastectomy )(3), schizophrenic brain s-1 1 frontal 
lobe(l), senescent fibroblast(6), small cell carcinoma(4), synovial membrane(l) 


11342 


2 pooled tumors (clear cell type)(3), blood(l), germinal center b cell(4), normal 
epithelium(l), pooled germ cell tumors(l), tumor, 5 pooled (see description)( 1 ) 


11735 


adenocarcinoma(l), breast(2), colon(l), frontal lobe(2), placenta(l), pooled germ cell 
tumors(8) 


12039 


testis(l) 


12043 


spleen(l) 


12048 


2 pooled tumors (clear cell type)(l), anaplastic oligodendroglioma(l), brain(2), colon tumor 
rer+(2), spleen(2) 


12098 


astrocytoma( 1 ), ovarian tumor(l) 


12202 


2 pooled tumors (clear cell type)(2), adenocarcinoma(l), cochlea(l), germinal center b 
cell(2), pituitary(l) 


12243 


alveolar rhabdomyosarcoma(2), anaplastic oligodendroglioma(l), carcinoid(9), colon(l), 
colon tumor rer+(l), germinal center b cell(2), glioblastoma (pooled)(2), metastatic prostate 
bone lesion(l), normal prostate(2), ovary(2), pooled germ cell tumors(2), thyroid(2), 
tumor(l) 


12263 


anaplastic oligodendroglioma(l), carcinoid(l), germinal center b cell(2), invasive 
adenocarcinoma(l), liver and spleen(2), papillary serous carcinoma(l) 


12490 


2 pooled tumors (clear cell type)(3), b-cell, chronic lymphotic leukemia(l), ewing's 
sarcoma(l), adenocarcinoma(2), anaplastic oligodendroglioma(l), germinal center b cell(2), 
kidney tumor(l), larynx(l), mantle cell lymphoma(l), medulloblastoma(l), melanocyte(2), 
moderately-differentiated adenocarcinoma(l), ovarian tumor(12), parathyroid tumor(2), 
pooled germ cell tumors(4), prostate(2), tumor, 5 pooled (see description)(2) 


12604 


2 pooled tumors (clear cell type)(12), b-cell, chronic lymphotic leukemia(4), bone(l), ewing's 
sarcoma(l), adenocarcinoma(l), anaplastic oligodendroglioma(lO), carcinoid(3), colon(3), 
colon tumor rer+(l), germinal center b cell(l), glioblastoma (pooled)(3), invasive prostate 
tumor(l), kidney(l), larynx(l), muscle(l), normal leg muscle(l), normal prostate(3), ovarian 
tumor(l), parathyroid tumor(12), pectoral muscle (after mastectomy)(28), pooled germ cell 
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tumors(3), prostate(3), renal cell tumor(l), senescent fibroblast 12), skeletal muscle(l) 


12657 


adenocarcinoma(l), carcinoid(2), colon tumor, rer+(2) 


12788 


2 pooled tumors (clear cell type)(3), adenocarcinoma(l), adrenal adenoma(l), alveolar 
rhabdomyosarcoma(l), carcinoid(l), colon(l), colon tumor rer+(l), germinal center b cell(l), 
glioblastoma (pooled)(2), ovary(l), parathyroid tumor(l), senescent fibroblast(2), tumor, 5 
pooled (see description)(l) 


12901 


anaplastic oligodendroglioma^), carcinoid( 1 ), germinal center b cell(6), invasive tumor (cell 
line)( 1 ), parathyroid tumor( 1 ) 


12907 


adipose tissue, white(l), carcinoid(l), parathyroid tumor(2), pooled frontal lobe(l), tumor, 5 
pooled (see description)( 1 ) 


13013 


invasive tumor (cell line)(l), parathyroid tumor(3) 


13202 


2 pooled tumors (clear cell type)(l), b-cell, chronic lymphotic leukemia(3), breast(l), breast 
tumor(l), carcinoid(2), colon tumor rer+(l), colon tumor, rer+(l), germinal center b cell(l), 
kidney tumor(l), lung carcinoma(l), lung tumor(l), meningioma(2), moderately 
differentiated adenocarcinoma(l), normal prostate(3), ovarian tumor(l), squamous cell 
carcinoma( 1 ), tumor, 5 pooled (see description)(l) 


13229 


cochlea(l), frontal cortex(l), pooled germ cell tumors(l) 


13256 


2 pooled tumors (clear cell type)(2), adenocarcinoma^), anaplastic oligodendroglioma(l), 
parathyroid tumor(l), pooled germ cell tumors(3) 


13285 


2 pooled tumors (clear cell type)(6), carcinoid(l), colon tumor rer+(l), kidney(l), liver(l) 


26710 


2 pooled tumors (clear cell type)(l), pooled germ cell tumors(l) 


27273 


anaplastic oligodendroglioma(l), cerebellum(l), germinal center b cell(2), moderately- 
differentiated adenocarcinoma(l), pectoral muscle (after mastectomy)(2) 


27301 


astrocytoma(2), ovarian tumor(l) 


27336 


2 pooled tumors (clear cell type)(2), b-cell, chronic lymphotic leukemia(6), 
adenocarcinoma(l), anaplastic oligodendroglioma(9), carcinoid(6), colon(l), colon tumor 
rer+(l), colon tumor, rer+(6), frontal lobe(l), germinal center b cell(2), glioblastoma 
(pooled)(l), pooled germ cell tumors(4), senescent fibroblast(2) 


27361 


colon(l) 


27374 


b-cell, chronic lymphotic leukemia(l), pooled germ cell tumors(5) 


27627 


2 pooled tumors (clear cell type)(l), anaplastic oligodendroglioma(l), frontal lobe(l), 
germinal center b eel 1(1), glioblastoma (pooled)(2), ovarian tumor(l) 


27697 


skeletal muscle(2), thyroid(l) 


27877 


carcinoid(l), colon(l), germinal center b cell(l) 


29469 


frontal lobe(4), germinal center b cell(l), ovarian tumor(2), pectoral muscle (after 
mastectomy)(3), senescent fibroblast 1 ), small cell carcinoma( 1 ), thyroid(l) 


29557 


ewing's sarcoma(l), larynx(l), medulloblastoma(l), moderately-differentiated 
adenocarcinoma(l), ovarian tumor(7), parathyroid tumor(2), pooled germ cell tumors(l), 
prostate(l) 


29673 


heart(l), colon tumor(l), kidney tumor(l), lung tumor(3), lymphoid(2), moderately 
differentiated adenocarcinoma(l), muscle(l), ovarian tumor(3), squamous cell carcinoma(l) 


29814 


brain(2), colon(l), pancreatic islet(l) 


30218 


small cell carcinoma(2) 


30446 


adenocarcinoma(2), adrenal adenoma(3), colon tumor(l), glioblastoma (pooled)(2), kidney 
tumor(l), ovarian tumor(l), parathyroid tumor(l), prostate(l), small cell carcinoma(l) 


30583 


carcinoid(l), ovarian tumor(l), pooled germ cell tumors(2) 


30719 


2 pooled tumors (clear cell type)(3), b-cell, chronic lymphotic leukemia(l), breast(2), 
colon(l), colon tumor, rer+(l), germinal center b cell(l), glioblastoma (pooled)(3), liver(2), 
normal prostate(2), pooled germ cell tumors(lO) 


31356 


pooled germ cell tumors(l) 


31422 


2 pooled tumors (clear cell type)(l), b-cell, chronic lymphotic leukemia(2), bone(l), 
anaplastic oligodendroglioma(l 1), bone marrow stroma(l), brain(2), breast(l), carcinoid(20), 
colon(6), colon tumor, rer+(l), germinal center b cell(2), glioblastoma (pooled)(3), kidney 
tumor(l), larynx(l), medulloblastoma(l), normal prostate(l), ovarian tumor(l), parathyroid 
tumor(l), pectoral muscle (after mastectomy )(2), pooled germ cell tumors(2), senescent 
fibroblast(5), tumor(l), tumor, 5 pooled (see description)(3) 


31554 


2 pooled tumors (clear cell type)(l), carcinoid(l), germinal center b cell(4) 


31627 


2 pooled tumors (clear cell type)(l), alveolar rhabdomyosarcoma( 1 ), anaplastic 
oligodendroglioma^), frontal lobe(7), normal prostate(l), oligodendroglioma(l), parathyroid 
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tumor(l), pectoral muscle (after mastectomy)(3), pooled germ cell tumors(l) 


31744 


adenocarcinoma(l ) 


31790 


ovarian tumor(l), synovial membrane(l) 


32102 


2 pooled tumors (clear cell type)(l), b-cell, chronic lymphotic leukemia(2), heart(3), 
lymphoma(2), adenocarcinoma^), adipose tissue, white(l), anaplastic oligodendroglioma^), 
brain(2), carcinoid(9), cerebellum(5), colon(7), epithelium (cell line)(l), frontal lobe(4), germ 
cell tumor(l), glioblastoma (pooled)(3), juvenile granulosa tumor(l), kidney tumor(2), 
medulloblastoma(l), melanoma (mewo cell line)(l), metastatic melanoma to bowel(l), 
moderately-differentiated adenocarcinoma(l), normal epithelium(l), normal prostate(3), 
omentum(l), ovarian tumor(2), pancreatic islet(l), parathyroid tumor(4), pectoral muscle 
(after mastectomy )(2), placenta(l), pooled frontal lobe(l), pooled germ cell tumors(l), retinal 
fovaea(l), schizophrenic brain s-11 frontal lobe(l), senescent fibroblast(5), small cell 
carcinoma(4), synovial membrane(l), total brain(2), tumor, 5 pooled (see description)( 1 ), two 
pooled squamous cell carcinomas(2) 


32473 


2 pooled tumors (clear cell type)(3), heart(l), adipose tissue, white(l), alveolar 
rhabdomyosarcoma^), anaplastic oligodendroglioma^), breast(2), colon(2), colon tumor 
rer+(l), invasive prostate tumor(l), kidney(l), metastatic prostate bone lesion(l), normal 
prostate(l), pectoral muscle (after mastectomy )(22), pooled germ cell tumors(2), senescent 
fibroblast 1) 


32475 


2 pooled tumors (clear cell type)(3), heart(l), adipose tissue, white(l), alveolar 
rhabdomyosarcoma^), anaplastic oligodendroglioma^), breast(2), colon(2), colon tumor 
rer+(l), kidney(l), metastatic prostate bone lesion(l), normal prostate(l), pectoral muscle 
(after mastectomy)(22), pooled germ cell tumors(2), senescent fibroblast 1) 


33712 


2 pooled tumors (clear cell type)(l), adenocarcinoma(2), anaplastic oligodendroglioma^), 
brain frontal cortex(2), breast(l), germinal center b cell(4), juvenile granulosa tumor(l), 
normal prostatic epithelial cells(2), ovarian tumor(l), three pooled meningiomas(2) 


35005 


bone(l), pooled germ cell tumors(l) 


35185 


spinal cord( 1 ) 


35326 


brain(l), breast(2), carcinoid(l), colon(l), germinal center b cell(2), human pancreatic 
islets(l), liver(l), melanocyte(l), ovarian tumor(l), placenta(l), total brain(l) 


37337 


anaplastic oligodendroglioma(2), colon(2), glioblastoma (pooled)(3), juvenile granulosa 
tumor(l), pooled germ cell tumors(3), tumor, 5 pooled (see description)( 1 ), uterus(2) 


38220 


2 pooled tumors (clear cell type)(l), b-cell, chronic lymphotic leukemia(l), 
adenocarcinoma(2), breast(l), carcinoid(l), frontal lobe(l), germinal center b cell(2), normal 
prostate(l), normal prostatic epithelial cells(l), pooled germ cell tumors(3), total brain(2) 


38311 


bone(l), frontal lobe(l), germinal center b cell(l), moderately differentiated 
adenocarcinoma( 1 ), placenta( 1 ) 


38631 


brain( 1 ) 


38749 


2 pooled tumors (clear cell type)(4), breast(l), cochlea(l), colon tumor rer+(l), liver(l), 
moderately differentiated adenocarcinoma^ ), normal prostate(l), pancreas (with no medical 
abnormalities)( 1 ), parathyroid tumor( 1 ), prostate( 1 ) 


40975 


brain(2), tumor(l) 


40991 


brain(2), tumor( 1 ) 


44053 


2 pooled tumors (clear cell type)(l), bone(l), total brain(l) 


45179 


colon(l), metastatic prostate bone lesion(l) 


45274 


2 pooled tumors (clear cell type)(6), cd34+, cd38- from normal bone marrow donor(l), 
ewing's sarcoma(l), heart(l), lung(l), adenocarcinoma^), adrenal adenoma(l), alveolar 
rhabdomyosarcoma(3), anaplastic oligodendroglioma^), borderline ovarian carcinoma(17), 
brain(3), breast(4), breast carcinoma in situ(20), bronchioalveolar carcinoma(4), carcinoid(4), 
cerebellum(l), colon(4), colon tumor rer+(3), early stage papillary serous carcinoma(3), 
glioblastoma (pooled)(2), invasive adenocarcinoma(8), invasive carcinoma(2), lobullar 
carcinoma in situ(2), low-grade prostatic neoplasia(l), lung carcinoma(l), lung tumor(l), 
normal prostate(2), normal prostatic epithelial cells(l), oligodendroglioma(3), ovarian 
tumor(2), papillary serous carcinoma( 1 8), papillary serous ovarian metastasis(l), parathyroid 
tumor(5), pectoral muscle (after mastectomy)(7), pooled germ cell tumors(2), senescent 
fibroblast^), stem cell 34+/38+(2), thymus(l) 


46679 


2 pooled tumors (clear cell type)(4), anaplastic oligodendroglioma(l), pectoral muscle (after 
mastectomy)( 1 ), pooled germ cell tumors(l) 


48024 


2 pooled tumors (clear cell type)(3), heart(l), adipose tissue, white(l), alveolar 
rhabdomyosarcoma^), anaplastic oligodendroglioma^), breast(2), colon(2), colon tumor 
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rer+(l), invasive prostate tumor(l), kidney(l), metastatic prostate bone lesion(l), normal 
prostate(l), pectoral muscle (after mastectomy)(22), pooled germ cell tumors(2), senescent 
fibroblast 1) 


48548 


liver(4) 


48823 


2 pooled tumors (clear cell type)(l), bone(l), alveolar rhabdomyosarcoma^), breast(l), 
epithelium (cell line)(l), glioblastoma (pooled)(l), lung(l), lung tumor(l), parathyroid 
tumor(l), pectoral muscle (after mastectomy)(l), renal cell tumor(l) 


48901 


germinal center b cell(l), pooled germ cell tumors(l) 


49018 


anaplastic oligodendroglioma(l), brain(l), breast(l), frontal lobe (see description)(l), 
testisip ) 


49034 


adipose tissue, white(l), pancreas (with no medical abnormalities)(l) 


49133 


retina(l) 


49140 


2 pooled tumors (clear cell type)(l), adenocarcinoma^ ), carcinoid(6), colon tumor rer+(l), 
glioblastoma (pooled)(2), moderately differentiated adenocarcinoma(l), normal prostate(l), 
pancreatic islet(2), parathyroid tumor(3) 


49387 


colon(l), pooled germ cell tumors(l) 


49416 


i a' l" ii i* y i x * ■ i/ix i ii y^x r* .till /"^x • 1 

anaplastic ohgodendroglioma(l), carcinoid(3), cerebellum(2), frontal lobe(2), germmal center 
b cell(2), parathyroid tumor(l), retina(l), tumor(l) 


49493 


neuroepithelial cells(l), ovarian tumor(4), thyroid(l) 


49640 


>^ • -% _ si ii- v y -t \ 1 x • 1* 1 1 1" f * "\ 1 • f f\ "v i - 

2 pooled tumors (clear cell type)(l), anaplastic ohgodendroglioma(l), brain(2), colon tumor 
rer+(l), spleen(2) 


49863 


l ii i i 1 „ • 1 1 • ✓ ■« x • ■ i/i \ • i . i ii/n\ i j 

b-cell, chronic lymphotic leukemia(l), carcinoid(l), germinal center b cell(2), pooled germ 
cell tumors(3) 


49871 


1 11 * * » 1 i • 1 1 * ✓ 1 X * * J / 1 X *1 A I 11 /(%\ 1 J 

b-cell, chronic lymphotic leukemia(l), carcinoid(l), germinal center b cell(2), pooled germ 
cell tumors(3) 


50185 


total brain(l) 


50 /o3 


cerebral cortex(3), brain(2), colon(l), colon tumor rer+(l), ovarian tumor(l), schizophrenic 

ni*Qin c _ 1 1 TTV\irf"ci 1 1 rx£^i 1 1 
Uldlll 3-11 llOIllal lUUC^ 1 ) 




O rninlf»H tnmnrc (o\(*ziv c*&\\ \\rmp'\(0\ h-r»p11 rYwcstxAO IvtnnhfYfip IpiiWf^mijitf"^ parpinniHf 1 ^ 

aL. UUU1CU lUlllVJlo 1 V<1CCU WCll lylJ^JyjLJy \J d HUll lis 1 YlllLFIlVJtlV IVUIxvIlllCl^Jy, VrlllUlU v 1 ^, 

cochlea(l), germinal center b cell(2) 


51130 


adenocarcinoma(l) 


51212 


placenta.(2) 


51346 


bone(l), small cell carcinoma(2) 


51380 


placenta(4) 


51400 


liver(l) 


51954 


germinal center b cell(l), total brain(l) 


52076 


normal prostatic epithelial cells(l) 



TABLE V 



SEQ ID NO. in 

priority 

application 


Low frequency expression 


High frequency expression 


37 


salivary gland 




51 


fetal brain, fetal kidney 


brain, salivary gland, liver 


179 


liver 




180 


liver 




183 


prostate 




326 


salivary gland 




362 


testis 




440 


brain 




452 


placenta 




483 


placenta 




500 


salivary gland 




528 


testis 
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571 


liver 




587 


c c> 1 1 \/ ci r~v olnnH 
Sail Val y gldllU 




588 

JOO 


1 1VPI* 
11 VCI 




501 


fi=»tn1 ViHnf»\/ r»l5»f*f»tita 

IClal MUllCj, pidWCULd 




j j ? 


1i\/f*t* 

11 VCI 




628 


tf*QtiQ 
LCOLld 




651 


testis 




670 


testis 




691 


nljiff^ntJi 
pidccnid 




703 


liver 




746 


Krain 
iji din 




770 


ntafpnta 
pidccuid 




775 

/ / j 


ICS113 




81? 


hmin 
L/i alii 




040 


I \\IC*T 

II VCI 




Q88 


I ivf*t* 

II VCI 






tactic 




1016 


cnlivsirv olnnH hrain 

odllVdljr t^ldJJU, Ululll 




1 UUt 


UI dlll 7 11 VCI, piUoldlC 


fptal hrain 

ICtdl Ul dill 


1151 
1 1 j i 


hrj* in 

UI dill 




1 1 00 


C5*1 iv5H"v olnnH nrriQtJil'P 

odllVdljr gldllU, pi(Jj>tdlC 




1 458 


blUIIldCIl/ UUCblUlC 




1 004 


pi UoldlC 




9078 


brain 




9168 


Lcaiio 




9551 


pi UoLdlC 




7556 


pi VJotdLC 




9600 


tactic 




9755 

L/JJ 


pi UoldlC 




9800 


I i \/f*r tactic 

II VCI j ICoLlo 




9841 


ht*5i i n 

UI dill 




9019 


f*f»t5»l livf»r 

lCldl 11 VCI 




1078 


pi UjLUIC 




1980 


liver 




1196 


pi USlalC 




1187 


pi UolaiC' 




1410 


cq 1 ivarv olanH 
idiivdi y c^idiiu 




1501 


Kra in 

Ul dill 




3633 


brdin 




1678 


testis 




3714 


liver 




3801 


testis 




3804 


<;alivarv ffland fetal kidnev 




3892 


brain 




3985 


stomach/intestine 




4005 


liver 




4063 


■fiat a 1 hrain 




4088 






41 1 1 


brain 




4196 


liver 




4172 


liver 




4261 


prostate 




4436 


liver 




4609 


fetal liver 




4647 


fetal liver 




4660 


fetal liver 
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4664 


fetal brain 




4678 


fetal liver 




4687 


fetal liver 




4690 


fetal liver 




4694 


salivary gland 




4696 


salivary gland 




4733 


fetal liver 




4807 


fetal brain 




4809 


salivary gland 




4855 


prostate 




4908 


brain 




4943 


liver 




4947 


testis 




4976 


stomach/intestine 




5000 


fetal kidney 




5005 


fetal kidney 




5011 


prostate 




5040 


fetal liver 




5089 


salivary gland 




5117 


prostate 




5141 


liver 




5162 


fetal brain 




5167 


salivary gland 




5192 


liver 




5214 


fetal brain 




5250 


fetal kidney, fetal brain 




5262 


brain 


fetal kidney, fetal brain 


5270 


fetal brain 




5278 


fetal liver 




5453 


fetal kidney 




5494 


fetal liver 




5499 


brain 




5533 


fetal brain 




5563 


fetal liver 




5609 


placenta 




5748 


placenta 




5816 


fetal brain 




5824 


prostate 




5861 


fetal liver, prostate, fetal kidney, 
salivary gland 




5947 


fetal liver 




5966 


placenta 




5970 


fetal kidney 




5974 


fetal liver 




5983 


brain 




5985 


fetal kidney 




6080 


salivary gland 




6081 


salivary gland 




6108 


liver 




6238 


placenta, brain 




6252 


placenta 




6334 


liver 




6345 


brain 




6358 


fetal brain 




6636 


fetal brain 




6688 


fetal brain 
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6727 


testis 




6865 


salivary gland 




6892 


stomach/intestine, fetal brain 




7041 


liver 




7533 


placenta 




7535 


brain 




7697 


fetal liver 




7712 


testis 




8078 


testis 




8097 


salivary gland 




8166 


placenta, fetal liver 




8262 


prostate 




8534 


testis 




8666 


placenta 




8744 


prostate 




8968 


brain 




8994 


testis 




9297 


fetal liver 




9332 


testis 




9406 


prostate 




9407 


prostate 




9668 


testis 




9755 


testis 




9868 


liver 




10044 


fetal liver, placenta 




10526 


prostate 




10650 


testis 




10743 


stomach/intestine 




10744 


salivary gland 




10761 


prostate 




10880 


fetal liver 




10942 


brain 




11019 


liver 




11278 


salivary gland 




11562 


testis 




11688 


testis 




11735 


fetal brain 




11813 


brain 




12043 


brain 




12202 


testis 




12220 


fetal kidney 




12243 


brain, testis, liver 


fetal brain, fetal liver 


12263 


prostate 




12276 


brain 




12604 


fetal kidney 




12657 


fetal kidney, stomach/intestine 




12788 


testis 




12901 


placenta 




12907 


salivary gland 




13013 


fetal liver 




13229 


brain 




13256 


testis 




13267 


salivary gland 




13285 


fetal brain 


fetal kidney, stomach/intestine 


26638 


fetal liver 




26710 


brain 
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26726 


fetal kidney 




26786 


brain 




27084 


testis 




27273 


fetal kidney 




27361 


liver 




27374 


fetal liver 




27877 


salivary gland 




28413 


prostate 




28517 


testis 




28518 


placenta 




29120 


salivary gland 




29673 


fetal liver 




30218 


liver 




30446 


prostate 




30477 


placenta 




30583 


testis 




30719 


fetal brain, salivary gland 




31356 


prostate 




31790 


brain 




33130 


testis 




38749 


prostate 




38890 


placenta 




40163 


testis 




40975 


prostate 




40991 


prostate 




42896 


prostate 




44053 


stomach/intestine, fetal kidney 




45091 


stomach/intestine 




45179 


prostate 




45274 


fetal kidney 




46679 


fetal liver 




48024 


fetal kidney 




48603 


liver, testis 




48671 


testis 




48823 


fetal kidney 




48901 


salivary gland 




49018 


stomach/intestine 




49034 


salivary gland 




49133 


liver 




49261 


testis 




49387 


liver 




49416 


fetal brain 




49426 


stomach/intestine 




49493 


salivary gland 




49640 


brain 




49863 


brain 




49871 


brain 




50015 


testis 




50049 


testis 




50112 


testis 




50185 


salivary gland 




50241 


liver 




50763 


brain 




51130 


testis 




51212 


stomach/intestine 




51400 


liver 
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WHAT IS CLAIMED IS: 

I . An isolated polynucleotide, said polynucleotide comprising a nucleic acid sequence encoding: 

i) A polypeptide comprising the amino acid sequence shown as SEQ 
5 IDNO:305; 

ii) a polypeptide comprising any one of the amino acid sequences 
shown as SEQ ID NOs:170-304, 306-338, 456-560, 785-918; or 

iii) a biologically active fragment of any of said polypeptides. 

10 2. The polynucleotide of claim 1, wherein said polypeptide comprises a signal peptide. 

3. The polynucleotide of claim 1, wherein said polypeptide is a mature protein. 

4. The polynucleotide of claim 1, wherein said polynucleotide comprises any one of the nucleic 
15 acid sequences shown as SEQ ID NOs: 1-169, 339-455, or 561-784. 

5. The polynucleotide of claim 1, wherein said polynucleotide is operably linked to a promoter. 

6. An expression vector comprising the polynucleotide of claim 5. 

20 

7. A host cell recombinant for the polynucleotide of claim 1 . 

8. A non-human transgenic animal comprising the host cell of claim 7. 

25 9. A pharmaceutical composition comprising the polynucleotide of claim 1, and a 
pharmaceutical ly acceptable carrier. 

10. A method of making a GENSET polypeptide, said method comprising 

a) providing a population of host cells comprising the polynucleotide of 
30 claim 5; and 

b) culturing said population of host cells under conditions conducive to the 
production of said polypeptide within said host cells. 

I I . The method of claim 10, further comprising purifying said polypeptide from said population of 
35 host cells. 
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12. An isolated polynucleotide, said polynucleotide comprising any one of the nucleic acid 
sequences shown as SEQ ID NOs: 1-169, 339-455, or 561-784. 

13. A biologically active polypeptide encoded by the polynucleotide of claim 12. 

5 

14. An isolated polypeptide or biologically active fragment thereof, said polypeptide comprising 
any one of the amino acid sequences shown as SEQ ID NOs: 170-338, 456-560, or 785-918. 

15. The polypeptide of claim 14, wherein said polypeptide comprises a signal peptide. 

10 

16. The polypeptide of claim 14, wherein said polypeptide is a mature protein. 



17. An antibody that specifically binds to the polypeptide of claim 14. 



15 18. A pharmaceutical composition comprising the polypeptide of claim 14, and a pharmaceutically 
acceptable carrier. 



19. A method of making a GENSET polypeptide, said method comprising 

a) providing a population of cells comprising a polynucleotide encoding 
20 the polypeptide of claim 14, operably linked to a promoter; 

b) culturing said population of cells under conditions conducive to the 
production of said polypeptide within said cells; and 

c) purifying said polypeptide from said population of cells. 



25 20. A method of determining whether a GENSET gene is expressed within a mammal, said method 
comprising the steps of: 

a) providing a biological sample from said mammal 

b) contacting said biological sample with either of: 

i) a polynucleotide that hybridizes under stringent conditions to the 
30 polynucleotide of claim 1; or 

ii) a polypeptide that specifically binds to the polypeptide of claim 14; and 

c) detecting the presence or absence of hybridization between said polynucleotide 
and an RNA species within said sample, or the presence or absence of binding 
of said polypeptide to a protein within said sample; 

35 wherein a detection of said hybridization or of said binding indicates that said GENSET gene is 
expressed within said mammal. 
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21. The method of claim 21, wherein said polynucleotide is a primer, and wherein said 

hybridization is detected by detecting the presence of an amplification product comprising the 
sequence of said primer. 



5 22. The method of claim 21, wherein said polypeptide is an antibody. 



23. A method of determining whether a mammal has an elevated or reduced level of GENSET gene 
expression, said method comprising the steps of : 

a) providing a biological sample from said mammal; and 
1 0 b) comparing the amount of the polypeptide of claim 14, or of an RNA species 

encoding said polypeptide, within said biological sample with a level 
detected in or expected from a control sample; 
wherein an increased amount of said polypeptide or said RNA species within said biological 
sample compared to said level detected in or expected from said control sample indicates that said 
15 mammal has an elevated level of said GENSET gene expression, and wherein a decreased amount 
of said polypeptide or said RNA species within said biological sample compared to said level 
detected in or expected from said control sample indicates that said mammal has a reduced level of 
said GENSET gene expression. 

20 24. A method of identifying a candidate modulator of a GENSET polypeptide, said method 
comprising : 

a) contacting the polypeptide of claim 14 with a test compound; and 

b) determining whether said compound specifically binds to said 
polypeptide; 

25 wherein a detection that said compound specifically binds to said polypeptide indicates that 

said compound is a candidate modulator of said GENSET polypeptide. 



25. The method of claim 24, further comprising testing the biological activity of said GENSET 
polypeptide in the presence of said candidate modulator, wherein an alteration in the biological 

30 activity of said GENSET polypeptide in the presence of said compound in comparison to the 
activity in the absence of said compound indicates that the compound is a modulator of said 
GENSET polypeptide. 

26. A method for the production of a pharmaceutical composition comprising 

35 a) identifying a modulator of a GENSET polypeptide using the method of 

claim 24; and 

b) combining said modulator with a pharmaceutically acceptable carrier. 
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c 
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<110> GENSET SA 

<120> Full-length human cDNAs encoding potentially secreted proteins 

<130> 81. WO 

<150> US 60/197,873 

<151> 2000-04-18 

<150> US 60/224,009 

<151> 2000-08-07 

<150> US 60/260,328 

<151> 2001-01-08 

<150> US 60/224,006 

<151> 2000-08-04 

<160> 918 

<170> Patent. pm 



<210> 1 

<211> 615 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 88. .471 

<220> 

<221> sig_peptide 

<222> 88. . 189 

<223> Von Heijne matrix 

score 13.1278588169685 

seq VLLLLSLLLTAXA/SS 

<220> 

<221> polyA_site 
<222> 600. . 615 

<220> 

<221> polyA_signal 
<222> 577 . .582 

<400> 1 

acttatctgc agacttgtag gcagcaactc accctcactc agaggtcttc tggttctgga 60 
aacaactcta gctcagcctt ctccacc atg age etc aga ctt gat acc acc cct 114 

Met Ser Leu Arg Leu Asp Thr Thr Pro 
-30 

tec tgt aac agt gcg aga cca ctt cat gec ttg cag gtg ctg ctg ctt 162 

Ser Cys Asn Ser Ala Arg Pro Leu His Ala Leu Gin Val Leu Leu Leu 

-25 " -20 -15 -10 

ctg tea ttg ctg ctg act get ytg get tec tec acc aaa gga caa ayt 210 

Leu Ser Leu Leu Leu Thr Ala Leu Ala Ser Ser Thr Lys Gly Gin Xaa 
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-5 










1 








5 








aag 


aga 


aac 


ttg 


gcg 


aaa 


ggc 


aaa 


gac 


gaa 


agt 


eta 


gac 


agt 


gac 


ttg 


258 


Lys 


Arg 


Asn 


Leu 


Ala 


Lys 


Gly 


Lys 


Asp 


Glu 


Ser 


Leu 


Asp 


Ser 


Asp 


Leu 








10 










15 










20 










tat 


get 


gaa 


etc 


cgc 


tgc 


atg 


tgt 


ata 


aag 


aca 


acc 


tct 


gga 


att 


cat 


306 


Tyr 


Ala 


Glu 


Leu 


Arg 


Cys 


Met 


Cys 


He 


Lys 


Thr 


Thr 


Ser 


Gly 


He 


His 






25 










30 










35 












ccc 


aaa 


aac 


ate 


caa 


agt 


ttg 


gaa 


gtg 


ate 


ggg 


aaa 


gga 


acc 


cat 


tgc 


354 


Pro 


Lys 


Asn 


He 


Gin 


Ser 


Leu 


Glu 


Val 


He 


Gly 


Lys 


Gly 


Thr 


His 


Cys 




40 










45 










50 










55 




aac 


caa 


gtc 


gaa 


gtg 


ata 


gee 


aca 


ctg 


aag 


gat 


ggg 


agg 


aaa 


ate 


tgc 


402 


Asn 


Gin 


Val 


Glu 


Val 


He 


Ala 


Thr 


Leu 


Lys 


Asp 


Gly 


Arg 


Lys 


He 


Cys 












60 










65 










70 






ctg 


gac 


cca 


gat 


get 


ccc 


aga 


ate 


aag 


aaa 


att 


gta 


cag 


aaa 


aaa 


ttg 


450 


Leu 


Asp 


Pro 


Asp 


Ala 


Pro 


Arg 


He 


Lys 


Lys 


He 


Val 


Gin 


Lys 


Lys 


Leu 










75 










80 










85 








gca 


ggt 


gat 


gaa 


tct 


get 


gat 


taatttgttc tgtttctgcc aaacttcttt 


501 


Ala 


Gly 


Asp 


Glu 


Ser 


Ala 


Asp 























90 



aactcccagg aagggtagaa ttttgaaacc ttgattttct agttctcatt tattcaggat 561 
acctattctt actgtattaa aatttggata tgtgtttcaa aaaaaaaaaa aaaa 615 

<210> 2 

<211> 539 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 39. . 518 

<220> 

<221> sig_peptide 

<222> 39. . 92 

<223> Von Heijne matrix 

score 12.2866240999866 

seq LTLAVLFLTGSQA/RH 

<220> 

<221> polyA_site 
<222> 524. .539 

<220> 

<221> polyA_signal 
<222> 507. .512 

<400> 2 

agagactgeg agaaggaggt cccccacggc ccttcagg atg aaa get gcg gtg ctg 56 

Met Lys Ala Ala Val Leu 
-15 



acc 


ttg 


gee 


gtg 


etc 


ttc 


ctg 


acg 


ggg 


age 


cag 


get 


egg 


cat 


ttc 


tgg 


104 


Thr 


Leu 


Ala 


Val 


Leu 


Phe 


Leu 


Thr 


Gly 


Ser 


Gin 


Ala 


Arg 


His 


Phe 


Trp 








-10 










-5 










1 










cag 


caa 


gat 


gaa 


ccc 


ccc 


cag 


age 


ccc 


tgg 


gat 


cga 


gtg 


aag 


gac 


ctg 


152 


Gin 


Gin 


Asp 


Glu 


Pro 


Pro 


Gin 


Ser 


Pro 


Trp 


Asp 


Arg 


Val 


Lys 


Asp 


Leu 




5 










10 










15 










20 




gee 


act 


gtg 


tac 


gtg 


gat 


gtg 


etc 


aaa 


gac 


age 


ggc 


aga 


gac 


tat 


gtg 


200 


Ala 


Thr 


Val 


Tyr 


Val 


Asp 


Val 


Leu 


Lys 


Asp 


Ser 


Gly 


Arg 


Asp 


Tyr 


Val 












25 










30 










35 






tec 


cag 


ttt 


gaa 


ggc 


tec 


gee 


ttg 


gga 


aaa 


cag 


eta 


aac 


eta 


aag 


etc 


248 


Ser 


Gin 


Phe 


Glu 


Gly 


Ser 


Ala 


Leu 


Gly 


Lys 


Gin 


Leu 


Asn 


Leu 


Lys 


Leu 










40 










45 










50 








ctt 


gac 


aac 


tgg 


gac 


age 


gtg 


acc 


tec 


gee 


ttc 


age 


aat 


ctg 


cgc 


gaa 


296 
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3 



Leu 


Asp 


Asn 


Trp 


Asp 


Ser 


Val 


Thr 


Ser 


Ala 


Phe 


Ser 


Asn 


Leu 


Arg 


Glu 








55 










60 










65 










cag 


etc 


ggc 


cct 


gtg 


ace 


cag 


gag 


tty 


tgg 


gat 


aac 


ctg 


gaa 


aag 


gag 


344 


Gin 


Leu 


Gly 


Pro 


Val 


Thr 


Gin 


Glu 


Phe 


Trp 


Asp 


Asn 


Leu 


Glu 


Lys 


Glu 






70 










75 










80 












aca 


gag 


ggc 


ctg 


agg 


cag 


gag 


atg 


age 


aag 


gat 


ctg 


gag 


gag 


gtg 


aag 


392 


Thr 


Glu 


Gly 


Leu 


Arg 


Gin 


Glu 


Met 


Ser 


Lys 


Asp 


Leu 


Glu 


Glu 


Val 


Lys 




85 










90 










95 










100 




gcc 


aag 


gtg 


cag 


ccc 


tac 


ctg 


gac 


gac 


ttc 


cag 


aag 


aag 


tgg 


cag 


gag 


440 


Ala 


Lys 


Val 


Gin 


Pro 


Tyr 


Leu 


Asp 


Asp 


Phe 


Gin 


Lys 


Lys 


Trp 


Gin 


Glu 












105 










110 










115 






gag 


atg 


gag 


etc 


tac 


cgc 


cag 


aag 


gca 


get 


ttc 


tta 


act 


ate 


eta 


aca 


488 


Glu 


Met 


Glu 


Leu 


Tyr 


Arg 


Gin 


Lys 


Ala 


Ala 


Phe 


Leu 


Thr 


He 


Leu 


Thr 










120 










125 










130 








age 


ctt 


gga 


cca 


aat 


gga 


aat 


aaa 


get 


ttt 


tgatgaaaaa aaaaaaaaaa a 


539 


Ser 


Leu 


Gly 


Pro 


Asn 


Gly 


Asn 


Lys 


Ala 


Phe 

















135 140 



<210> 3 

<211> 796 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 134 . .373 



<220> 

<221> sig_peptide 

<222> 134 . .211 

<223> Von Heijne matrix 

score 12.0462155590967 
seq LLLALLLPTQIYS/SE 



<400> 3 

atttgtggct ttcctggtat ataaggtctc gccggctcgc cgcgctcccc accttgcctg 
cgcccgcccg gagecagegg ttctccaagc acccagcatc ctgctagacg cgccgcgcac 
egaeggaggg gac atg ggc aga gca atg gtg gcc agg etc ggg ctg ggg 
Met Gly Arg Ala Met Val Ala Arg Leu Gly Leu Gly 



-25 



-20 



-15 



ctg ctg ctg ctg gca ctg etc eta ccc acg cag att tat tec agt gaa 
Leu Leu Leu Leu Ala Leu Leu Leu Pro Thr Gin He Tyr Ser Ser Glu 



-10 



-5 



act aca act gga act tea agt aac tec tec cag agt act tec aac tct 
Thr Thr Thr Gly Thr Ser Ser Asn Ser Ser Gin Ser Thr Ser Asn Ser 



10 



15 



ggg ttg gcc cca aat cca act aat gcc ace ace aag gtg get ggt ggt 
Gly Leu Ala Pro Asn Pro Thr Asn Ala Thr Thr Lys Val Ala Gly Gly 

20 25 30 

gcc ctg cag tea aca gcc agt etc ttc gtg gtc tea etc tct ctt ctg 
Ala Leu Gin Ser Thr Ala Ser Leu Phe Val Val Ser Leu Ser Leu Leu 
35 40 45 50 

cay eye tac tct taagagactc aggecaagaa aegtyttcta aatttcccca 
His Xaa Tyr Ser 

tcttctaaac ccaatccaaa tggcgtctgg aagtccaatg tggcaaggaa aaacaggtct 
tcatcgaatc tactaattcc acacctttta ttgacacaga aaatgttgag aatcccaaat 
ttgattgatt tgaagaacat gtgagaggtt tgactagatg atggatgcca atattaaatc 
tgctggagtt tcatgtacaa gatgaaggag aggcaacatc caaaatagtt aagacatgat 
ttccttgaat gtggcttgag aaatatggac acttaatact accttgaaaa taagaataga 
aataaaggat gggattgtgg aatggagatt cagttttcat ttggttcatt aattctataa 
ggccataaaa caggtaatat aaa 



60 
120 
169 



217 



265 



313 



361 



413 

473 
533 
593 
653 
713 
773 
796 
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<211> 528 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 75. .377 

<220> 

<221> sig_peptide 

<222> 75. . 155 

<223> Von Heijne matrix 

score 11.8486806626293 

LFLGLLLLPLVVA/ FA 

<220> 

<221> polyA_site 
<222> 513. . 528 

<220> 

<221> polyA_signal 
<222> 498. .503 

<400> 4 

attggccaca gagacccagc ccgagtttcc catcgcactg agcactgaga tcctgctgga 60 

agctctgccg cage atg age tec gca gee ggg ttc tgc gec tea cgc ccc 110 

Met Ser Ser Ala Ala Gly Phe Cys Ala Ser Arg Pro 
-25 -20 

ggg ctg ctg ttc ctg ggg ttg ctg etc ctg cca ctt gtg gtc gec ttc 158 

Gly Leu Leu Phe Leu Gly Leu Leu Leu Leu Pro Leu Val Val Ala Phe 

-15 -10 -5 1 

gec age get gaa get gaa gaa gat ggg gac ctg cag tgc ctg tgt gtg 206 

Ala Ser Ala Glu Ala Glu Glu Asp Gly Asp Leu Gin Cys Leu Cys Val 

5 10 15 



aag 


acc 


acc 


tec 


cag 


gtc 


cgt 


ccc 


agg 


cac 


ate 


acc 


age 


ctg 


gag 


gtg 


254 


Lys 


Thr 


Thr 
20 


Ser 


Gin 


Val 


Arg 


Pro 
25 


Arg 


His 


He 


Thr 


Ser 
30 


Leu 


Glu 


Val 




ate 


aag 


gee 


gga 


ccc 


cac 


tgc 


ccc 


act 


gee 


caa 


ctg 


ata 


gee 


acg 


ctg 


302 


He 


Lys 
35 


Ala 


Gly 


Pro 


His 


Cys 
40 


Pro 


Thr 


Ala 


Gin 


Leu 

45 


He 


Ala 


Thr 


Leu 




aag 


aat 


gga 


agg 


aaa 


att 


tgc 


ttg 


gac 


ctg 


caa 


gee 


ccg 


ctg 


tac 


aag 


350 


Lys 


Asn 


Gly 


Arg 


Lys 


He 


Cys 


Leu 


Asp 


Leu 


Gin 


Ala 


Pro 


Leu 


Tyr 


Lys 




50 










55 










60 










65 




aaa 


ata 


att 


aag 


aaa 


ctt 


ttg 


gag 


agt 


tagctactag < 


etgcctacgt 




397 


Lys 


He 


He 


Lys 


Lys 
70 


Leu 


Leu 


Glu 


Ser 



















gtgtgcattt gctatatagc atacttcttt tttccagttt caatctaact gtgaaagaac 457 
ttctgatatt tgtgttatcc ttatgatttt aaataaacaa aataaatcaa gttgcaaaaa 517 
aaaaaaaaaa a 528 

<210> 5 

<211> 522 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 76. .378 

<220> 

<221> sig_peptide 

<222> 76. . 156 

<223> Von Heijne matrix 



WO 02/083898 PCT/IB01/00914 

5 

score 11.8486806626293 
LFLGLLLLPLVVA/FA 

<220> 

<221> polyA_site 
<222> 508 . . 522 

<220> 

<221> polyA_signal 
<222> 499. .504 

<400> 5 

aattggccac agagacccag cccgagtttc ccatcgcact gagcactgag atcctgctgg 60 
aagctctgcc gcagc atg age tec gca gee ggg ttc tgc gee tea cgc ccc 111 
Met Ser Ser Ala Ala Gly Phe Cys Ala Ser Arg Pro 
-25 -20 
ggg ctg ctg ttc ctg ggg ttg ctg etc ctg cca ctt gtg gtc gee ttc 159 
Gly Leu Leu Phe Leu Gly Leu Leu Leu Leu Pro Leu Val Val Ala Phe 
-15 -10 -5 1 

gee age get gaa get gaa gaa gat ggg gac ctg cag tgc ctg tgt gtg 207 
Ala Ser Ala Glu Ala Glu Glu Asp Gly Asp Leu Gin Cys Leu Cys Val 
5 10 15 



aag 


acc 


acc 


tec 


cag 


gtc 


cgt 


ccc 


agg 


cac 


ate 


acc 


age 


ctg 


gag 


gtg 


255 


Lys 


Thr 


Thr 
20 


Ser 


Gin 


Val 


Arg 


Pro 
25 


Arg 


His 


He 


Thr 


Ser 
30 


Leu 


Glu 


Val 




ate 


aag 


gee 


gga 


ccc 


cac 


tgc 


ccc 


act 


gee 


caa 


ctg 


ata 


gee 


acg 


ctg 


303 


He 


Lys 
35 


Ala 


Gly 


Pro 


His 


Cys 
40 


Pro 


Thr 


Ala 


Gin 


Leu 
45 


He 


Ala 


Thr 


Leu 




aag 


aat 


gga 


agg 


aaa 


att 


tgc 


ttg 


gac 


ctg 


caa 


gee 


ccg 


ctg 


tac 


aag 


351 


Lys 


Asn 


Gly 


Arg 


Lys 


He 


Cys 


Leu 


Asp 


Leu 


Gin 


Ala 


Pro 


Leu 


Tyr 


Lys 




50 










55 










60 










65 




aaa 


ata 


att 


aag 


aaa 


ctt 


ttg 


gag 


agt 


tagctactag > 


ctgcctacgt 




398 


Lys 


He 


He 


Lys 


Lys 
70 


Leu 


Leu 


Glu 


Ser 



















gtgtgcattt gctatatagc atacttcttt tttccagttt caatctaact gtgaaagaac 458 
ttctgatatt tgtgttatcc ttatgatttt aaataaacaa aataaatcta aaaaaaaaaa 518 
aaaa 522 

<210> 6 

<211> 652 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 75. .332 

<220> 

<221> sig_peptide 

<222> 75 . . 128 

<223> Von Heijne matrix 

score 11.7816699306029 

seq VLLLLLLVEQAAA/LG 

<220> 

<221> polyA__site 
<222> 637. . 652 

<220> 

<221> polyA__signal 
<222> 616. . 621 



<400> 6 



WO 02/083898 



PCT/IB01/00914 



6 



gactctttgg tcagggaact gctcgctgag cacagctgca cagtgctgtc agaacggccg 



cga ctg 
Arg Leu 



aag ate 
Lys lie 
30 

tea gat ggt gtt 
Ser Asp Gly Val 
45 

gag act ctg aag 
Glu Thr Leu Lys 
60 

tcatattctt 
catgecatta 
caccagtggt 
tattctagtc 
aaatatggga 



15 

caa gtg 
Gin Val 



20 
get 
Ala 



tac 
Tyr 

cat 
His 



acg 
Thr 



cga aag gca 
Arg Lys Ala 

35 

ggc ctg age acc 



ata 
He 



acc age 
Thr Ser 



agg aac cag 



25 

tat gag aaa 
Tyr Glu Lys 
40 

gag act tac 



Gly Leu Ser Thr Arg Asn Gin Glu Thr Tyr 
50 55 
gag aaa cca ccm cag tagecttaga atagatgegg 
Glu Lys Pro Pro Gin 
65 

ctttggcttc tggttcttcc agccctcatg tttggcatca catatgectg 
acaccagctg gccctacccc tataatgatc ctgtgtccta aattaatata 
tcctcctccc tgttaaagac taatgctcag atgctgttta eggatattta 
tcactctctt gtcccaccct tcttctcttc cccattccca actccagcta 
agggagaacc cccaataaaa ctgccatgga ctggaaaaaa aaaaaaaaaa 



60 



atctccagcc 


caag 


atg 


att 


cca 


gca 


gtg 


gtc 


ttg 


etc 


tta 


etc 


ctt 


ttg 


110 






Met 


He 


Pro 


Ala 


Val 


Val 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 














-15 










-10 










gtt gaa caa 


gca 


gcg 


gee 


ctg 


gga 


gag 


cct 


cag 


etc 


tgc 


tat 


ate 


ctg 


158 


Val Glu Gin 


Ala 


Ala 


Ala 


Leu 


Gly 


Glu 


Pro 


Gin 


Leu 


Cys 


Tyr 


He 


Leu 




-5 








1 








5 










10 




gat gee ate 


ctg 


ttt 


ctg 


tat 


gga 


att 


gtc 


etc 


acc 


etc 


etc 


tac 


tgt 


206 


Asp Ala He 


Leu 


Phe 


Leu 


Tyr 


Gly 


He 


Val 


Leu 


Thr 


Leu 


Leu 


Tyr 


Cys 





254 



302 



352 



412 
472 
532 
592 
652 



<210> 7 

<211> 743 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 112. .387 



<220> 

<221> sig_peptide 

<222> 112. . 183 

<223> Von Heijne matrix 

score 11.7298925418815 
seq FVLGLGLTPPTLA/QD 



<220> 

<221> polyA_site 
<222> 728. .743 



<220> 

<221> polyA_signal 
<222> 705. .710 



<400> 7 

acacaactgg aacccatctc caggaacaaa cagctggaac ccatctcccg ttgaagggaa 60 
actgecagat ttttgtaaga ttcttcctcc tgggagcctg tgttggaaga g atg gtg 117 

Met Val 

atg ggc ctg ggc gtt ttg ttg ttg gtc ttc gtg ctg ggt ctg ggt ctg 165 
Met Gly Leu Gly Val Leu Leu Leu Val Phe Val Leu Gly Leu Gly Leu 

-20 -15 -10 

acc cca ccg acc ctg get cag gat aac tec agg tac aca cac ttc ctg 213 
Thr Pro Pro Thr Leu Ala Gin Asp Asn Ser Arg Tyr Thr His Phe Leu 

-5 15 10 

acc cag cac tat gat gee aaa cca cag ggc egg gat gac aga tac tgt 261 
Thr Gin His Tyr Asp Ala Lys Pro Gin Gly Arg Asp Asp Arg Tyr Cys 

15 20 25 

gaa age ate atg agg aga egg ggc ctg acc tea ccc tgc aaa gac ate 309 



WO 02/083898 



PCT/IB01/00914 



Glu Ser lie Met Arg Arg Arg Gly Leu Thr Ser 

30 35 
aac aca ttt att cat ggc aac aag cgc asa tea 
Asn Thr Phe lie His Gly Asn Lys Arg Xaa Ser 

45 50 
aca aga atg gaa acc etc aca gag aaa acc taa 
Thr Arg Met Glu Thr Leu Thr Glu Lys Thr 

60 65 
ttccaggtca ccacttgcaa gctacatgga ggttccccct 
gccacagcgg ggttcagaaa cgttgttgtt gcttgtgaaa 
gatcagtcaa ttttcegteg tccgtaacca gcgggcccct 
gtccttgcct tccatttccc ctctgcaccc agaacagtgg 
ggeccaaaga aagagctacc tggacctttt gttttctgtt 
aaaaatgtct tgatatcagc aaaaaaaaaa aaaaaa 



Pro Cys Lys Asp lie 
4 0 

agg cca tct gtg aaa 357 
Arg Pro Ser Val Lys 
55 

gaataag caagtcttct 407 



ggcctccatg ccagtaccga 467 
atggcttacc tgtccacttg 527 
ggtcaagtgc tggctctget 587 
tggcaacatt cattgecaag 647 
tgacaacatg tttaataaat 707 

743 



<210> 8 

<211> 502 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 39. .371 



<220> 

<221> sig_peptide 

<222> 39. .86 

<223> Von Heijne matrix 

score 11.56948973398 
seq ILLSVALLAFSSA/QD 



<220> 

<221> polyA_site 
<222> 487. .502 



<220> 

<221> polyA_signal 
<222> 467. .472 



<400> 8 

agcataaagt tgggagtgac accagagcct tetgeaag atg ctt ctg att ctg ctg 56 

Met Leu Leu lie Leu Leu 
-15 

tea gtg gee ctg ctg gee ttc age tea get cag gac tta gat gaa gat 104 
Ser Val Ala Leu Leu Ala Phe Ser Ser Ala Gin Asp Leu Asp Glu Asp 
-10 -5 15 

gtc age caa gaa gac gtt ccc ttg gta ata tea gat gga gga gac tct 152 
Val Ser Gin Glu Asp Val Pro Leu Val lie Ser Asp Gly Gly Asp Ser 

10 15 20 

gag cag ttc ata gat gag gag cgt cag gga cca cct ttg gga gga cag 200 
Glu Gin Phe lie Asp Glu Glu Arg Gin Gly Pro Pro Leu Gly Gly Gin 

25 30 35 

caa tct caa ccc tct get ggt gat ggg aac cag aat gat ggc cct cag 248 
Gin Ser Gin Pro Ser Ala Gly Asp Gly Asn Gin Asn Asp Gly Pro Gin 

40 45 50 

cag gga cca ccc caa caa gga ggc cag cag caa caa ggt cca cca cct 296 
Gin Gly Pro Pro Gin Gin Gly Gly Gin Gin Gin Gin Gly Pro Pro Pro 
55 60 65 70 

cct cag gga aag cca caa gga cca cct ccc caa ggg ggc cgc cca caa 34 4 

Pro Gin Gly Lys Pro Gin Gly Pro Pro Pro Gin Gly Gly Arg Pro Gin 

75 80 85 

gga cct cca cag ggg cag tct cct cag taatctagga ttcaatgaca 391 
Gly Pro Pro Gin Gly Gin Ser Pro Gin 
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90 95 
ggaagtgaat aagaagatga cagtgtttca aatgccttga aacataatgt gatcatgctc 451 
taacttcaat ataccaataa aataatcagc ttgctaaaaa aaaaaaaaaa a 502 



<210> 9 

<211> 401 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 36. .353 



<220> 

<221> sig_peptide 

<222> 36. .83 

<223> Von Heijne matrix 

score 11.56948973398 
seq ILLSVALLAFSSA/QD 



<220> 

<221> polyA_site 
<222> 386. . 401 



<400> 9 

ataaagttgg gagtgacacc agagccttct gcaag atg ctt ctg att ctg ctg 53 

Met Leu Leu lie Leu Leu 
-15 

tea gtg gec ctg ctg gec ttc age tea get cag gac tta gat gaa gat 101 

Ser Val Ala Leu Leu Ala Phe Ser Ser Ala Gin Asp Leu Asp Glu Asp 
-10 -5 15 

gtc age caa gaa gac gtt ccc ttg gta ata tea gat gga gga gac tct 14 9 

Val Ser Gin Glu Asp Val Pro Leu Val lie Ser Asp Gly Gly Asp Ser 

10 15 20 

gag cag ttc ata gat gag gag cgt cag gga cca cct ttg gga gga cag 197 

Glu Gin Phe lie Asp Glu Glu Arg Gin Gly Pro Pro Leu Gly Gly Gin 

25 30 35 

caa tct caa ccc tct get ggt gat ggg aac cag gat gat ggc cct cag 245 

Gin Ser Gin Pro Ser Ala Gly Asp Gly Asn Gin Asp Asp Gly Pro Gin 

40 45 ~ 50 

cag gga cca ccc caa caa gga ggc cag cag caa caa ggt cca cca cct 293 

Gin Gly Pro Pro Gin Gin Gly Gly Gin Gin Gin Gin Gly Pro Pro Pro 
55 60 65 70 

cct cag gga aag cca caa gga cca ccc caa cag gga ggc cag tct tgt 341 

Pro Gin Gly Lys Pro Gin Gly Pro Pro Gin Gin Gly Gly Gin Ser Cys 

75 80 85 

tgc tgt gac aag taacagcett ttttttaaat tgttttttat acaaaaaaaa 393 
Cys Cys Asp Lys 

90 

aaaaaaaa 401 



<210> 10 

<211> 961 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 45. .467 



<220> 

<221> sig_peptide 
<222> 45. . 116 
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<223> Von Heijne matrix 

score 10.983328612314 
seq LAGLCCLVPVSLA/ED 



<220> 

<221> polyA_site 
<222> 946. . 961 

<220> 

<221> polyA_signal 
<222> 927 . . 932 



<400> 10 

ctcagcttca ggcaccacca ctgacctggg acagtgaatc gaca atg ccg tct tct 56 

Met Pro Ser Ser 

gtc teg tgg ggc ate etc ctg ctg gca ggc ctg tgc tgc ctg gtc cct 104 

Val Ser Trp Gly lie Leu Leu Leu Ala Gly Leu Cys Cys Leu Val Pro 

-20 -15 -10 -5 

gtc tec ctg get gag gat ccc cag gga gat get gec cag aag aca gat 152 

Val Ser Leu Ala Glu Asp Pro Gin Gly Asp Ala Ala Gin Lys Thr Asp 

15 10 
aca tec cac cat gat cag gat cac cca acc ttc aac aag ate acc ccc 200 
Thr Ser His His Asp Gin Asp His Pro Thr Phe Asn Lys lie Thr Pro 

15 20 25 

aac ctg get gag ttc gec ttc age eta tac cgc cag ctg gca cac cag 248 
Asn Leu Ala Glu Phe Ala Phe Ser Leu Tyr Arg Gin Leu Ala His Gin 

30 35 40 

tec aac age acc aat ate ttc ttc tec cca gtg age ate get aca gec 296 
Ser Asn Ser Thr Asn lie Phe Phe Ser Pro Val Ser lie Ala Thr Ala 
45 50 55 60 

ttt gca atg etc tec ctg ggg acc aag get gac act cac gat gaa ate 34 4 

Phe Ala Met Leu Ser Leu Gly Thr Lys Ala Asp Thr His Asp Glu lie 

65 70 75 

ctg gag ggc ctg aat ttc aac etc acg gag att ccg gag get cag ate 392 
Leu Glu Gly Leu Asn Phe Asn Leu Thr Glu lie Pro Glu Ala Gin lie 

80 85 90 

cat gaa ggc ttc cag gaa etc etc cgt acc etc aac cag cca gac age 440 
His Glu Gly Phe Gin Glu Leu Leu Arg Thr Leu Asn Gin Pro Asp Ser 

95 100 105 

cag etc cag ctg acc acc ggc aag aat taatgettte ttaactaagt 487 
Gin Leu Gin Leu Thr Thr Gly Lys Asn 

110 115 
tactttgagt attaacaagt agcttatact gaggtgtgct tcaagggagc attatagggc 547 
agaaaacaaa ttaaaatgaa atgttgttgt tatagecatg cattgaccat taaatgeatt 607 
ttaatatctc ggcaaaatga gatttgaaac tcattagttt aactcttaaa acatagtgtt 667 
gttaaacagt attttttggg ggcgggtgga tagcttttts ttctatcttg tacagaaaac 727 
agtaatggaa tcactcatgt tcagaacctt gtaaatatgt ctgtaatatt aatgcttccc 787 
tttattaaca caaatataca ttcaaaaagg gagattttat ttagacacac attttctttt 847 
caccctaatc tatttagatt catacttatt gagaegggaa aaactcgctg taaaataatg 907 
ccaacctaga taatgetata ataaattatt ttgtacacaa aaaaaaaaaa aaaa 961 



<210> 11 

<211> 643 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 91. .462 



<220> 

<221> sig_peptide 
<222> 91 . . 180 
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<223> Von Heijne matrix 

score 9.99993961154359 
seq LLFGFTLVSGTGA/EK 



<220> 

<221> polyA_site 
<222> 628. . 643 



<220> 

<221> polyA_signal 
<222> 607 . . 612 



<400> 11 

acccctaccc cacgccccct cccgcgcgcg cggttaaatc cccgcacctg agcatcggct 60 
cacacctgca ccccgcccgg gcatagcacc atg cct get tgt cgc eta ggc ccg 114 

Met Pro Ala Cys Arg Leu Gly Pro 
-30 -25 
eta gee gee gee etc etc etc age ctg ctg ctg ttc ggc ttc ace eta 162 
Leu Ala Ala Ala Leu Leu Leu Ser Leu Leu Leu Phe Gly Phe Thr Leu 

-20 -15 -10 

gtc tea ggc aca gga gca gag aag act ggc gtg tgc ccc gag etc cag 210 
Val Ser Gly Thr Gly Ala Glu Lys Thr Gly Val Cys Pro Glu Leu Gin 

-5 15 10 

get gac cag aac tgc acg caa gag tgc gtc teg gac age gaa tgc gee 258 
Ala Asp Gin Asn Cys Thr Gin Glu Cys Val Ser Asp Ser Glu Cys Ala 

15 20 25 

gac aac etc aag tgc tgc age gcg ggc tgt gee acc ttc tgc tct ctg 306 
Asp Asn Leu Lys Cys Cys Ser Ala Gly Cys Ala Thr Phe Cys Ser Leu 

30 "* ~ 35 4 0 

ccc aat gat aag gag ggt tec tgc ccc cag gtg aac att aac ttt ccc 354 
Pro Asn Asp Lys Glu Gly Ser Cys Pro Gin Val Asn lie Asn Phe Pro 

45 50 55 

cag etc ggc etc tgt egg gac cag tgc cag gtg gac age cag tgt cct 402 
Gin Leu Gly Leu Cys Arg Asp Gin Cys Gin Val Asp Ser Gin Cys Pro 

60 65 70 

ggc cag atg aaa tgc tgc cgc aat ggc tgt ggg aag gtg tec tgt gtc 450 
Gly Gin Met Lys Cys Cys Arg Asn Gly Cys Gly Lys Val Ser Cys Val 
75 80 85 90 

act ccc aat ttc tgagctccag ccaccaccag gctgagcagt gaggagagaa 502 
Thr Pro Asn Phe 

agtttctgcc tggccctgca tctggttcca gcccacctgc cctccccttt ttegggaetc 562 
tgtattccct cttgggctga ccacagcttc tccctttccc aaccaataaa gtaaccactt 622 
tcagcaaaaa aaaaaaaaaa a 643 



<210> 12 

<211> 577 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 28 . .399 



<220> 

<221> sig_peptide 

<222> 28 . . 117 

<223> Von Heijne matrix 

score 9.99993961154359 
seq LLFGFTLVSGTGA/EK 



<220> 

<221> polyA_site 
<222> 562. . 577 
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<220> 

<221> polyA_signal 
<222> 540. .545 



<400> 12 

acctgcaccc cgcccgggca tagcacc atg cct get tgt cgc eta ggc ccg eta 54 

Met Pro Ala Cys Arg Leu Gly Pro Leu 

-30 -25 



gee 


gee 


gee 


etc 


etc 


etc 


age 


ctg 


ctg 


ctg 


ttc 


ggc 


ttc 


ace 


eta 


gtc 


102 


Ala 


Ala 


Ala 


Leu 


Leu 


Leu 


Ser 


Leu 


Leu 


Leu 


Phe 


Gly 


Phe 


Thr 


Leu 


Val 






-20 










-15 










-10 












tea 


ggc 


aca 


gga 


gca 


gag 


aag 


act 


ggc 


gtg 


tgc 


ccc 


gag 


etc 


cag 


get 


150 


Ser 


Gly 


Thr 


Gly 


Ala 


Glu 


Lys 


Thr 


Gly 


Val 


Cys 


Pro 


Glu 


Leu 


Gin 


Ala 




-5 










1 








5 










10 






gac 


cag 


aac 


tgc 


acg 


caa 


gag 


tgc 


gtc 


teg 


gac 


age 


gaa 


tgc 


gee 


gac 


198 


Asp 


Gin 


Asn 


Cys 


Thr 


Gin 


Glu 


Cys 


Val 


Ser 


Asp 


Ser 


Glu 


Cys 


Ala 


Asp 










15 










20 










25 








aac 


ate 


aag 


tgc 


tgc 


age 


gcg 


ggc 


tgt 


gee 


ace 


ttc 


tgc 


tct 


ctg 


ccc 


246 


Asn 


He 


Lys 


Cys 


Cys 


Ser 


Ala 


Gly 


Cys 


Ala 


Thr 


Phe 


Cys 


Ser 


Leu 


Pro 








30 










35 










40 










aat 


gat 


aag 


gag 


ggt 


tec 


tgc 


ccc 


cag 


gtg 


aac 


att 


aac 


ttt 


ccc 


cag 


294 


Asn 


Asp 


Lys 


Glu 


Gly 


Ser 


Cys 


Pro 


Gin 


Val 


Asn 


He 


Asn 


Phe 


Pro 


Gin 






45 










50 










55 












etc 


ggc 


etc 


tgt 


egg 


gac 


cag 


tgc 


cag 


gtg 


gac 


age 


cag 


tgt 


cct 


ggc 


342 


Leu 


Gly 


Leu 


Cys 


Arg 


Asp 


Gin 


Cys 


Gin 


Val 


Asp 


Ser 


Gin 


Cys 


Pro 


Gly 




60 










65 










70 










75 




cag 


atg 


aaa 


tgc 


tgc 


cgc 


aat 


ggc 


tgt 


ggg 


aag 


gtg 


tec 


tgt 


gtc 


act 


390 


Gin 


Met 


Lys 


Cys 


Cys 


Arg 


Asn 


Gly 


Cys 


Gly 


Lys 


Val 


Ser 


Cys 


Val 


Thr 





80 85 90 



ccc aat ttc tgagctccag ccacaggctg agcagtgagg agagaaagtt 439 
Pro Asn Phe 

tctgcctggc cctgcatctg gttccagccc acctgccctc ccctttttcg ggactctgta 499 

ttccctcttg ggctgaccac agcttctccc tttcccaacc aataaagtaa ccactttcag 559 

ctaaaaaaaa aaaaaaaa 577 



<210> 13 

<211> 785 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 10. .216 



<220> 

<221> sig_peptide 

<222> 10. . 60 

<223> Von Heijne matrix 

score 9.5148977432302 
FVFLLFVISLAAA/AH 



<400> 13 

aacccactt atg ttt cct ggc ttt gtt ttt ttg etc ttt gta ata agt ctt 51 
Met Phe Pro Gly Phe Val Phe Leu Leu Phe Val He Ser Leu 
-15 -10 -5 

get get get get cat ctt tgg gtc ctg gee gee ttt atg ggc cgt ate 99 
Ala Ala Ala Ala His Leu Trp Val Leu Ala Ala Phe Met Gly Arg lie 

15 10 
ace gtg aag gtc tgc age ttc act cct gag gee age aag acc gtg age 147 
Thr Val Lys Val Cys Ser Phe Thr Pro Glu Ala Ser Lys Thr Val Ser 

15 20 25 

cca cca gaa gga gcg aac aac teg aga cgc act get ttt aag age tgt 195 



WO 02/083898 PCT/IB01/00914 



12 

Pro Pro Glu Gly Ala Asn Asn Ser Arg Arg Thr Ala Phe Lys Ser Cys 
30 35 40 45 

aac act cac cac aaa ggt ctg tagcatcact cctgaagtca gcgggaccac 24 6 
Asn Thr His His Lys Gly Leu 
50 

gaacctacca gaaagaagaa actccataca gatctgaaaa tctgaaggaa caaactccgg 306 

acacaccatc tttaagaact gtaacactca ctgcgagggt ccacggcttc atttttgaag 366 

tcagccagac caagaaccca ccaattccgc acacacttct gctgtttcgt tccattctca 426 

aggaaaaact gtgatgggtc catttttaat cacatgcaaa ttcatgggca aatctttgct 486 

atggttatgt actttgattt agtagagttg tatcacatgt tcactcatgt gcccacaata 54 6 

gcagggtata ccaataggct gcttcaaaac aatcatgaaa agtaggaggt ggtgtttcca 606 

aaggtagtgg tcttgccaga aaaaaatgga aatactgtga agataaaaat aaacatctat 666 

taacacaaaa atagaaaaat aatataggta cacatacaca tacttttcta catatttttt 726 

gcagtcatat cagaggaaca atacagagga acaaaagtgt aaatcaaaaa aaaaaaaaa 785 



<210> 14 

<211> 1154 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 88 . . 318 

<220> 

<221> sig_peptide 

<222> 88 . . 186 

<223> Von Heijne matrix 

score 9.33181480765342 

seq MVLLSFALTLCSA/FW 

<220> 

<221> polyA_site 
<222> 1139. .1154 

<400> 14 

ctgtgggtgc ccaggaaggg actacacctc ttcgcagtgt tttatacctt tggtaatatc 60 
gcatcaattg ggagtaccat cttcctc atg gga cca gtg aaa cag ctg aag cga 114 

Met Gly Pro Val Lys Gin Leu Lys Arg 
-30 -25 



atg 


ttt 


gag 


cct 


act 


cgt 


ttg 


att 


gca 


act 


ate 


atg 


gtg 


ctg 


ttg 


agt 


162 


Met 


Phe 


Glu 


Pro 


Thr 


Arg 


Leu 


He 


Ala 


Thr 


He 


Met 


Val 


Leu 


Leu 


Ser 












-20 










-15 










-10 






ttt 


gca 


ctt 


acc 


ctg 


tgt 


tct 


gec 


ttt 


tgg 


tgg 


cat 


aac 


awg 


gga 


ctt 


210 


Phe 


Ala 


Leu 


Thr 
-5 


Leu 


Cys 


Ser 


Ala 


Phe 
1 


Trp 


Trp 


His 


Asn 
5 


Xaa 


Gly 


Leu 




gca 


ctt 


ate 


ttc 


tgc 


att 


ttg 


cag 


tct 


ttg 


gca 


ttg 


acg 


tgg 


tac 


age 


258 


Ala 


Leu 


He 


Phe 


Cys 


He 


Leu 


Gin 


Ser 


Leu 


Ala 


Leu 


Thr 


Trp 


Tyr 


Ser 






10 










15 










20 












ctt 


tec 


ttc 


ata 


cca 


ttt 


gca 


agg 


gat 


get 


gtg 


aag 


aag 


tgt 


ttt 


gee 


306 


Leu 


Ser 


Phe 


He 


Pro 


Phe 


Ala 


Arg 


Asp 


Ala 


Val 


Lys 


Lys 


Cys 


Phe 


Ala 




25 










30 










35 










40 





gtg tgt ctt gca taattcatgg ccagttttat gaagctttgg aaggcactat 358 
Val Cys Leu Ala 

ggacagaagc tggtggacag ttttgtaact atcttcgaaa cctctgtctt acagacatgt 418 

gecttttate ttgcagcaat gtgttgcttg tgattcgaac atttgagggt tacttttgga 478 

agcaacaata cattctcgaa cctgaatgtc agtagcacag gatgagaagt gggttctgta 538 

tcttgtggag tggaatcttc ctcatgtacc tgtttcctct ctggatgttg tcccactgaa 598 

ttcccatgaa tacaaaccta ttcagcaaca gcacataagc cttgggtgca agtgattccc 658 

aggtggcaaa aggcagcccc atcagagatc aegggagcaa cagtaaggga cagagttttg 718 

gggtccactt gtccctcagc atggaagcca tcaccgtggt ectgeataga gtgagtctac 778 

ttctactctg gcatctgaga acaagtgact ctgetttaga caagcccctg gagagectgg 838 

ccatggagtg aggtagaaaa gaagcacttt ttggtggtat atgctgtttc tgaatttgaa 898 



WO 02/083898 

tctaagcatt cttggtgttc tgattctgcg 
ggtgcagact ctttttctct tgggttttat 
tctttcttta tgacctcctg gcactagatg 
tgctgctgtt ttttttggac gagagcccat 
aaaaaaaaaa aaaaaa 
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13 

cctgactgtg gaatgtattt tctgacatta 958 
ttcccatccc tccctccctc tccctgcata 1018 
gggaaagaac agagcagcag cagcagaaaa 1078 
aggagttgaa aaatcctgct gctctcgtcc 1138 

1154 



<210> 15 

<211> 766 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 84 . .521 



<220> 

<221> sig_peptide 

<222> 84 . . 161 

<223> Von Heijne matrix 

score 8.99656184291183 
seq GLLWAFCAPGARA/EE 



<220> 

<221> polyA_site 
<222> 751. .766 



<220> 

<221> polyA_signal 
<222> 728. .733 



<400> 15 

gaggaagagc gttttgggga cgggggctgg tgaggctcac gttggagggc ttcgcgtctg 60 
cttcggagac cgtaaggata ttg atg acc atg aga tec ctg etc aga ace ccc 113 

Met Thr Met Arg Ser Leu Leu Arg Thr Pro 
-25 -20 



ttc 


ctg 


tgt 


ggc 


ctg 


etc 


tgg 


gec 


ttt 


tgt 


gee 


cca 


ggc 


gee 


agg 


get 


161 


Phe 


Leu 


Cys 


Gly 


Leu 


Leu 


Trp 


Ala 


Phe 


Cys 


Ala 


Pro 


Gly 


Ala 


Arg 


Ala 






-15 










-10 










-5 












gag 


gag 


cct 


gca 


gee 


age 


ttc 


tec 


caa 


ccc 


ggc 


age 


atg 


ggc 


ctg 


gat 


209 


Glu 


Glu 


Pro 


Ala 


Ala 


Ser 


Phe 


Ser 


Gin 


Pro 


Gly 


Ser 


Met 


Gly 


Leu 


Asp 




1 








5 










10 










15 






aag 


aac 


aca 


gtg 


cac 


gac 


caa 


gag 


cat 


ate 


atg 


gag 


cat 


eta 


gaa 


ggt 


257 


Lys 


Asn 


Thr 


Val 


His 


Asp 


Gin 


Glu 


His 


He 


Met 


Glu 


His 


Leu 


Glu 


Gly 








20 










25 










30 








gtc 


ate 


aac 


aaa 


cca 


gag 


gcg 


gag 


atg 


teg 


cca 


caa 


gaa 


ttg 


cag 


etc 


305 


Val 


lie 


Asn 


Lys 


Pro 


Glu 


Ala 


Glu 


Met 


Ser 


Pro 


Gin 


Glu 


Leu 


Gin 


Leu 








35 










40 










45 










cat 


tac 


ttc 


aaa 


atg 


cat 


gat 


tat 


gat 


ggc 


aat 


aat 


ttg 


ctt 


gat 


ggc 


353 


His 


Tyr 


Phe 


Lys 


Met 


His 


Asp 


Tyr 


Asp 


Gly 


Asn 


Asn 


Leu 


Leu 


Asp 


Gly 






50 










55 










60 












tta 


gaa 


etc 


tec 


aca 


gec 


ate 


act 


cat 


gtc 


cat 


aag 


gag 


gaa 


ggg 


agt 


401 


Leu 


Glu 


Leu 


Ser 


Thr 


Ala 


He 


Thr 


His 


Val 


His 


Lys 


Glu 


Glu 


Gly 


Ser 




65 










70 










75 










80 




gaa 


cag 


gca 


cca 


eta 


atg 


agt 


gaa 


gat 


gaa 


ctg 


att 


aac 


ata 


ata 


gat 


449 


Glu 


Gin 


Ala 


Pro 


Leu 


Met 


Ser 


Glu 


Asp 


Glu 


Leu 


He 


Asn 


He 


He 


Asp 












85 










90 










95 






ggt 


gtt 


ttg 


aga 


gat 


gat 


gac 


aag 


aac 


aat 


gat 


gga 


tac 


att 


gac 


tat 


497 


Gly 


Val 


Leu 


Arg 


Asp 


Asp 


Asp 


Lys 


Asn 


Asn 


Asp 


Gly 


Tyr 


He 


Asp 


Tyr 










100 










105 










110 








get 


gaa 


ttt 


gca 


aaa 


tea 


ctg 


cag 


tagatgttat ttggccatct cctggttata 


551 


Ala 


Glu 


Phe 


Ala 


Lys 


Ser 


Leu 


Gin 





















115 120 



tacaaatgtg accegtgata atgtgattga acactttagt aatgcgaaat aactcatttc 611 
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caactactgc tgcagcattt tggtaaaaac ctgtagcgat tcgttacact ggggtgagaa 671 

gagataagag aaatgaaaga gaagagaaat gggacatcta atagtcccta agtgstatta 731 
aataccttat tggacaagca aaaaaaaaaa aaaaa 766 



<210> 16 

<211> 808 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 155. .340 



<220> 

<221> sig_peptide 

<222> 155. .292 

<223> Von Heijne matrix 

score 8.64329745298384 
AVLLLILFAI VFG/LL 



<220> 

<221> polyA_site 
<222> 754 . . 808 



<220> 

<221> polyA_signal 
<222> 728. .733 



<400> 16 
cttttcctct 
cttcagctca 
gacccagaac 



gag aca ate 
Glu Thr He 

ggc ctg gca 
Gly Leu Ala 

tta ttt gec 
Leu Phe Ala 
-5 

tgt gaa gcg 
Cys Glu Ala 
10 

gcaggggagg 
ggctcagggg 
tgtatctggc 
tggctccttg 
aggatggctc 
cacaggccca 
aaaaattcct 
aaaaaaaaaa 



caacagttgc ttctttgagt cagggtgcag ctctggtcac ctggcggcct 
gccctccaca aagtgtgagc ctgaaggacc accctgaatt geccttgtag 
agctaccagc agaatcagat tctc atg gac caa ctg gta ttc aaa 

Met Asp Gin Leu Val Phe Lys 
-45 -40 
tgg aat gat gcg ttc tgg cag aac ccc tgg gac cag ggg 
Trp Asn Asp Ala Phe Trp Gin Asn Pro Trp Asp Gin Gly 

-35 -30 -25 

gtg att ate tta ttc ate ace get gtc ctg ctt etc ate 
Val He He Leu Phe He Thr Ala Val Leu Leu Leu He 
-20 -15 -10 

ate gtg ttt ggt tta etc act tec aca gaa aac act cag 
He Val Phe Gly Leu Leu Thr Ser Thr Glu Asn Thr Gin 

1 5 
ggt gaa gag gag tgacctgact tgctggggac tgagatggca 
Gly Glu Glu Glu 
15 

cgagctgacc tgcccccatt ccagtggtgg gccccttcgc ggttccctct 
ccaagccctg gtgtcttcct ttcccaccag gaaaaagtct agtaaaatac 
ttagggttgg tcagactagt aagatgggga ggctggtctg agaccaattc 
accctattgt ttttagggtt ccccgaccag aaccctaaaa gcacatggag 
cactgcctca ggtggaagga gctatggcta acaaggttct ctaacaggct 
gecagcaatt tcacaaatcc ttgacagaga aagacacaac caaatgaaat 
tttcaaatct gctaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaramaaaa 
aaaaaaaa 



60 
120 
175 



223 



271 



319 



370 



430 
490 
550 
610 
670 
730 
790 
808 



<210> 17 

<211> 772 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 155. .340 



WO 02/083898 



<220> 

<221> sig_peptide 

<222> 155. .292 

<223> Von Heijne matrix 

score 8.64329745298384 
AVLLL I L FAI V FG / LL 

<220> 

<221> polyA__site 
<222> 751. .772 

<220> 

<221> polyA__signal 
<222> 728 . .733 



PCT/IB01/00914 



15 



<400> 17 

cttttcctct caacagttgc ttctttgagt cagggtgcag ctctggtcac ctggcggcct 
cttcagctca gccctccaca aagtgtgagc ctgaaggacc accctgaatt gcccttgtag 
gacccagaac agcaaccagc agaatcagat tctc atg gac caa ttg gta ttc aaa 

Met Asp Gin Leu Val Phe Lys 
-45 -40 
gag aca ate tgg aat gat gcg ttc tgg cag aac ccc tgg gac cag ggg 
Glu Thr lie Trp Asn Asp Ala Phe Trp Gin Asn Pro Trp Asp Gin Gly 

-35 -30 -25 

ggc ctg gca gtg att ate tta ttc ate acc get gtc ctg ctt etc ate 
Gly Leu Ala Val lie lie Leu Phe lie Thr Ala Val Leu Leu Leu lie 

-20 -15 -10 

tta ttt gec ate gtg ttt ggt tta etc act tec aca gaa aac act cag 
Leu Phe Ala lie Val Phe Gly Leu Leu Thr Ser Thr Glu Asn Thr Gin 

-5 15 
tgt gaa gcg ggt gaa gag gag tgacctgact tgctggggac tgagatggca 
Cys Glu Ala Gly Glu Glu Glu 
10 " 15 

gcaggggagg cgagctgacc tgcccccatt ccagtggtgg gccccttcgc ggttccctct 
ggctcagggg ccaagccctg gtgtcttcct ttcccaccag gaaaaagtct agtaaaatac 
tgtatctggc ttagggttgg tcagactagt aagatgggga ggctggtctg agaccaattc 
tggctccttg accctattgt ttttagggtt ccccgaccag aaccctaaaa gcacatggag 
aggatggctc cactgcctca ggtggaagga gctatggcta acaaggttct ctaacaggct 
cacaggccca gecagcaatt tcacaaatcc ttgacagaga aagacacaac caaatgaaat 
aaaaattcct tttcaaatct aaaaaaaaaa aaaaatgega aa 



60 
120 
175 



223 



271 



319 



370 



430 
490 
550 
610 
670 
730 
772 



<210> 18 

<211> 546 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 13. .498 

<220> 

<221> sig_peptide 

<222> 13. . 153 

<223> Von Heijne matrix 

score 4.65474235719409 
seq LALSSLRSLLLFA/GM 



<220> 

<221> polyA_site 
<222> 533. .546 



<220> 
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<221> polyA_signal 
<222> 514 . . 519 



<400> 18 

aagcgctgac gc atg cgc ata get aac cgc acc egg ttc age teg cct ttc 51 

Met Arg lie Ala Asn Arg Thr Arg Phe Ser Ser Pro Phe 
-45 -40 -35 



ttg 


gee 


aga 


ggc 


gee 


ggt 


tgg 


act 


cac 


ggg 


egg 


ggc 


atg 


atg 


gtg 


gtg 


99 


Leu 


Ala 


Arg 


Gly Ala 


Gly 


Trp 


Thr 


His 


Gly 


Arg 


Gly 


Met 


Met 


Val 


Val 












-30 










-25 










-20 






ggt 


acg 


ggc 


acc 


teg 


ctg 


gcg 


etc 


tec 


tec 


etc 


egg 


tec 


ctg 


ctg 


etc 


147 


Gly 


Thr 


Gly 


Thr 


Ser 


Leu 


Ala 


Leu 


Ser 


Ser 


Leu 


Arg 


Ser 


Leu 


Leu 


Leu 










-15 










-10 










-5 








ttt 


get 


ggg 


atg 


cag 


atg 


tac 


age 


cgt 


cag 


ctg 


gee 


tec 


acc 


gag 


tgg 


195 


Phe 


Ala 


Gly 


Met 


Gin 


Met 


Tyr 


Ser 


Arg 


Gin 


Leu 


Ala 


Ser 


Thr 


Glu 


Trp 








1 








5 










10 












etc 


acc 


ate 


cag 


ggc 


ggc 


ctg 


ctt 


ggt 


teg 


ggt 


etc 


ttc 


gtg 


ttc 


teg 


243 


Leu 


Thr 


He 


Gin 


Gly 


Gly 


Leu 


Leu 


Gly 


Ser 


Gly 


Leu 


Phe 


Val 


Phe 


Ser 




15 










20 










25 










30 




etc 


act 


gee 


ttc 


aat 


aat 


ctg 


gag 


aat 


ctt 


gtc 


ttt 


ggc 


aaa 


gga 


ttc 


291 


Leu 


Thr 


Ala 


Phe 


Asn 


Asn 


Leu 


Glu 


Asn 


Leu 


Val 


Phe 


Gly 


Lys 


Gly 


Phe 












35 










40 










45 






caa 


gca 


aag 


ate 


ttc 


cct 


gag 


att 


etc 


ctg 


tgc 


etc 


ctg 


ttg 


get 


etc 


339 


Gin 


Ala 


Lys 


He 


Phe 


Pro 


Glu 


He 


Leu 


Leu 


Cys 


Leu 


Leu 


Leu 


Ala 


Leu 










50 










55 










60 








ttt 


gca 


tct 


ggc 


etc 


ate 


cac 


cga 


gtc 


tgt 


gtc 


acc 


acy 


tgc 


ttc 


ate 


387 


Phe 


Ala 


Ser 


Gly 


Leu 


He 


His 


Arg 


Val 


Cys 


Val 


Thr 


Thr 


Cys 


Phe 


lie 








65 










70 










75 










ttc 


tec 


atg 


gtt 


ggt 


ctg 


tac 


tac 


ate 


aac 


aag 


ate 


tec 


tec 


acc 


ctg 


435 


Phe 


Ser 


Met 


Val 


Gly 


Leu 


Tyr 


Tyr 


He 


Asn 


Lys 


He 


Ser 


Ser 


Thr 


Leu 






80 










85 










90 












tac 


cag 


gca 


gca 


get 


cca 


gtc 


etc 


aca 


cca 


gee 


aag 


gtc 


aca 


ggc 


aag 


483 


Tyr 


Gin 


Ala 


Ala 


Ala 


Pro 


Val 


Leu 


Thr 


Pro 


Ala 


Lys 


Val 


Thr 


Gly 


Lys 




95 










100 










105 










110 





age aag aag aga aac tgaccctgaa tgttcaataa agttgattct ttgcaaaaaa 538 
Ser Lys Lys Arg Asn 
115 

aaaaaaaa 546 



<210> 19 

<211> 767 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 132. .401 



<220> 

<221> sig_peptide 

<222> 132. .236 

<223> Von Heijne matrix 

score 8.35677668307059 
seq LGLGLVTLTASWA/AL 



<220> 

<221> polyA_site 
<222> 752. .767 



<220> 

<221> polyA___signal 
<222> 741. .746 



WO 02/083898 
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17 



<400> 19 
cctt cagcgc 
ctttggatcc 
cccgcgtcac 



Met 



tgg gca 
Trp Ala 



tgt gaa get 
Cys Glu Ala 
-20 

ctg acg gec age 
Leu Thr Ala Ser 
-5 

gga gga aac att 
Gly Gly Asn lie 

cag cag cac ttc 
Gin Gin His Phe 
30 

cca ctg ggc age 
Pro Leu Gly Ser 
45 

tgggcatcaa gatctaaccc 
tccgccggtt aegcttaatt 
ttctcttttt gacgatcttg 
tgaggggagg ctgagacagg 
gagatcgtgc ccatcctagg 
aaataaataa ataaataaat 



ccggccgaga gcccggtggc 
teggecaatt ctctgcttcg 
c atg gaa aca gga gee 
Met Glu Thr Gly Ala 
-35 
atg aga aga 
Arg Arg 



tgegactgag agcctggtta ctgccggcac 
cctacacttg gcgtgcggtg taccagcctc 
tct gca tec ate cca gag ctg ate 
Ser Ala Ser lie Pro Glu Leu lie 
-30 -25 
ate tgg age ctg gga ttg ggg ttg gtg act 
lie Trp Ser Leu Gly Leu Gly Leu Val Thr 



-15 

get ctt ttc cac gat 
Ala Leu Phe His Asp 
1 5 
gtg age gat etc age aca gta 
Val Ser Asp Leu Ser Thr Val 



ggc 
Gly 

aga 
Arg 



-10 

ttt gcg 
Phe Ala 



ttt 
Phe 



gtt 
Val 



15 

cag etc 
Gin Leu 

atg tgc 
Met Cys 



20 



etc 
Leu 



ctt gac gtg tgg acc agg aat 
Leu Asp Val Trp Thr Arg Asn 
35 

ttt gtt ttc ttg ttg 
Phe Val Phe Leu Leu 
50 

cacgaaccct 
ggtcagactg 
gggttctgag 
gaacccggga 
agactctatg 



gtt 

Val 

gca 
Ala 
25 
egg 



ctt 

Leu 

10 

caa 

Gin 

aag 



ttgaaccttc 
ttgacatatc 
ggcttcccaa 
agaattgett 
cgacagagag 
aaataaataa 



cca 
Pro 
55 

gttgtcaata 
gtgeeggata 
gaeggtttat 
ggcggaggtt 
tcaaaaataa 



ttc 
Phe Arg Lys 
4 0 

taatcaatgt 



ataaaatgee aaaaaaaaaa 



cctctgggtt 
aacttcttgg 
aaaagtgcac 
geagtgagee 
ataaataaat 
aaaaaa 



60 
120 
170 



218 



266 



314 



362 



411 



471 
531 
591 
651 
711 
767 



<210> 20 

<211> 596 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 41. .478 



<220> 

<221> sig_peptide 

<222> 41. . 115 

<223> Von Heijne matrix 

score 8.18796433724836 
seq ATLLLVLCLQLGA/NK 



<220> 

<221> polyA_site 
<222> 581. .596 



<220> 

<221> polyA_signal 
<222> 560. . 565 



<400> 20 

accacttctc tgggacacat tgccttctgt tttctccagc atg cgc ttg etc cag 55 

Met Arg Leu Leu Gin 
-25 

etc ctg ttc agg gee age cct gee acc ctg etc ctg gtt etc tgc ctg 103 

Leu Leu Phe Arg Ala Ser Pro Ala Thr Leu Leu Leu Val Leu Cys Leu 

-20 -15 -10 -5 

cag ttg ggg gee aac aaa get cag gac aac act egg aag ate ata ata 151 

Gin Leu Gly Ala Asn Lys Ala Gin Asp Asn Thr Arg Lys lie lie lie 

1 5 ^ 10 

aag aat ttt gac att ccc aag tea gta cgt cca aat gac gaa gtc act 199 
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Lys 


Asn 


Phe 
15 


Asp 


He 


Pro 


Lys 


Ser 
20 


Val 


Arg 


Pro 


Asn 


Asp 

25 


Glu 


Val 


Thr 




gca 


gtg 


ctt 


gca 


gtt 


caa 


aca 


gaa 


ttg 


aaa 


gaa 


tgc 


atg 


gtg 


gtt 


aaa 


247 


Ala 


Val 
30 


Leu 


Ala 


Val 


Gin 


Thr 
35 


Glu 


Leu 


Lys 


Glu 


Cys 
40 


Met 


Val 


Val 


Lys 




act 


tac 


etc 


att 


age 


age 


ate 


cct 


eta 


caa 


ggk 


gem 


ttt 


aac 


tat 


arg 


295 


Thr 


Tyr 


Leu 


He 


Ser 


Ser 


He 


Pro 


Leu 


Gin 


Gly 


Ala 


Phe 


Asn 


Tyr 


Xaa 




45 










50 










55 










60 




tat 


act 


gec 


tgc 


eta 


tgt 


gac 


gac 


aat 


cca 


aaa 


acc 


ttc 


tac 


tgg 


gac 


343 


Tyr 


Thr 


Ala 


Cys 


Leu 
65 


Cys 


Asp 


Asp 


Asn 


Pro 
70 


Lys 


Thr 


Phe 


Tyr 


Trp 
75 


Asp 




ttt 


tac 


acc 


aac 


aga 


act 


gtg 


caa 


att 


gca 


gee 


gtc 


gtt 


gat 


gtt 


att 


391 


Phe 


Tyr 


Thr 


Asn 
80 


Arg 


Thr 


Val 


Gin 


He 
85 


Ala 


Ala 


Val 


Val 


Asp 
90 


Val 


He 




egg 


gaa 


tta 


ggc 


ate 


tgc 


cct 


gat 


gat 


get 


get 


gta 


ate 


ccc 


ate 


aaa 


439 


Arg 


Glu 


Leu 
95 


Gly 


He 


Cys 


Pro 


Asp 
100 


Asp 


Ala 


Ala 


Val 


He 
105 


Pro 


He 


Lys 




aac 


aac 


egg 


ttt 


tat 


act 


att 


gaa 


ate 


eta 


aag 


gta 


gaa 


taatggaagc 


488 


Asn 


Asn 
110 


Arg 


Phe 


Tyr 


Thr 


He 
115 


Glu 


He 


Leu 


Lys 


Val 
120 


Glu 











cctgtctgtt tgccacaccc aggtgatttc ctctaaagaa acttggctgg aatttctget 548 
gtggtctata aaataaactt ettaacatge tcaaaaaaaa aaaaaaaa 596 

<210> 21 

<211> 672 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 7. . 366 

<220> 

<221> sig_peptide 

<222> 7 . . 102 

<223> Von Heijne matrix 

score 7.64815044221151 

VT LG I G FFALAS A / LW 

<220> 

<221> polyA_site 
<222> 641 . . 672 

<400> 21 

agtcat atg get tea cct aaa ggt ttt ttt aat tac ttg acc tat ttt 48 
Met Ala Ser Pro Lys Gly Phe Phe Asn Tyr Leu Thr Tyr Phe 











-30 










-25 










-20 






ctt 


get 


get 


ggt 


get 


gtc 


act 


ttg 


gga 


att 


ggt 


ttc 


ttt 


get 


ttg 


gca 


96 


Leu 


Ala 


Ala 


Gly 
-15 


Ala 


Val 


Thr 


Leu 


Gly 
-10 


He 


Gly 


Phe 


Phe 


Ala 
-5 


Leu 


Ala 




tea 


get 


ttg 


tgg 


ttc 


ctg 


att 


tgc 


aaa 


cga 


aga 


gaa 


ata 


ttt 


caa 


aat 


144 


Ser 


Ala 


Leu 

1 


Trp 


Phe 


Leu 


He 
5 


Cys 


Lys 


Arg 


Arg 


Glu 
10 


He 


Phe 


Gin 


Asn 




tec 


aaa 


ttt 


aaa 


gca 


att 


gat 


gag 


aga 


tgc 


agg 


caa 


aga 


cca 


tea 


atg 


192 


Ser 


Lys 


Phe 


Lys 


Ala 


He 


Asp 


Glu 


Arg 


Cys 


Arg 


Gin 


Arg 


Pro 


Ser 


Met 




15 










20 










25 










30 




gcg 


aaa 


att 


aaa 


tct 


cat 


tct 


cag 


tgt 


gtt 


ttt 


att 


tct 


cga 


aat 


ttt 


240 


Ala 


Lys 


He 


Lys 


Ser 
35 


His 


Ser 


Gin 


Cys 


Val 
40 


Phe 


He 


Ser 


Arg 


Asn 
45 


Phe 




cat 


act 


ggg 


aga 


ttc 


caa 


tta 


cag 


caa 


tta 


aag 


ate 


att 


eta 


aag 


atg 


288 


His 


Thr 


Gly 


Arg 
50 


Phe 


Gin 


Leu 


Gin 


Gin 

55 


Leu 


Lys 


He 


He 


Leu 
60 


Lys 


Met 




aac 


ccc 


aac 


ttg 


caa 


caa 


aaa 


ata 


tea 


ttt 


gtg 


ate 


cct 


cag 


aga 


cca 


336 
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Asn Pro Asn Leu Gin Gin Lys lie Ser Phe Val lie Pro Gin Arg Pro 

65 7 0 7 5 

get cca caa caa ate gca gca gtg tta cat taagcttatc aacattacca 
Ala Pro Gin Gin lie Ala Ala Val Leu His 

80 85 
tctgattctt attacageca aagtatagaa gcagctgatg actggttttc tgatgattct 
ctagtgaaaa ggaactctcc aatgecttet cteggggaac ctctaatgga aaaagtattt 
tcatacctgt caaccatttc attagaagag ggtactgaaa gtgtactgaa tgacacttta 
tgaccatcaa aaagatgact acattaaggg aaaatgttca tgaagaaaca cagaggttga 
aatataaaac cttcaacaga aaaaaaaaaa aaaatgcgaa aagctt 



386 



446 
506 
566 
626 
672 



<210> 22 

<211> 438 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 84 . . 317 



<220> 

<221> sig_peptide 

<222> 84 . . 140 

<223> Von Heijne matrix 

score 7.64030745849671 
seq ALVLALMI SMI SA/ DS 



<220> 

<221> polyA_site 
<222> 421. .438 



<220> 

<221> polyA_signal 
<222> 397. .402 



<400> 22 

atagaaaagg acatctcttg agacttcact tcagcttcac tgacttcttg actctcctct 60 
tgagtaaaag gactcagcca act atg aag ttt ttt gtc ttt get tta gtc ttg 113 

Met Lys Phe Phe Val Phe Ala Leu Val Leu 
-15 -10 



get 


etc 


atg 


att 


tec 


atg 


att 


age 


get 


gat 


tea 


cat 


gaa 


aag 


aga 


cat 


161 


Ala 


Leu 


Met 


He 


Ser 
-5 


Met 


He 


Ser 


Ala 


Asp 
1 


Ser 


His 


Glu 


Lys 
5 


Arg 


His 




cat 


ggg 


tat 


aga 


aga 


aaa 


ttc 


cat 


gaa 


aag 


cat 


cat 


tea 


tac 


cat 


ate 


209 


His 


Gly 


Tyr 


Arg 


Arg 


Lys 


Phe 


His 


Glu 


Lys 


His 


His 


Ser 


Tyr 


His 


He 








10 










15 










20 










aca 


eta 


eta 


cca 


ctt 


ttt 


gaa 


gaa 


tea 


tea 


aag 


age 


aat 


gca 


aat 


gaa 


257 


Thr 


Leu 


Leu 


Pro 


Leu 


Phe 


Glu 


Glu 


Ser 


Ser 


Lys 


Ser 


Asn 


Ala 


Asn 


Glu 






25 










30 










35 












aaa 


cac 


tat 


aat 


tta 


ctg 


tat 


act 


ctt 


tgt 


ttc 


agg 


ata 


ctt 


gec 


ttt 


305 


Lys 


His 


Tyr 


Asn 


Leu 


Leu 


Tyr 


Thr 


Leu 


Cys 


Phe 


Arg 


He 


Leu 


Ala 


Phe 




40 










45 










50 










55 





tea att gtc act tgatgatata attgeaattt aaactgttaa gctgtgttca 357 
Ser He Val Thr 

gtactgtttc tgaataatag aaatcacttc tctaaaagca ataaatttca agcacatttt 417 

taaataaaaa aaawawaaaa a 438 



<210> 23 

<211> 617 

<212> DNA 

<213> Homo sapiens 



<220> 
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<221> CDS 
<222> 141. .401 



<220> 

<221> sig_peptide 

<222> 141. .200 

<223> Von Heijne matrix 

score 7.42708462116258 
LLVFLAGFPVLDA/ND 



<220> 

<221> polyA_site 
<222> 603. . 617 



<400> 23 

gatttctccc ggaacctctg ctcagcctgg tgaaccacac aggcccgagt ttcacccagt 60 
ccccactcca cggtgcagct gcggcttatc tctcagccca gcgagatgcc agccttcctg 120 
tcccgggcca gcgctctgac atg cag aag gtg acc ctg ggc ctg ctt gtg ttc 173 

Met Gin Lys Val Thr Leu Gly Leu Leu Val Phe 
-20 -15 -10 



ctg 


gca 


ggc 


ttt 


cct 


gtc 


ctg 


gac 


gec 


aat 


gac 


eta 


gaa 


gat 


aaa 


aac 


221 


Leu 


Ala 


Gly 


Phe 


Pro 
-5 


Val 


Leu 


Asp 


Ala 


Asn 
1 


Asp 


Leu 


Glu 


Asp 
5 


Lys 


Asn 




agt 


cct 


ttc 


tac 


tat 


gac 


tgg 


cac 


age 


etc 


cag 


gtt 


ggc 


ggg 


etc 


ate 


269 


Ser 


Pro 


Phe 


Tyr 


Tyr 


Asp 


Trp 


His 


Ser 


Leu 


Gin 


Val 


Gly 


Gly 


Leu 


He 








10 










15 










20 










tgc 


get 


ggg 


gtt 


ctg 


tgc 


gec 


atg 


ggc 


ate 


ate 


ate 


gtc 


atg 


agt 


gca 


317 


Cys 


Ala 


Gly 


Val 


Leu 


Cys 


Ala 


Met 


Gly 


lie 


lie 


He 


Val 


Met 


Ser 


Ala 






25 










30 










35 












aaa 


tgc 


aaa 


tgc 


aag 


ttt 


ggc 


cag 


aag 


tec 


ggt 


cac 


cat 


cca 


ggg 


gag 


365 


Lys 


Cys 


Lys 


Cys 


Lys 


Phe 


Gly 


Gin 


Lys 


Ser 


Gly 


His 


His 


Pro 


Gly 


Glu 




40 










45 










50 










55 




act 


cca 


cct 


etc 


ate 


acc 


cca 


ggc 


tea 


gec 


caa 


age 


tgatgaggac 




411 


Thr 


Pro 


Pro 


Leu 


lie 


Thr 


Pro 


Gly 


Ser 


Ala 


Gin 


Ser 













60 65 



agaccagctg aaattgggtg gaggacegtt ctctgtcccc aggtcctgtc tetgeacaga 471 

aacttgaact ccaggatgga attcttcctc ctctgctggg actcctttgc atggcagggc 531 

ctcatctcac ctctcgcaag agggtctctt tgttcaattt ttttttatct aaaatgattg 591 

tgcctctgcc caaaaaaaaa aaaaaa 617 



<210> 24 

<211> 673 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 69. .383 



<220> 

<221> sig_peptide 

<222> 69. . 125 

<223> Von Heijne matrix 

score 7.20796835452081 
VS I MLLLVT VS DC / AV 



<400> 24 

acaaggctga gcgggaggaa gegagaggea tctaagcagg cagtgttttg ccttcacccc 60 
aagtgacc atg aga ggt gec acg cga gtc tea ate atg etc etc eta gta 110 
Met Arg Gly Ala Thr Arg Val Ser He Met Leu Leu Leu Val 
-15 -10 
act gtg tct gac tgt get gtg ate aca ggg gec tgt gag cga gat gtc 158 
Thr Val Ser Asp Cys Ala Val He Thr Gly Ala Cys Glu Arg Asp Val 
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-5 15 

cag tgt ggg gca ggc acc tgc tgt gcc ate age 

Gin Cys Gly Ala Gly Thr Cys Cys Ala lie Ser 

15 20 
ctg egg atg tgc acc ccg ctg ggg egg kaa ggc 
Leu Arg Met Cys Thr Pro Leu Gly Arg Xaa Gly 

30 35 
ggc age cac aag ate ccc ttc ttc agg aaa cgc 
Gly Ser His Lys lie Pro Phe Phe Arg Lys Arg 

45 ~ 50 

cct tgc ttg ccc aac ctg ctg tgc tec agg ttc 
Pro Cys Leu Pro Asn Leu Leu Cys Ser Arg Phe 
60 65 70 

cgc tgc tec atg gac ttg aag aac ate aat ttt 
Arg Cys Ser Met Asp Leu Lys Asn lie Asn Phe 

80 "* 85 

ggatacccac catccttttc ctgagcacag cctggatttt 
gctcccatga ctctcccagt ccctasactg actaccctga 
catatgeaca caggcagaca tacctcccat catgacatgg 
tgtcacagct tgaggctgtg gtgtgaaagg tggccagcct 
ctgcactcaa aaaaaaaaaw aaatgcgaaa 



10 



ctg tgg ctt cga ggg 206 
Leu Trp Leu Arg Gly 
25 

gag gag tgc cac ccc 254 
Glu Glu Cys His Pro 
4 0 

aag cac cac acc tgt 302 

Lys His His Thr Cys 

55 

ccg gac ggc agg tac 350 
Pro Asp Gly Arg Tyr 
75 

taggegcttg cctggtctca 403 



tatttctgee atgaaaccca 463 
tctctcttgt etagtacgea 523 
tccccaggct ggcctgagga 583 
ggttctcttc cctgctcagg 643 

673 



<210> 25 

<211> 678 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 69. .383 



<220> 

<221> sig_peptide 

<222> 69. . 125 

<223> Von Heijne matrix 

score 7.20796835452081 
VSIMLLLVTVSDC/AV 



<220> 

<221> polyA_site 
<222> 652. .678 



<400> 25 

acaaggctga gcgggaggaa gegagaggea tctaagcagg cagtgttttg ccttcacccc 60 
aagtgacc atg aga ggt gcc acg cga gtc tea ate atg etc etc eta gta 110 
Met Arg Gly Ala Thr Arg Val Ser lie Met Leu Leu Leu Val 
-15 -10 



act 


gtg 


tct 


gac 


tgt 


get 


gtg 


ate 


aca 


ggg 


gcc 


tgt 


gag 


cga 


gat 


gtc 


158 


Thr 


Val 


Ser 


Asp 


Cys 


Ala 


Val 


He 


Thr 


Gly 


Ala 


Cys 


Glu 


Arg 


Asp 


Val 




-5 










1 








5 










10 






cag 


tgt 


ggg 


gca 


ggc 


acc 


tgc 


tgt 


gcc 


ate 


age 


ctg 


tgg 


ctt 


cga 


ggg 


206 


Gin 


Cys 


Gly 


Ala 


Gly 


Thr 


Cys 


Cys 


Ala 


He 


Ser 


Leu 


Trp 


Leu 


Arg 


Gly 










15 










20 










25 








ctg 


egg 


atg 


tgc 


acc 


ccg 


ctg 


ggg 


egg 


gaa 


ggc 


gag 


gag 


tgc 


cac 


ccc 


254 


Leu 


Arg 


Met 


Cys 


Thr 


Pro 


Leu 


Gly 


Arg 


Glu 


Gly 


Glu 


Glu 


Cys 


His 


Pro 








30 










35 










40 










ggc 


age 


cac 


aag 


ate 


ccc 


ttc 


ttc 


agg 


aaa 


cgc 


aag 


cac 


cac 


acc 


tgt 


302 


Gly 


Ser 


His 


Lys 


He 


Pro 


Phe 


Phe 


Arg 


Lys 


Arg 


Lys 


His 


His 


Thr 


Cys 






45 










50 










55 












cct 


tgc 


ttg 


ccc 


aac 


ctg 


ctg 


tgc 


tec 


agg 


ttc 


ccg 


gac 


ggc 


agg 


tac 


350 


Pro 


Cys 


Leu 


Pro 


Asn 


Leu 


Leu 


Cys 


Ser 


Arg 


Phe 


Pro 


Asp 


Gly 


Arg 


Tyr 




60 










65 










70 










75 




cgc 


tgc 


tec 


atg 


gac 


ttg 


aag 


aac 


ate 


aat 


ttt 


taggegcttg < 


cctggtctca 


403 
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Arg Cys Ser Met Asp Leu Lys Asn lie Asn Phe 

80 85 

ggatacccac catccttttc ctgagcacag cctggatttt tatttctgcc atgaaaccca 463 

gctcccatga ctctcccagt ccctacactg actaccctga tctctcttgt ctagtacgca 523 

catatgcaca caggcagaca tacctcccat catgacatgg tccccaggct ggcctgagga 583 

tgtcacagct tgaggctgtg gtgtgaaagg tggccagcct ggttctcttc cctgctcagg 643 

ctgcactcaa aaaaaaaaaa aaatgcgaaa agctt 678 

<210> 26 

<211> 860 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 129. .434 



<220> 

<221> sig_peptide 

<222> 129. .236 

<223> Von Heijne matrix 

score 7.04760006506154 
seq PFFLITLVGVVVA/VV 



<220> 

<221> polyA_site 
<222> 845. .860 



<220> 

<221> polyA_signal 
<222> 826. . 831 



<400> 26 

tccggggccc gggagccaac cgagggcgtt cctgtcgggg ctgcagcggc gggagggagc 60 
ccagtggagg cgccctcccg aagcgccact gcccatgctg accacccagc cctccggctg 120 
ctgatgtc atg agt aac acc act gtg ccc aat gcc ccc cag gcc aac age 170 
Met Ser Asn Thr Thr Val Pro Asn Ala Pro Gin Ala Asn Ser 
-35 -30 -25 



gac 


tec 


atg 


gtg 


ggc 


tat 


gtg 


ttg 


ggg 


ccc 


ttc 


ttc 


etc 


ate 


acc 


ctg 


218 


Asp 


Ser 


Met 


Val 


Gly 
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Val 


Leu 


Gly 


Pro 


Phe 


Phe 


Leu 


He 


Thr 


Leu 
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-10 










gtc 


ggg 


gtg 


gtg 


gtg 


get 


gtg 


gta 


atg 


tat 


gta 


cag 


aag 


aaa 


aag 


egg 


266 


Val 


Gly 


Val 


Val 


Val 


Ala 


Val 


Val 


Met 


Tyr 


Val 


Gin 


Lys 


Lys 


Lys 


Arg 






-5 










1 








5 










10 




gtg 


gac 


egg 


ctg 


cgc 


cat 


cac 


ctg 


etc 


ccc 


atk 


tac 


age 


tat 


gac 


cca 


314 


Val 


Asp 


Arg 


Leu 


Arg 


His 


His 


Leu 


Leu 


Pro 


Xaa 


Tyr 


Ser 


Tyr 


Asp 


Pro 












15 










20 










25 






get 


gag 


gaa 


ctg 


cat 


gag 


get 


gag 


car 


cag 


ctg 


etc 


tct 


gac 


atg 


gga 


362 


Ala 


Glu 


Glu 


Leu 


His 


Glu 


Ala 


Glu 


Gin 


Gin 


Leu 


Leu 


Ser 


Asp 


Met 


Gly 










30 










35 










40 








gac 


ccc 


aag 


gtg 


gta 


cat 


ggc 


tgg 


cag 


akt 


ggc 


tac 


cag 


cac 


aag 


egg 


410 


Asp 


Pro 


Lys 


Val 


Val 


His 


Gly 


Trp 


Gin 


Xaa 


Gly 


Tyr 


Gin 


His 


Lys 


Arg 








45 










50 










55 










atg 


cca 


ctg 


ctg 


gat 


gtc 


aag 


acg 


tgacctgacc cccttgcccc acccttcaga 


464 


Met 


Pro 


Leu 


Leu 


Asp 


Val 


Lys 


Thr 





















60 65 



gcctggggtc ctggactgcc tggggccctg ccatctgctt cccctgctgt cacctggctc 524 

cccctgctgg gtgctgggtc tccatttctc cctccaccca ccctcagcag catctgcttc 584 

ccatgccctc accatcacct cactgccccc aggecttctg ccctttgtgg gtgttgagct 64 4 

caccgcccac ccacaggcac tcataggaag aggctttcct tctgggatgg eggeggctgg 704 

tagacacctt tgetttctet agccctcctg ggctgggctt gggcacaaat ccccaggcag 764 

gctttggagt tgtttccatg gtgatggggc cagatgtata gtattcagta tatattttgt 824 

aaataaaatg ttttgtggct aaaaaaaaaa aaaaaa 860 
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<210> 27 

<211> 1443 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 30. .440 



<220> 

<221> sig_peptide 

<222> 30. .83 

<223> Von Heijne matrix 

score 7.06387130837154 
seq VLGLVLLSVTVQG/KV 



<220> 

<221> polyA_site 
<222> 1428 . . 1443 



<220> 

<221> polyA_signal 
<222> 1409. . 1414 



<400> 27 



cagcctagca ctctgaccta gcagtcaac 


atg 


aag 


get 


etc 


att 


gtt 


ctg 


ggg 


53 


















Met 


Lys 


Ala 


Leu 
-15 


He 


Val 


Leu 


Gly 




ctt 


gtc 


etc 


ctt 


tct 


gtt 


acg 


gtc 


cag 


ggc 


aag 


gtc 


ttt 


gaa 


agg 


tgt 


101 


Leu 


Val 


Leu 


Leu 


Ser 


Val 


Thr 


Val 


Gin 


Gly 


Lys 


Val 


Phe 


Glu 


Arg 


Cys 




-10 










-5 










1 








5 






gag 


ttg 


gee 


aga 


act 


ctg 


aaa 


aga 


ttg 


gga 


atg 


gat 


ggc 


tac 


agg 


gga 


149 


Glu 


Leu 


Ala 


Arg 


Thr 


Leu 


Lys 


Arg 


Leu 


Gly 


Met 


Asp 


Gly 


Tyr 


Arg 


Gly 










10 










15 










20 








ate 


age 


eta 


gca 


aac 


tgg 


atg 


tgt 


ttg 


gee 


aaa 


tgg 


gag 


agt 


ggt 


tac 


197 


He 


Ser 


Leu 


Ala 


Asn 


Trp 


Met 


Cys 


Leu 


Ala 


Lys 


Trp 


Glu 


Ser 


Gly 


Tyr 








25 










30 










35 










aac 


aca 


cga 


get 


aca 


aac 


tac 


aat 


get 


gga 


gac 


aga 


age 


act 


gat 


tat 


245 


Asn 


Thr 


Arg 


Ala 


Thr 


Asn 


Tyr 


Asn 


Ala 


Gly 


Asp 


Arg 


Ser 


Thr 


Asp 


Tyr 






40 










45 










50 












ggg 


ata 


ttt 


cag 


ate 


aat 


age 


cgc 


tac 


tgg 


tgt 


aat 


gat 


ggc 


aaa 


ace 


293 


Gly 


He 


Phe 


Gin 


He 


Asn 


Ser 


Arg 


Tyr 


Trp 


Cys 


Asn 


Asp 


Gly 


Lys 


Thr 




55 










60 










65 










70 




cca 


gga 


gca 


gtt 


aat 


gee 


tgt 


cat 


tta 


tec 


tgc 


agt 


ggg 


tgg 


cat 


gga 


341 


Pro 


Gly 


Ala 


Val 


Asn 


Ala 


Cys 


His 


Leu 


Ser 


Cys 


Ser 


Gly 


Trp 


His 


Gly 












75 










80 










85 






gaa 


ate 


gtt 


gtc 


aaa 


aca 


gag 


atg 


tec 


gtc 


agt 


atg 


ttc 


aag 


gtt 


gtg 


389 


Glu 


He 


Val 


Val 


Lys 


Thr 


Glu 


Met 


Ser 


Val 


Ser 


Met 


Phe 


Lys 


Val 


Val 










90 










95 










100 








gag 


tgt 


aac 


tec 


aga 


att 


ttc 


ctt 


ctt 


cag 


etc 


att 


ttg 


tct 


etc 


tea 


437 


Glu 


Cys 


Asn 


Ser 


Arg 


He 


Phe 


Leu 


Leu 


Gin 


Leu 


He 


Leu 


Ser 


Leu 


Ser 








105 










110 










115 











cat taagggagta ggaattaagt gaaaggtcac actaccatta ttttcccctt 490 
His 

caaacaaata atatttttac agaagcagga gcaaaatatg gcctttcttc taagagatat 550 

aatgttcact aatgtggtta ttttatatta agcctacaac atttttcagt ttgeaaatag 610 

aactaatact ggtgaaaatt tacctaaaac cttggttatc aaatacatct ccagtacatt 670 

ccgttctttt tttttttttt tttttttttt ttgagacagt ctcgctctgt cgcccaggct 730 

ggagtgcagt ggegcaatet cggctcactg caacctccac ctcccgggtt cacgccattc 790 

tcctgcctca gcctcccgag tagctgggat tacgggcgcc cgccaccacg cccggctaat 850 

tttttgtatt tttagtagag acagggtttc acegttttag ccaggatggt ctcgatctcc 910 

tgaccttgtg atccacccac ctcggcctcc caaagtgctg cgattacagg cgtgagccac 970 
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tgcgcccggc cacattcagt tyttatcaaa gaaataaccc agacttaatc ttgaatgata 1030 

cgattatgcc caatattaag taaaaaatat aagaaaaggt katcttaaat agatcttagg 1090 

caaaatacca gctgatgaag gcatctgatg ccttcatctg ttcagtcatc tccaaaaaca 1150 

gtaaaaataa ccactttttg ttgggcaata tgaaattttt aaaggagtag aataccaaat 1210 

gatagaaaca gactgcctga attgagaatt ttgatttttt aaagtgtgtt tctttctaaa 1270 

ttgctgttcc ttaatttgat taatttaatt catgtattat gattaaatct gakgcakatg 1330 

agcttacaag tattgaaata attactaatt aatcacaaat gtgaagttat gcatgatgta 1390 

aaaaatacaa acattctaat taaaggcttt gcaacacaaa aaaaaaaaaa aaa 1443 



<210> 28 

<211> 622 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 82. .384 



<220> 

<221> sig_peptide 

<222> 82. .168 

<223> Von Heijne matrix 

score 6.9066614461057 
seq VCMCVFSTQGALG/EM 



<220> 

<221> polyA_site 
<222> 607 . . 622 



<220> 

<221> polyA_signal 
<222> 570. .575 



<400> 28 

gctacctcct ggtgtctgtt gcctggtgtg tgggaaaccc 
gtcttctgtg ttgactgttg a atg agg gaa gag aaa 

Met Arg Glu Glu Lys 
-25 

gag aga gag agt gtg tgt gtg tgt atg tgt gtg 
Glu Arg Glu Ser Val Cys Val Cys Met Cys Val 

-15 -10 
get ttg ggg gaa atg get gca cac ttc ata gat 
Ala Leu Gly Glu Met Ala Ala His Phe lie Asp 

1 5 
agt gag ggg aat ggt cac aga ggg aca ctg gat 
Ser Glu Gly Asn Gly His Arg Gly Thr Leu Asp 

15 20 
caa gag tec tac ate ccc tec acc gee gac ccc 
Gin Glu Ser Tyr lie Pro Ser Thr Ala Asp Pro 
30 35 40 

gag ctg ctg cac aag aac ttg ccc gtg acc tec 
Glu Leu Leu His Lys Asn Leu Pro Val Thr Ser 



50 



55 



ccc tct gac etc gcg ate cca gee get gee etc 
Pro Ser Asp Leu Ala lie Pro Ala Ala Ala Leu 

65 70 
ctgccctcgc tgagcttgga ccacgtgcct gagacccgtc 
tcggggtggg gggagctcca caccggcggg gaacgtgctt 
ctctagaaca gagectcagg cacaggaggc acttcacatt 
gaatcagtgt catcctccca acaaaaaaaa aaaaaaaa 



ccacacgctg 
aag ccc ttt 
Lys Pro Phe 

ttt tec act 
Phe Ser Thr 

gaa aag ctg 
Glu Lys Leu 
10 

age ctg agt 
Ser Leu Ser 
25 

acc cag get 
Thr Gin Ala 

cga tec cag 
Arg Ser Gin 

tgacctcccg 



gaccacagaa 
gag aga 
Glu Arg 
-20 
caa gga 
Gin Gly 
-5 

agg ccc 
Arg Pro 

teg gac 
Ser Asp 

ggc cct 
Gly Pro 
45 

ccg ctg 
Pro Leu 
60 

atcccagccg 



tttcctcatt taaaccctgc 
gttcacagtg tgttcttagc 
tattgaataa atagatgaat 



60 
111 



159 



207 



255 



303 



351 



404 



464 
524 
584 
622 



<210> 29 
<211> 658 
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<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 114 . . 497 



<220> 

<221> sig_peptide 

<222> 114 . . 191 

<223> Von Heijne matrix 

score 6.8337397741578 
seq VFLSLFLIQLLIS/FS 



<400> 29 

ctcagcaccc agggcggtgg taggtcacag tctctgggcg ggtctcagtg tccaacactg 60 
tagctggtgc ctgccaggtt cccagtggct ggggtcacca ggtctgaaga gag atg 116 

Met 

tgc tgg ctg egg gca tgg ggc cag ate etc ctg cca gtt ttc etc tec 164 

Cys Trp Leu Arg Ala Trp Gly Gin lie Leu Leu Pro Val Phe Leu Ser 

-25 * " -20 -15 -10 

etc ttt etc ate caa ttg ctt ate age ttc tea gag aat ggt ttt ate 212 

Leu Phe Leu lie Gin Leu Leu lie Ser Phe Ser Glu Asn Gly Phe lie 

-5 15 
cac age ccc agg aac aat cag aaa cca aga gat ggg aat gaa gag gaa 260 
His Ser Pro Arg Asn Asn Gin Lys Pro Arg Asp Gly Asn Glu Glu Glu 

10 15 20 

tgt get gta aag aag agt tgt caa ttg tgc aca gaa gat aag aaa tgt 308 
Cys Ala Val Lys Lys Ser Cys Gin Leu Cys Thr Glu Asp Lys Lys Cys 

25 " 30 ~ 35 

gtt tgg tgt agt gaa gaa aaa gca tgc raa aaa tac tgt ttt ccc tat 356 
Val Trp Cys Ser Glu Glu Lys Ala Cys Xaa Lys Tyr Cys Phe Pro Tyr 
40 45 50 55 

ttc ggt tgt cga ttc agt tct ata tat tgg tta aac tgt aaa gtt gac 404 
Phe Gly Cys Arg Phe Ser Ser lie Tyr Trp Leu Asn Cys Lys Val Asp 

60 65 70 

atg ttt gga ate atg atg ctt eta etc att gca gta tta att aca gga 452 
Met Phe Gly lie Met Met Leu Leu Leu lie Ala Val Leu lie Thr Gly 

75 80 85 

ttc gtt tgg tac tgc tgc gec tat cac ttt tac ctg cag gat ata 497 
Phe Val Trp Tyr Cys Cys Ala Tyr His Phe Tyr Leu Gin Asp lie 

90 95 100 

tgatgaatag ataattgaaa agagatcctc cagaaagagc agaaggaagt ttcttcaatg 557 
gcttccttca ggattttaat catccttaca gectctttga gaatgattga acttccaaat 617 
tccctgaagt taaaatttta aattctatta aacatttttt c 658 



<210> 30 
<211> 619 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 74 . .499 



<220> 

<221> sig_peptide 

<222> 74 . . 145 

<223> Von Heijne matrix 

score 6.56258453552342 
seq LSICLSAVATATG/AE 



<220> 
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<221> polyA_site 
<222> 556. . 619 



<400> 30 

agcgactagg acgcgcctgc gcagaggcgg cagcaccacc ggggttgact ccgggggcgc 60 
ggcgaggaga gac atg agg ctg age tgg ttc egg gtc ctg aca gta ctg 109 
Met Arg Leu Ser Trp Phe Arg Val Leu Thr Val Leu 
-20 -15 



tec 


ate 


tgc 


ctg 


age 


gee 


gtg 


gee 


acg 


gee 


acg 


ggg 


gee 


gag 


ggc 


aaa 


157 


Ser 


He 


Cys 


Leu 


Ser 


Ala 


Val 


Ala 


Thr 


Ala 


Thr 


Gly 


Ala 


Glu 


Gly 


Lys 








-10 










-5 










1 










agg 


aag 


ctg 


cag 


ate 


ggg 


gtc 


aag 


aag 


egg 


gtg 


gac 


cac 


tgt 


ccc 


ate 


205 


Arg 


Lys 


Leu 


Gin 


He 


Gly 


Val 


Lys 


Lys 


Arg 


Val 


Asp 


His 


Cys 


Pro 


He 




5 










10 










15 










20 




aaa 


teg 


cgc 


aaa 


ggg 


gat 


gtc 


ctg 


cac 


atg 


cac 


tac 


acg 


ggt 


aag 


ctg 


253 


Lys 


Ser 


Arg 


Lys 


Gly 


Asp 


Val 


Leu 


His 


Met 


His 


Tyr 


Thr 


Gly 


Lys 


Leu 












25 










30 










35 






gaa 


gat 


ggg 


aca 


gag 


ttt 


gac 


age 


age 


ctg 


ccc 


cag 


aac 


cag 


ccc 


ttt 


301 


Glu 


Asp 


Gly 


Thr 


Glu 


Phe 


Asp 


Ser 


Ser 


Leu 


Pro 


Gin 


Asn 


Gin 


Pro 


Phe 










40 










45 










50 








gtc 


ttc 


tec 


ctt 


ggc 


aca 


ggc 


cag 


gtc 


ate 


aag 


ggc 


tgg 


gac 


cag 


ggg 


349 


Val 


Phe 


Ser 


Leu 


Gly 


Thr 


Gly 


Gin 


Val 


He 


Lys 


Gly 


Trp 


Asp 


Gin 


Gly 








55 










60 










65 










ctg 


ctg 


ggg 


atg 


tgt 


gag 


ggg 


gaa 


aag 


cgc 


aas 


ykg 


gtg 


ate 


cca 


tec 


397 


Leu 


Leu 


Gly 


Met 


Cys 


Glu 


Gly 


Glu 


Lys 


Arg 


Xaa 


Xaa 


Val 


He 


Pro 


Ser 






70 










75 










80 












gag 


eta 


ggg 


tat 


gga 


gas 


egg 


gga 


get 


ccc 


cca 


aag 


att 


cca 


ggc 


ggt 


445 


Glu 


Leu 


Gly 


Tyr 


Gly 


Xaa 


Arg 


Gly 


Ala 


Pro 


Pro 


Lys 


He 


Pro 


Gly 


Gly 




85 










90 










95 










100 




gca 


acc 


ctg 


gtg 


ttc 


gar 


gtg 


gag 


ctg 


etc 


aaa 


ata 


gag 


cga 


cga 


act 


493 


Ala 


Thr 


Leu 


Val 


Phe 


Glu 


Val 


Glu 


Leu 


Leu 


Lys 


He 


Glu 


Arg 


Arg 


Thr 





105 110 115 



gag ctg taaccagact ggggaggggc agggggagag gcccccatca gggaccagac 54 9 
Glu Leu 

tgttccaaaa aaaaaacaaa aaacaaaaac aaacaaaaaa acacttaaaa gecgaaaaaa 609 

aaaaaaaaaa 619 



<210> 31 

<211> 573 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 65. .274 



<220> 

<221> sig_peptide 

<222> 65. . 136 

<223> Von Heijne matrix 

score 6.54728084640774 
ILLTRPACLGSWA/EI 



<220> 

<221> polyA_site 
<222> 556. . 573 



<220> 

<221> polyA_signal 
<222> 540. . 545 



<400> 31 

cactgattct ttggccatcg ctactacaat tttagaattg ctggcaattt tttctgeaca 60 
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ctgg atg get tct tec ace agt gtc tea aca gga caa ate ttg ctg aca 109 
Met Ala Ser Ser Thr Ser Val Ser Thr Gly Gin lie Leu Leu Thr 
-20 -15 -10 

aga cct get tgc ttg ggg tec tgg get gag ate egg tea ccg gtg agg 157 
Arg Pro Ala Cys Leu Gly Ser Trp Ala Glu lie Arg Ser Pro Val Arg 

-5 15 
ace ate tec ate gee age gac ttc cca aca gca egg gtg agt etc tgg 205 
Thr lie Ser lie Ala Ser Asp Phe Pro Thr Ala Arg Val Ser Leu Trp 

10 15 20 

gtg ccg ccc gca cct ggg atg gtt cct att aag ate tec ggc tgt gca 253 
Val Pro Pro Ala Pro Gly Met Val Pro lie Lys lie Ser Gly Cys Ala 

25 ~ 30 35 

aac tgg gee ttc tea ccg gca tagatgatat cacacatcat ggcaagctca 304 
Asn Trp Ala Phe Ser Pro Ala 
40 45 

cagcccccgc caaaggcata gecattgaca geagegatga ctggattctt gacctgggtg 364 
aggtggtccc agtgcttcaa gaacttgctg gagtaacagt cctggaaact caggttctgc 424 
atttccttga tatcagctct agtgaaaagg gcggtagttg gtggtggaac cagaaaegga 484 
cgccggtgct tggagcggtt cttaaattat atttaaaaaa gtaacttttt gtataaataa 544 
aagaaaatgg gacaaaaaaa aaaaaaaaa 573 



<210> 32 

<211> 497 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 51 . . 389 



<220> 

<221> sig_peptide 

<222> 51. .119 

<223> Von Heijne matrix 

score 6.43297166361141 
seq LGALSGAAALGFA/SY 



<220> 

<221> polyA_site 
<222> 482. .497 



<220> 

<221> polyA_signal 
<222> 462. .467 



<400> 32 

ctcttgggtg gectgggegg cgttggtccg gtgcgtcctg ttctacagct atg gec 56 

Met Ala 

ggg cca get gca get ttc cgc cgc ttg ggc gee ttg tec gga get gcg 104 
Gly Pro Ala Ala Ala Phe Arg Arg Leu Gly Ala Leu Ser Gly Ala Ala 

-20 -15 -10 

gee tta ggc ttc get tec tac ggg gcg cac ggc gee maw ttc cca gat 152 
Ala Leu Gly Phe Ala Ser Tyr Gly Ala His Gly Ala Xaa Phe Pro Asp 
-5 1 5 10 



gec 


tac 


ggg 


aag 


gag 


ctg 


ttt 


gac 


aag 


gec 


aac 


aaa 


cac 


cac 


ttc 


tta 


Ala 


Tyr 


Gly 


Lys 


Glu 


Leu 


Phe 


Asp 


Lys 


Ala 


Asn 


Lys 


His 


His 


Phe 


Leu 








15 










20 










25 






cac 


age 


ctg 


gee 


ctg 


tta 


ggg 


gtg 


ccc 


cat 


tgc 


aga 


aag 


cca 


etc 


tgg 


His 


Ser 


Leu 


Ala 


Leu 


Leu 


Gly 


Val 


Pro 


His 


Cys 


Arg 


Lys 


Pro 


Leu 


Trp 






30 










35 










40 








get 


ggg 


tta 


ttg 


eta 


get 


tec 


gga 


acg 


ace 


tta 


ttc 


tgc 


ace 


age 


ttt 


Ala 


Gly 


Leu 


Leu 


Leu 


Ala 


Ser 


Gly 


Thr 


Thr 


Leu 


Phe 


Cys 


Thr 


Ser 


Phe 




45 










50 










55 
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tac tac cag get ctg agt gga gac ccc age ate cag act ttg gee cct 344 

Tyr Tyr Gin Ala Leu Ser Gly Asp Pro Ser lie Gin Thr Leu Ala Pro 

60 65 70 75 

gcg gga ggg ace ctg eta etc ttg ggc tgg ctt gee ttg get ctt 389 

Ala Gly Gly Thr Leu Leu Leu Leu Gly Trp Leu Ala Leu Ala Leu 

80 85 90 

tgagctccct tttgcttaat tactgggttt tetgegcagt tttttttttt aaagagttgg 449 
agtaagaaga ggattaaaaa ggaaaggcaa acaaaaaaaa aaaaaaaa 4 97 



<210> 33 

<211> 484 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 14 . .352 



<220> 

<221> sig_peptide 

<222> 14 . . 82 

<223> Von Heijne matrix 

score 6.43297166361141 
seq LGALSGAAALGFA/SY 



<220> 

<221> polyA_site 
<222> 469. .484 



<220> 

<221> polyA_signal 
<222> 443. .448 



<400> 33 

ctgttctaca get atg gee ggg cca get gca get ttc cgc cgc ttg ggc 4 9 

Met Ala Gly Pro Ala Ala Ala Phe Arg Arg Leu Gly 
-20 -15 



gee 


ttg 


tec 


gga 


get 


gcg 


gee 


tta 


ggc 


ttc 


get 


tec 


tac 


ggg 


gcg 


cac 


97 


Ala 


Leu 


Ser 


Gly 


Ala 


Ala 


Ala 


Leu 


Gly 


Phe 


Ala 


Ser 


Tyr 


Gly 


Ala 


His 






-10 










-5 










1 








5 




ggc 


gee 


ma a 


ttc 


cca 


gat 


gec 


tac 


ggg 


aag 


gag 


ctg 


ttt 


gac 


aag 


gee 


145 


Gly 


Ala 


Xaa 


Phe 


Pro 


Asp 


Ala 


Tyr 


Gly 


Lys 


Glu 


Leu 


Phe 


Asp 


Lys 


Ala 












10 










15 










20 






aac 


aaa 


cac 


cac 


tty 


tta 


cac 


age 


ctg 


gee 


ctg 


tta 


ggg 


gtg 


ccc 


cat 


193 


Asn 


Lys 


His 


His 


Phe 


Leu 


His 


Ser 


Leu 


Ala 


Leu 


Leu 


Gly 


Val 


Pro 


His 










25 










30 










35 








tgc 


aga 


aag 


cca 


etc 


tgg 


get 


ggg 


tta 


ttg 


eta 


get 


tec 


gga 


acg 


acc 


241 


Cys 


Arg 


Lys 


Pro 


Leu 


Trp 


Ala 


Gly 


Leu 


Leu 


Leu 


Ala 


Ser 


Gly 


Thr 


Thr 








40 










45 










50 










tta 


tty 


tgc 


ace 


age 


ttt 


tac 


tac 


cag 


get 


ytg 


agt 


gga 


gac 


ccc 


age 


289 


Leu 


Phe 


Cys 


Thr 


Ser 


Phe 


Tyr 


Tyr 


Gin 


Ala 


Leu 


Ser 


Gly 


Asp 


Pro 


Ser 






55 










60 










65 












ate 


cag 


act 


ttg 


gee 


cct 


gcg 


gga 


ggg 


ace 


ctg 


eta 


etc 


ttg 


ggc 


tgg 


337 


He 


Gin 


Thr 


Leu 


Ala 


Pro 


Ala 


Gly 


Gly 


Thr 


Leu 


Leu 


Leu 


Leu 


Gly 


Trp 




70 










75 










80 










85 





ctt gee ttg get ctt tgagctccct tttgattaat tactgggttt tctgggcagt 392 
Leu Ala Leu Ala Leu 
90 

tttttttttt taaagagttg gagtaagaag aggattaaaa aggaaaggca aataaacttt 452 
ggagtctttg ttcatcaaaa aaaaaaaaaa aa 484 



<210> 34 
<211> 739 
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<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 111. .272 



<220> 

<221> sig_pepticie 

<222> 111. . 161 

<223> Von Heijne matrix 

score 6.48133858229514 
LTFAQLLFATVLG/IA 



<220> 

<221> polyA__site 
<222> 718. .739 



<220> 

<221> polyA_signal 
<222> 688. . 693 



<400> 34 

aattgtagac gtcagtgttg gtgttaacat ttcagatttg aagagatttc ctgggtgtgg 
agtgtgactt tccaaaacca gcttttcctt gagctgtatt tgttgcagca atg ttt 

Met Phe 

agg aga ttg act ttt gca caa ctg ctt ttt gcc act gtc ctt gga att 
Arg Arg Leu Thr Phe Ala Gin Leu Leu Phe Ala Thr Val Leu Gly lie 
-15 ~ -10 -5 ' 1 

get gga gga gta tat att ttt caa cca gta ttt gaa cag tat gcc aaa 
Ala Gly Gly Val Tyr He Phe Gin Pro Val Phe Glu Gin Tyr Ala Lys 

5 10 15 

gat cag aag gaa tta aaa gaa aag atg cag ttg gta caa gaa tea gaa 
Asp Gin Lys Glu Leu Lys Glu Lys Met Gin Leu Val Gin Glu Ser Glu 

20 25 30 

gag aag aaa agt taatactaca tggagttagg cctggcgcag tggctcacgc 
Glu Lys Lys Ser 
35 

ctgtaatccc agcactttgg gaggecgagg cgggtggatc aggtggtcag gagttcaaga 
ccagcctgac caacatggtg aaaccctgtc tctactaaaa atacaaaaat cagccaggct 
tggtggcatg cgcctgtaat cccagctact tgggaggctg aggcaggagc atcacttgaa 
cctgggaggc agaggtggca gtgagtcaag atcacgctgc tgcactccag cctgggtgac 
agagegagae tccatctaaa aaaacaaaaa gcaaaaaaac ccacaaatac ctcatggaga 
tgaactgtaa taattgetta aagttccatt taattatgtt aactctaatc tagcaaaaac 
atagatgtac ttaaaaataa atcatggata atgatttttt aacccaaaaa aaaaaaaaaa 
tgtgaaa 



60 
116 

164 



212 



260 



312 



372 
432 
492 
552 
612 
672 
732 
739 



<210> 35 

<211> 432 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 47. .295 



<220> 

<221> sig_peptide 

<222> 47 . . 91 

<223> Von Heijne matrix 

score 6.32210467320472 
seq LFLSLPVLVVVLS/IV 



WO 02/083898 



PCT/IB01/00914 



30 

<220> 

<221> polyA_site 
<222> 420. . 432 



<220> 

<221> polyA_signal 
<222> 395. . 400 



<400> 35 

aaccaagccc tccagcaagg attcagagtg cccctccggc ctcgcc atg agg etc 55 

Met Arg Leu 
-15 

103 



151 



ttc 


ctg 


teg 


ctt 


ccg 


gtc 


ctg 


gtg 


gtg 


gtt 


ctg 


teg 


ate 


gtc 


ttg 


gaa 


Phe 


Leu 


Ser 


Leu 


Pro 


Val 


Leu 


Val 


Val 


Val 


Leu 


Ser 


He 


Val 


Leu 


Glu 






-10 










-5 










1 








ggc 


cca 


gee 


cca 


gee 


cag 


ggg 


acc 


cca 


gac 


gtc 


tec 


agt 


gee 


ttg 


gat 


Gly 


Pro 


Ala 


Pro 


Ala 


Gin 


Gly 


Thr 


Pro 


Asp 


Val 


Ser 


Ser 


Ala 


Leu 


Asp 


5 










10 










15 










20 


aag 


ctg 


aag 


gag 


ttt 


gga 


aac 


aca 


ctg 


gag 


gac 


aag 


get 


egg 


gaa 


etc 


Lys 


Leu 


Lys 


Glu 


Phe 


Gly 


Asn 


Thr 


Leu 


Glu 


Asp 


Lys 


Ala 


Arg 


Glu 


Leu 










25 










30 










35 




ate 


age 


cgc 


ate 


aaa 


cag 


agt 


gaa 


ctt 


tct 


gee 


aag 


atg 


egg 


gag 


tgg 


lie 


Ser 


Arg 


He 


Lys 


Gin 


Ser 


Glu 


Leu 


Ser 


Ala 


Lys 


Met 


Arg 


Glu 


Trp 








40 










45 










50 






ttt 


tea 


gag 


aca 


ttt 


cag 


aaa 


gtg 


aag 


gat 


aaa 


etc 


aag 


att 


gac 


tea 


Phe 


Ser 


Glu 


Thr 


Phe 


Gin 


Lys 


Val 


Lys 


Asp 


Lys 


Leu 


Lys 


He 


Asp 


Ser 






55 










60 










65 









199 



247 



295 



tgaggacctg aagggtgaca tcccaggagg ggcctctgaa atttcccaca ccccagcgcc 355 
tgtgctgagg actccctcca tgtggcccca ggtgccacca ataaaaatcc tacagaaaac 415 
taawaaaaaa aaaaaaa 432 



<210> 36 

<211> 839 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 66. .359 



<220> 

<221> sig_peptide 

<222> 66. . 194 

<223> Von Heijne matrix 

score 6.07207487250823 
seq FLI YVALLRVSEC/LP 



<220> 

<221> polyA_site 
<222> 824 . . 839 



<220> 

<221> polyA_signal 
<222> 805. .810 



<400> 36 

gggegegagg cggccaccgt ggagagcaga gcgcggcggc tggaagctgc taagtcagag 60 
ccgcg atg ttc egg att gag ggc etc gcg ccg aag ctg gac ccg gag gag 110 
Met Phe Arg He Glu Gly Leu Ala Pro Lys Leu Asp Pro Glu Glu 
-40 -35 -30 

atg aaa egg aag atg cgc gag gat gtg ate tec tec ata egg aac ttt 158 
Met Lys Arg Lys Met Arg Glu Asp Val lie Ser Ser He Arg Asn Phe 
-25 -20 -15 



WO 02/083898 



-10 
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31 



-5 



10 



25 



30 



gag 


tgt 


etc 


ccg 


ggc 


tgt 


206 


Glu 


Cys 


Leu 
1 


Pro 


Gly 


Cys 




ggg 


cac 


ccc 


tta 


act 


eta 


254 


Gly 


His 


Pro 


Leu 


Thr 


Leu 




15 










20 




aac 


ggt 


age 


ggg 


gag 


caa 


302 


Asn 


Gly 


Ser 


Gly 


Glu 


Gin 












35 






cag 


cat 


atg 


aag 


aca 


gga 


350 


Gin 


His 


Met 


Lys 


Thr 


Gly 





40 



45 



50 



cat cac ata tgaatgcacg atatgaagag cctggttaca gtttcgactc 
His His lie 
55 

ctctctgcaa gtgaataggc ccagaaaggt gtaagagact ctttgaatgg 
ctgcttgtta agaacaagtt tggctctggt aactgacctt caaagctaaa 
atttgggaag tatgaaacga tgtctcgtga tctggtgtac ccttatccct 
gcctctgaca atactggtat aattgtaaat aatgtcaaac teegttttet 
aagggagctg tgtctgaaat ggcactgtct tgtcagtcat ttctgtttac 
tgcccagagt gtatttgtga aragtctctt atattatgtt ttgtggaaat 
ccacaatgac atttaagcac aggatcatta ttagtctatg tttttaataa 
tatgaaaaaa aaaaaaaaaa 



acataaaatt 
atataaaact 
gtgacgtttg 
agcaagtatt 
ctttttcttc 
cagcacacaa 
acatatcaat 



399 



459 
519 
579 
639 
699 
759 
819 
839 



<210> 37 

<211> 1114 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 117. .362 



<220> 

<221> sig_peptide 

<222> 117. .245 

<223> Von Heijne matrix 

score 5.65876793443964 
VVS FALI ATLVYA/ LF 



<220> 

<221> polyA_site 
<222> 1083. .1114 



<400> 37 

ataaggggac gtctagtggg ttgcccggga ggggtggcgg gageggtect ggaaataatc 60 
tgtcctctgt cgccgggaac tggcgaggta gttccttcgc ggtggagaga cctgga atg 119 

Met 



gec 


aaa 


tat 


caa 


ggt 


gaa 


gtt 


caa 


agt 


ttg 


aaa 


ctg 


gat 


gat 


gat 


tea 


167 


Ala 


Lys 


Tyr 


Gin 


Gly 


Glu 


Val 


Gin 


Ser 


Leu 


Lys 


Leu 


Asp 


Asp 


Asp 


Ser 








-40 










-35 










-30 










gtt 


ata 


gaa 


gga 


gta 


age 


gac 


caa 


gta 


ctt 


gtg 


gca 


gtt 


gtg 


gtc 


agt 


215 


Val 


He 


Glu 


Gly 


Val 


Ser 


Asp 


Gin 


Val 


Leu 


Val 


Ala 


Val 


Val 


Val 


Ser 






-25 










-20 










-15 












ttc 


get 


ttg 


att 


get 


ace 


ctg 


gta 


tat 


gca 


ctt 


ttc 


aga 


aat 


gta 


cat 


263 


Phe 


Ala 


Leu 


He 


Ala 


Thr 


Leu 


Val 


Tyr 


Ala 


Leu 


Phe 


Arg 


Asn 


Val 


His 




-10 










-5 










1 








5 






caa 


aac 


att 


cac 


cca 


gaa 


aac 


cag 


gag 


eta 


gta 


agg 


gta 


ctt 


cga 


gaa 


311 


Gin 


Asn 


He 


His 


Pro 


Glu 


Asn 


Gin 


Glu 


Leu 


Val 


Arg 


Val 


Leu 


Arg 


Glu 










10 










15 










20 








cag 


ctt 


caa 


aca 


gaa 


cag 


gat 


gca 


cct 


get 


gac 


teg 


aca 


gca 


gtt 


eta 


359 


Gin 


Leu 


Gin 


Thr 


Glu 


Gin 


Asp 


Ala 


Pro 


Ala 


Asp 


Ser 


Thr 


Ala 


Val 


Leu 





25 30 35 



WO 02/083898 PCT/IB01/00914 



32 

cac tgacatgtac tgtcccatct gcctgcacca agcctccttc ccggtggaga 412 
His 

ccaactgtgg acatcttttt tgtggtgcct gcattattgc ttactggcga tatggttcat 472 

ggcttggggc aatcagttgt ccaatctgta gacaaacgag acatggccac attgcattgt 532 

ccagaacagc ttagaccatg acagttagca tcgaagccac ctgaggaggg aggcagtaac 592 

cttactccta acagtatttg gtgaagatga tcagtctcag gatgttctga gattgcatca 652 

ggatattaat gattataacc ggagattctc agggcaaccc agatctgtaa gtaatgctaa 712 

agcatgttca aagttagagg aagacacatt tcttctcttt tgtaaagtga ggtttaccaa 772 

caagtattct ttgactatga gaaatcttgg ccaggcacag tagctaacgc ctataatcct 832 

agcactttgg gaggccaagg caggtggatc acttgagccc aggagtttga gaccagcctt 892 

ggaaacatga tgaaacccca tctctagaaa aaacaccaaa aaattggaca agagtgttgg 952 

cacatgcctg tagtccctgc ttcttgggag gctgaaatgg gaggatcacc tgagcccagg 1012 

aggttgaggc tatagtgagc catgatcgca ctattgcact cccacctggg tggcagtgag 1072 

acccttcctc aaaaaacaag aaaagaaaaa aaaaaaaaaa aa 1114 



<210> 38 

<211> 1112 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 117 . . 362 

<220> 

<221> sig_peptide 

<222> 117. .245 

<223> Von Heijne matrix 

score 5.65876793443964 

seq VVSFALIATLVYA/LF 

<220> 

<221> polyA_site 
<222> 1097. . 1112 

<220> 

<221> polyA_signal 
<222> 1089. . 1094 

<400> 38 

ataaggggac gtctagtggg ttgcccggga ggggtggcgg gagcggtcct ggaaataatc 60 
tgtcctctgt cgccgggaac tggcgaggta gttccttcgc ggtggagaga cctgga atg 119 

Met 



gcc 


aaa 


tat 


caa 


ggt 


gaa 


gtt 


caa 


agt 


ttg 


aaa 


ctg 


gat 


gat 


gat 


tea 


167 


Ala 


Lys 


Tyr 


Gin 


Gly 


Glu 


Val 


Gin 


Ser 


Leu 


Lys 


Leu 


Asp 


Asp 


Asp 


Ser 








-40 










-35 










-30 










gtt 


ata 


gaa 


gga 


gta 


age 


gac 


caa 


gta 


ctt 


gtg 


gca 


gtt 


gtg 


gtc 


agt 


215 


Val 


lie 


Glu 


Gly 


Val 


Ser 


Asp 


Gin 


Val 


Leu 


Val 


Ala 


Val 


Val 


Val 


Ser 






-25 










-20 










-15 












ttc 


get 


ttg 


att 


get 


acc 


ctg 


gta 


tat 


gca 


ctt 


ttc 


aga 


aat 


gta 


cat 


263 


Phe 


Ala 


Leu 


He 


Ala 


Thr 


Leu 


Val 


Tyr 


Ala 


Leu 


Phe 


Arg 


Asn 


Val 


His 




-10 










-5 










1 








5 






caa 


aac 


att 


cac 


cca 


gaa 


aac 


cag 


gag 


eta 


gta 


agg 


gta 


ctt 


cga 


gaa 


311 


Gin 


Asn 


lie 


His 


Pro 


Glu 


Asn 


Gin 


Glu 


Leu 


Val 


Arg 


Val 


Leu 


Arg 


Glu 










10 










15 










20 








cag 


ctt 


caa 


aca 


gaa 


cag 


gat 


gca 


cct 


get 


gac 


teg 


aca 


gca 


gtt 


eta 


359 


Gin 


Leu 


Gin 


Thr 


Glu 


Gin 


Asp 


Ala 


Pro 


Ala 


Asp 


Ser 


Thr 


Ala 


Val 


Leu 





25 30 35 



cac tgacatgtac tgtcccatct gcctgcacca agcctccttc ccggtggaga 412 
His 

ccaactgtgg acatcttttt tgtggtgcct gcattattgc ttactggcga tatggttcat 472 

ggcttggggc aatcagttgt ccaatctgta gacaaacgag acatggccac attgcattgt 532 

ccagaacagc ttagaccatg acagttagca tcgaagccac ctgaggaggg aggcagtaac 592 



WO 02/083898 



PCT/IB01/00914 



cttactccta 
ggatattaat 
agcatgttca 
caagtattct 
gcactttggg 
gaaacatgat 
acatgcctgt 
ggttgaggct 
cccttcctca 



acagtatttg 
gattataacc 
aagttagagg 
ttgactatga 
aggccaaggc 
gaaaccccat 
agtccctgct 
atagtgagcc 
aaaaacaaga 



gtgaagatga 
ggagattctc 
aagacacatt 
gaaatcttgg 
aggtggatca 
ctctagaaaa 
tcttgggagg 
atgatcgcac 
aaagaaaaaa 



33 

tcagtctcag 
agggcaaccc 
tcttctcttt 
ccaggcacag 
cttgagccca 
aacaccaaaa 
ctgaaatggg 
tattgcactc 
aaaaaaaaaa 



gatgttctga 
agatctgtaa 
tgtaaagtga 
tagtaacgcc 
ggagtttgag 
aattggacaa 
aggatcacct 
ccacctgggt 



gattgcat ca 
gtaatgctaa 
ggtttaccaa 
tataatccta 
accagccttg 
gagtgttggc 
gagcccagga 
ggcagtgaga 



652 
712 
772 
832 
892 
952 
1012 
1072 
1112 



<210> 39 

<211> 1112 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 117. .362 



<220> 

<221> sig_peptide 

<222> 117. .245 

<223> Von Heijne matrix 

score 5.65876793443964 
seq VVSFALIATLVYA/LF 



<220> 

<221> polyA_site 
<222> 1097 . . 1112 



<220> 

<221> polyA__signal 
<222> 1089. . 1094 



<400> 39 

ataaggggac gtctagtggg ttgcccggga ggggtggcgg gagcggtcct ggaaataatc 60 
tgtcctctgt cgccgggaac tggcgaggta gttccttcgc ggtggagaga cctgga atg 119 

Met 



gcc 


aaa 


tat 


caa 


ggt 


gaa 


gtt 


caa 


agt 


ttg 


aaa 


ctg 


gat 


gat 


gat 


tea 


167 


Ala 


Lys 


Tyr 


Gin 


Gly 


Glu 


Val 


Gin 


Ser 


Leu 


Lys 


Leu 


Asp 


Asp 


Asp 


Ser 








-40 










-35 










-30 










gtt 


ata 


gaa 


gga 


gta 


age 


gac 


caa 


gta 


ctt 


gtg 


gca 


gtt 


gtg 


gtc 


agt 


215 


Val 


He 


Glu 


Gly 


Val 


Ser 


Asp 


Gin 


Val 


Leu 


Val 


Ala 


Val 


Val 


Val 


Ser 






-25 










-20 










-15 












ttc 


get 


ttg 


att 


get 


ace 


ctg 


gta 


tat 


gca 


ctt 


ttc 


aga 


aat 


gta 


cat 


263 


Phe 


Ala 


Leu 


He 


Ala 


Thr 


Leu 


Val 


Tyr 


Ala 


Leu 


Phe 


Arg 


Asn 


Val 


His 




-10 










-5 










1 








5 






caa 


aac 


att 


cac 


cca 


gaa 


aac 


cag 


gag 


eta 


gta 


agg 


gta 


ctt 


cga 


gaa 


311 


Gin 


Asn 


He 


His 


Pro 


Glu 


Asn 


Gin 


Glu 


Leu 


Val 


Arg 


Val 


Leu 


Arg 


Glu 










10 










15 










20 








cag 


ctt 


caa 


aca 


gaa 


cag 


gat 


gca 


cct 


gst 


gac 


teg 


aca 


gca 


gtt 


eta 


359 


Gin 


Leu 


Gin 


Thr 


Glu 


Gin 


Asp 


Ala 


Pro 


Xaa 


Asp 


Ser 


Thr 


Ala 


Val 


Leu 





25 30 35 

cac tgacatgtac tgtcccatct gcctgcacca agcctccttc ccggtggaga 412 
His 

ccaactgtgg acatcttttt tgtggtgcct geattattge ttactggcga tatggttcat 472 

ggcttggggc aatcagttgt ccaatctgta gacaaacgag acatggccac attgeattgt 532 

ccagaacagc ttagaccatg acagttagca tcgaagccac ctgaggaggg aggcagtaac 592 

cttactccta acagtatttg gtgaagatga tcagtctcag gatgttctga gattgeatea 652 

ggatattaat gattataacc ggagattctc agggcaaccc agatctgtaa gtaatgctaa 712 

agcatgttca aagttagagg aagacacatt tcttctcttt tgtaaagtga ggtttaccaa 772 

caagtattct ttgactatga gaaatcttgg ccaggcacag tagtaacgcc tataatccta 832 

gcactttggg aggccaaggc aggtggatca cttgagccca ggagtttgag accagccttg 892 



WO 02/083898 PCT/IB01/00914 

34 

gaaacatgat gaaaccccat ctctagaaaa aacaccaaaa aattggacaa gagtgttggc 952 

acatgcctgt agtccctgct tcttgggagg ctgaaatggg aggatcacct gagcccagga 1012 

ggttgaggct atagtgagcc atgatcgcac tattgcactc ccacctgggt ggcagtgaga 1072 
cccttcctca aaaaacaaga aaagaaaaaa aaaaaaaaaa 1112 



<210> 40 

<211> 1112 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 117. .362 



<220> 

<221> sig_peptide 

<222> 117. .245 

<223> Von Heijne matrix 

score 5.65876793443964 
seq VVSFALIATLVYA/LF 



<220> 

<221> polyA_site 
<222> 1097 . . 1112 



<220> 

<221> polyA_signal 
<222> 1089. . 1094 



<400> 40 

ataaggggac gtctagtggg ttgcccggga ggggtggcgg gagcggtcct ggaaataatc 60 

tgtcctctgt cgccgggaac tggcgaggta gttccttcgc ggtggagaga cctgga atg 119 

Met 

gcc aaa tat caa ggt gaa gtt caa agt ttg aaa ctg gat gat gat tea 167 

Ala Lys Tyr Gin Gly Glu Val Gin Ser Leu Lys Leu Asp Asp Asp Ser 

-40 -35 -30 

gtt ata gaa gga gta age gac caa gta ctt gtg gca gtt gtg gtc agt 215 

Val lie Glu Gly Val Ser Asp Gin Val Leu Val Ala Val Val Val Ser 

-25 -20 -15 

ttc get ttg att get acc ctg gta tat gca ctt ttc aga aat gta cat 263 

Phe Ala Leu lie Ala Thr Leu Val Tyr Ala Leu Phe Arg Asn Val His 
-10 -5 15 

caa aac att cac cca gaa aac cag gag eta gta agg gta ctt cga gaa 311 

Gin Asn lie His Pro Glu Asn Gin Glu Leu Val Arg Val Leu Arg Glu 

10 15 20 

cag ctt caa aca gaa cag gat gca cct get gac teg aca gca gtt eta 359 

Gin Leu Gin Thr Glu Gin Asp Ala Pro Ala Asp Ser Thr Ala Val Leu 

25 30 35 

cac tgacatgtac tgtcccatct gcctgcacca agcctccttc ccggtggaga 412 
His 

ccaactgtgg acatcttttt tgtggtgcct geattattge ttactggcga tatggttcat 472 

ggcttggggc aatcagttgt ccaatctgta gacaaacgag acatggccac attgeattgt 532 

ccagaacagc ttagaccatg acagttagca tcgaagccac ctgaggaggg aggcagtaac 592 

cttactccta acagtatttg gtgaagatga tcagtctcag gatgttctga gattgeatea 652 

ggatattaat gattataacc ggagattctc agggcaaccc agatctgtaa gtaatgctaa 712 

agcatgttca aagttagagg aagacacatt tcttctcttt tgtaaagtga ggtttaccaa 772 

caagtattct ttgactatga gaaatcttgg ccaggcacag tagtaacgee tataatccta 832 

gcactttggg aggecaagge aggtggatca cttgagccca ggagtttgag accagccttg 892 

gaaacatgat gaaaccccat ctctagaaaa aacaccaaaa aattggacaa gagtgttggc 952 

acatgcctgt agtccctgct tcttgggagg ctgaaatggg aggatcacct gagcccagga 1012 

ggttgaggct atagtgagcc atgatcgcac tattgcactc ccacctgggt ggcagtgaga 1072 

cccttcctca aaaaacaaga aaagaaaaaa aaaaaaaaaa 1112 



WO 02/083898 

<210> 41 
<211> 508 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 17 . .313 

<220> 

<221> sig__peptide 
<222> 17 . . 100 
<223> Von Heijne matrix 

score 5.67441405983879 

seq LGYLVLSEGAVLA/SX 

<220> 

<221> polyA_site 
<222> 493. .508 

<220> 

<221> polyA_signal 
<222> 468 . . 473 

<400> 41 

agcaccgaag actgcg atg act tct gca ctg acc cag ggg ctg gag cga ate 52 

Met Thr Ser Ala Leu Thr Gin Gly Leu Glu Arg lie 
-25 -20 



cca 


gac 


cag 


etc 


ggc 


tac 


ctg 


gta 


ctg 


agt 


gaa 


ggt 


gca 


gtg 


ctg 


gcg 


100 


Pro 


Asp 


Gin 


Leu 


Gly 


Tyr 


Leu 


Val 


Leu 


Ser 


Glu 


Gly 


Ala 


Val 


Leu 


Ala 






-15 










-10 










-5 












tea 


tyt 


ggg 


gac 


ctg 


gag 


aat 


gat 


gag 


cag 


gca 


gee 


agt 


gee 


ate 


tyt 


148 


Ser 


Xaa 


Gly 


Asp 


Leu 


Glu 


Asn 


Asp 


Glu 


Gin 


Ala 


Ala 


Ser 


Ala 


He 


Xaa 




1 








5 










10 










15 






gag 


ctg 


gtc 


age 


aca 


gee 


tgc 


ggt 


ttc 


egg 


ctg 


cac 


cgc 


ggc 


atg 


aat 


196 


Glu 


Leu 


Val 


Ser 


Thr 


Ala 


Cys 


Gly 


Phe 


Arg 


Leu 


His 


Arg 


Gly 


Met 


Asn 










20 










25 










30 








gtg 


ccc 


ttc 


aag 


cgc 


ctg 


tct 


gtg 


gtc 


ttt 


gga 


gaa 


cac 


aca 


ctg 


ctg 


244 


Val 


Pro 


Phe 


Lys 


Arg 


Leu 


Ser 


Val 


Val 


Phe 


Gly 


Glu 


His 


Thr 


Leu 


Leu 








35 










40 










45 










gtg 


acg 


gtg 


tea 


gga 


cag 


agg 


gtg 


ttt 


gtg 


gtg 


aag 


agg 


cag 


aac 


cga 


292 


Val 


Thr 


Val 


Ser 


Gly 


Gin 


Arg 


Val 


Phe 


Val 


Val 


Lys 


Arg 


Gin 


Asn 


Arg 






50 










55 










60 












ggt 


egg 


gag 


ccc 


att 


gat 


gtc 


tgagcctgcc ggagggegag ggtcggagaa 


343 


Gly 


Arg 


Glu 


Pro 


He 


Asp 


Val 























65 70 

gcggattggg ccttctgctc gcccacctcc cacccctacc tggaegggee caggcttggg 403 

gactctgagc tgtgttaagg agaacaaggg caaggagacc tccctttgtg ctccctcact 463 

ccctaataaa catgagtctg atgttctcca aaaaaaaaaa aaaaa 508 

<210> 42 

<211> 623 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 69. . 365 

<220> 

<221> sig_peptide 

<222> 69. . 152 

<223> Von Heijne matrix 



PCT/IB01/00914 

35 



WO 02/083898 

score 5.54338579743238 
seq LGYLVLSEGAVLA/SS 

<220> 

<221> polyA_site 
<222> 608. . 623 

<220> 

<221> polyA_signal 
<222> 582. .587 



PCT/IB01/00914 



36 



<400> 42 

acgtgaccgg ggcctgaagc cggaagctac ctatctggta gggagctccc ccagcaccga 
agactgcg atg act tct gca ctg acc cag ggg ctg gag cga ate cca gac 
Met Thr Ser Ala Leu Thr Gin Gly Leu Glu Arg lie Pro Asp 
-25 ^ -20 " -15 

cag etc ggc tac ctg gta ctg agt gaa ggt gca gtg ctg gcg tea tct 
Gin Leu Gly Tyr Leu Val Leu Ser Glu Gly Ala Val Leu Ala Ser Ser 

-10 -5 1 

ggg gac ctg gag aat gat gag cag gca gec agt gec ate tyt gag ctg 
Gly Asp Leu Glu Asn Asp Glu Gin Ala Ala Ser Ala lie Xaa Glu Leu 

5 10 15 

gtc age aca gee tgc ggt ttc egg ctg cac cgc ggc atg aat gtg ccc 
Val Ser Thr Ala Cys Gly Phe Arg Leu His Arg Gly Met Asn Val Pro 

20 25 ~ 30 

ttc aag cgc ctg tct gtg gtc ttt gga gaa cac aca ctg ctg gtg acg 
Phe Lys Arg Leu Ser Val Val Phe Gly Glu His Thr Leu Leu Val Thr 
35 40 45 50 

gtg tea gga cag agg gtg ttt gtg gtg aag agg cag aac cga ggt egg 
Val Ser Gly Gin Arg Val Phe Val Val Lys Arg Gin Asn Arg Gly Arg 

55 60 65 

gag ccc att gat gtc tgagcctgcc ggagggegag ggtcggagaa gcggattggg 
Glu Pro lie Asp Val 
70 

tcctgggcct ctgtgatgag gcaggcacac ctgtcggtct tggcttgctg ctagaactag 
ggccttytgs tcgcccacct cccaccccta cctggacggg cccaggcttg gggactytga 
gctgtgttaa ggagaacaag ggcaaggaga cctccctttg tgctccctca ctccctaata 
aacatgagtc tgatgttctc cgaaaaaaaa aaaaaaaa 



60 
110 



158 



206 



254 



302 



350 



405 



465 
525 
585 
623 



<210> 43 

<211> 1166 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 2. .559 

<220> 

<221> sig_peptide 
<222> 2. . 46 

<223> Von Heijne matrix 

score 5.34456047740956 
seq VMAASMARGGVSA/RV 

<220> 

<221> polyA_site 
<222> 1151. . 1166 



<400> 43 

g atg cat gtc atg gee gee tec atg gee egg gga ggc gtg agt gee agg 

Met His Val Met Ala Ala Ser Met Ala Arg Gly Gly Val Ser Ala Arg 
-15 -10 -5 1 



49 



WO 02/083898 
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gtt 


eta 


ctg 


cag 


get 


gcc 


agg 


ggc 


ace 


tgk 


tgg 


aac 


aga 


cct 


ggg 


ggc 


97 


Val 


Leu 


Leu 


Gin 
5 


Ala 


Ala 


Arg 


Gly 


Thr 
10 


Xaa 


Trp 


Asn 


Arg 


Pro 
15 


Gly 


Gly 




act 


tec 


ggg 


teg 


ggg 


gag 


ggg 


gtg 


gcg 


ctg 


ggg 


aca 


ace 


aga 


aag 


ttt 


145 


Thr 


Ser 


Gly 
20 


Ser 


Gly 


Glu 


Gly 


Val 
25 


Ala 


Leu 


Gly 


Thr 


Thr 

30 


Arg 


Lys 


Phe 




caa 


gcg 


aca 


kgc 


teg 


cgc 


ccg 


get 


gga 


gag 


gag 


gac 


gcg 


ggc 


ggc 


ccg 


193 


Gin 


Ala 
35 


Thr 


Xaa 


Ser 


Arg 


Pro 
40 


Ala 


Gly 


Glu 


Glu 


Asp 
45 


Ala 


Gly 


Gly 


Pro 




gag 


egg 


ccc 


ggg 


gac 


gtg 


gtg 


aac 


gtg 


gtg 


ttc 


gta 


gac 


cgc 


tea 


ggc 


241 


Glu 


Arg 


Pro 


Gly 


Asp 


Val 


Val 


Asn 


Val 


Val 


Phe 


Val 


Asp 


Arg 


Ser 


Gly 




50 










55 










60 










65 




cag 


egg 


ate 


cca 


gtg 


agt 


ggc 


aga 


gtc 


ggg 


gac 


aat 


gtt 


ctt 


cac 


ctg 


289 


Gin 


Arg 


He 


Pro 


Val 
70 


Ser 


Gly 


Arg 


Val 


Gly 
75 


Asp 


Asn 


Val 


Leu 


His 
80 


Leu 




gcc 


cag 


cgc 


cac 


ggg 


gtg 


gac 


ctg 


gaa 


ggg 


gcc 


tgt 


gaa 


gcc 


tec 


ctg 


337 


Ala 


Gin 


Arg 


His 
85 


Gly 


Val 


Asp 


Leu 


Glu 
90 


Gly 


Ala 


Cys 


Glu 


Ala 
95 


Ser 


Leu 




gcc 


tgc 


tec 


ace 


tgc 


cat 


gtg 


tat 


gtg 


agt 


gaa 


gac 


cac 


ctg 


gat 


etc 


385 


Ala 


Cys 


Ser 
100 


Thr 


Cys 


His 


Val 


Tyr 
105 


Val 


Ser 


Glu 


Asp 


His 
110 


Leu 


Asp 


Leu 




ctg 


cct 


cct 


ccc 


gag 


gag 


agg 


gaa 


gac 


gac 


atg 


eta 


gac 


atg 


gcc 


ccc 


433 


Leu 


Pro 
115 


Pro 


Pro 


Glu 


Glu 


Arg 
120 


Glu 


Asp 


Asp 


Met 


Leu 
125 


Asp 


Met 


Ala 


Pro 




etc 


etc 


cag 


gag 


aac 


teg 


egg 


ctg 


ggc 


tgc 


cag 


att 


gtg 


ctg 


aca 


ccg 


481 


Leu 


Leu 


Gin 


Glu 


Asn 


Ser 


Arg 


Leu 


Gly 


Cys 


Gin 


He 


Val 


Leu 


Thr 


Pro 




130 










135 










140 










145 




gag 


ctg 


gaa 


rga 


gcg 


gaa 


ttc 


ace 


ctg 


ccc 


aag 


ate 


ace 


agg 


aac 


ttc 


529 


Glu 


Leu 


Glu 


Xaa 


Ala 
150 


Glu 


Phe 


Thr 


Leu 


Pro 
155 


Lys 


He 


Thr 


Arg 


Asn 
160 


Phe 




tac 


gtg 


gat 


ggc 


cat 


gt c 


ccc 


aag 


ccc 


cac 


tgacatgaac acctggacca 


579 


Tyr 


Val 


Asp 


Gly 
165 


His 


Val 


Pro 


Lys 


Pro 
170 


His 

















ttccacattg ccatggcccc agggeccaga ttgagggaat agccaggtgc cagccctgcc 639 

cagagtgegg acaggcccgg gagagaegtg gaagcccctg tgaaggacaa cacccctgct 699 

tgggagagag tcccatgtcc aggctctggt ggggacaggg cccctagtgg ggtggccttc 759 

cccaggcccc tgagaatcag ggtttgagta ggagtggact catattggag ctgcaataaa 819 

tcgataacac aggccccagc atgttgagtg tccttggggg acagatctgg ggtctacagg 879 

tggctcacac ctgtaatcct agcactttgg gaagecaaga tgggaggatc acttgaggee 939 

aggagcttaa gaccatcctg ggcaacatgg cgagacctcg tctctataaa aacagtaaaa 999 

attagcgtgg agtggtggta agtgcctgta gtcccagcta ctegggagae tgaggtggga 1059 

ggatcactta geccagggga tttaggctgc agtggagcgg tgatcaggee attccactcc 1119 

agcctgggtg acagagtgag gccctgtctc taaaaaaaaa aaaaaaa 1166 



<210> 44 

<211> 649 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 18. .386 



<220> 

<221> sig_peptide 

<222> 18 . . 161 

<223> Von Heijne matrix 

score 5.26064226788865 
seq LLMLLLFLSELQY/YL 



<220> 

<221> polyA_site 
<222> 634 . . 649 



WO 02/083898 
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38 



<220> 

<221> polyA_signal 
<222> 611. . 616 



<400> 44 
ctttccggcc 



gcc 
Ala 

gcc 
Ala 

tec 

Ser 

-5 

gtg 

Val 



tac ccc 
Tyr Pro 
-35 
acc gtg 
Thr Val 
-20 

gag ctg 
Glu Leu 

gac aag 
Asp Lys 



ttt ccg cac 
Phe Pro His 
30 

gcc gga gaa 
Ala Gly Glu 
45 

eta gat aaa 
Leu Asp Lys 
60 

taatgcagga 
agagcaaagt 
atggagatga 
cmctgctgtg 
ttcagtcaaa 



ggtcccc atg gag gcg ctg ggg aag ctg aag cag ttc gat 
Met Glu Ala Leu Gly Lys Leu Lys Gin Phe Asp 
-45 -40 
aag act ttg gag gac ttc egg gtc aag acc tgc ggg ggc 
Lys Thr Leu Glu Asp Phe Arg Val Lys Thr Cys Gly Gly 

-30 -25 
acc att gtc agt ggc ctt etc atg ctg eta ctg ttc ctg 
Thr lie Val Ser Gly Leu Leu Met Leu Leu Leu Phe Leu 

-15 -10 
cag tat tac etc acc acg gag gtg cat cet gag etc tac 
Gin Tyr Tyr Leu Thr Thr Glu Val His Pro Glu Leu Tyr 

1 5 10 

teg egg gga gat aaa ctg aag ate aac ate gat gta ctt 
Ser Arg Gly Asp Lys Leu Lys lie Asn lie Asp Val Leu 
15 20 25 

atg cet tgt gcc tat ctg agt att gat gcc atg gat gtg 
Met Pro Cys Ala Tyr Leu Ser lie Asp Ala Met Asp Val 

35 40 
cag car ctg gat gtg gaa cac aac ctg ttc aag caa cga 
Gin Gin Leu Asp Val Glu His Asn Leu Phe Lys Gin Arg 

50 55 
gat ggc ate ccc gtg age tea gag get gag egg cat gmt 
Asp Gly lie Pro Val Ser Ser Glu Ala Glu Arg His Xaa 
65 70 75 

caccagtgga accaacataa tttgetatag gactggatgt ggagtggagg 
catggttgtt gattccaacg ctttcagcct gagcaaatga atgacttgag 
ctggaggaga aggttttttt ggtgggaggg aggagtggct attggatatc 
taacaaagta ctccaaaagt tagtagtttg atacaataaa aatgtgtttt 
aaaaaaaaaa aaa 



50 



98 



146 



194 



242 



290 



338 



386 



446 
506 
566 
626 
649 



<210> 45 

<211> 836 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 87. .317 



<220> 

<221> sig_peptide 

<222> 87 . . 173 

<223> Von Heijne matrix 

score 5.19818357425197 
seq LLLKLQRL PQAE P / VE 



<220> 

<221> polyA_site 
<222> 821. .836 



<220> 

<221> polyA_signal 
<222> 805. .810 



<400> 45 

gagtgtgggg ccagccgtgg aggctccagg tgttctctct gccccagcag agcccggcag 
gagccccaac aggaagecag cgegge atg get gcc acc gac ttc gtg cag gag 



60 
113 



WO 02/083898 
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Met Ala Ala Thr Asp Phe Val Gin Glu 
-25 



atg 


cgc 


gee 


gtg 


ggc 


gag 


agg 


ctg 


ctg 


etc 


aag 


ctg 


cag 


aga 


ctg 


ccc 


Met 


Arg 


Ala 


Val 


Gly 


Glu 


Arg 


Leu 


Leu 


Leu 


Lys 


Leu 


Gin 


Arg 


Leu 


Pro 


-20 










-15 










-10 










-5 


cag 


get 


gag 


ccc 


gtg 


gag 


ate 


gtg 


gec 


ttc 


tea 


gtc 


ate 


ate 


ctt 


ttc 


Gin 


Ala 


Glu 


Pro 


Val 


Glu 


He 


Val 


Ala 


Phe 


Ser 


Val 


He 


He 


Leu 


Phe 










1 








5 










10 






aca 


get 


act 


gtt 


ctg 


ctg 


ttg 


ctg 


ctg 


ata 


gee 


tgc 


age 


tgc 


tgc 


tgc 


Thr 


Ala 


Thr 


Val 


Leu 


Leu 


Leu 


Leu 


Leu 


He 


Ala 


Cys 


Ser 


Cys 


Cys 


Cys 






15 










20 










25 








act 


cac 


tgc 


tgc 


tgc 


cct 


gag 


egg 


aga 


ggc 


agg 


aag 


gtc 


cag 


gtg 


cag 


Thr 


His 


Cys 


Cys 


Cys 


Pro 


Glu 


Arg 


Arg 


Gly 


Arg 


Lys 


Val 


Gin 


Val 


Gin 




30 










35 










40 











cca 
Pro 



cca 
Pro 



tgaeggaegg gcgatggctg aggagaagct ggagaggaga 



ccg aca 
Pro Thr 
45 

tggccaatgc catgacacag gccatcagcc tggccctgca 
ggctcccctg gccccagctc tggcccagcc caggtacctg 
taccaaggaa acaagggctg gtataggtgc aaacctctca 
cagctgtttc aaagactggg tcaactgcct 
aaaacaagga agtaggggtc cccatacctt 
tggcccttgg ggctctgctg atacatgeca 
tgeaataget tctgcatccg agctcccgcc 
gtatttgeca gtttccaagt aaaagctttt 



gctggggagt 
actttttaac 
tgtgggcaat 
aggggctttg 
ttctagtcca 



gcccttaccc ctcaagacca 
gacactgaca acttgagccc 
tetgecagtg gacactgggt 
gggcttcttc gcctacctgc 
gatggagaac agtccccacc 
aagaggagca aggcaatcag 
agagegtgag catgtcagta 
gtgaaaaaaa aaaaaaaaa 



161 



209 



257 



305 



357 



417 
477 
537 
597 
657 
717 
777 
836 



<210> 46 

<211> 482 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 17. .337 



<220> 

<221> sig_peptide 

<222> 17 . . 97 

<223> Von Heijne matrix 

score 4.95181310301952 
seq VLSTLSLVQFSPS/GR 



<220> 

<221> polyA_site 
<222> 467 . .482 



<220> 

<221> polyA__signal 
<222> 442. .447 



<400> 46 

actttagctg ctgcag atg ctg ttt gaa gaa get ctg ccc etc age tgc tct 52 

Met Leu Phe Glu Glu Ala Leu Pro Leu Ser Cys Ser 
-25 -20 
gat cct gtg ctt age act ctt age ctg gtg cag ttc age ccc agt gga 100 
Asp Pro Val Leu Ser Thr Leu Ser Leu Val Gin Phe Ser Pro Ser Gly 
-15 -10 -5 1 

agg ace cag gac ctg etc tct cca ggg gtg gag aac ctg teg gtg ctg 148 
Arg Thr Gin Asp Leu Leu Ser Pro Gly Val Glu Asn Leu Ser Val Leu 

5 10 15 

gac gtg tec cct ctg ggc ttg gee tgc tgt ctg etc act etc ace atg 196 
Asp Val Ser Pro Leu Gly Leu Ala Cys Cys Leu Leu Thr Leu Thr Met 
20 25 30 



WO 02/083898 PCT/IB01/00914 

40 

244 



292 



tec 


tgc 


cca 


ggg 


cct 


gac 


cct 


cct 


gag 


ggg 


ccc 


ggg 


ace 


cag 


cgt 


gtg 


Ser 


Cys 


Pro 


Gly 


Pro 


Asp 


Pro 


Pro 


Glu 


Gly 


Pro 


Gly 


Thr 


Gin 


Arg 


Val 




35 










40 










45 










tgg 


caa 


ggg 


get 


eta 


egg 


ate 


eta 


cag 


etc 


cca 


gga 


gee 


cca 


gat 


ggg 


Trp 


Gin 


Gly 


Ala 


Leu 


Arg 


He 


Leu 


Gin 


Leu 


Pro 


Gly Ala 


Pro 


Asp 


Gly 


50 










55 










60 










65 


gtt 


tea 


cca 


tat 


cag 


cca 


gtc 


tgg 


tct 


cga 


act 


cct 


gac 


etc 


aag 




Val 


Ser 


Pro 


Tyr 


Gin 


Pro 


Val 


Trp 


Ser 


Arg 


Thr 


Pro 


Asp 


Leu 


Lys 












70 










75 










80 





337 



tgattcgeca gcctcgtcct cccaaagtgc tgggattaca gatgtgagcc accatgccca 397 
tggagaatgt attacttttg taatttaaaa aataatagta ataaaataaa gatctttaca 457 
atgtttcaga aaaaaaaaaa aaaaa 482 



<210> 47 

<211> 806 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 136. .471 



<220> 

<221> sig_peptide 

<222> 136. .252 

<223> Von Heijne matrix 

score 4.89253405263876 
seq IILFSAIVGFIYG/YV 



<220> 

<221> polyA_site 
<222> 792. .806 



<220> 

<221> polyA_signal 
<222> 775. .780 



<400> 47 

ttcgctctcc egggcttaga aggcccggct actgacgcgc agtgccagac cttacccctc 60 
aeggtcctta agtctcggtc gccctcgcct cgcagcctgc cacccgcgct cagctgcccg 120 
cctcctcagc cagee atg ctg gag cat ctg age teg ctg ccc acg cag atg 171 
Met Leu Glu His Leu Ser Ser Leu Pro Thr Gin Met 
-35 -30 



gat 


tac 


aag 


ggc 


cag 


aag 


eta 


get 


gaa 


cag 


atg 


ttt 


cag 


gga 


att 


att 


219 


Asp 


Tyr 


Lys 


Gly 


Gin 


Lys 


Leu 


Ala 


Glu 


Gin 


Met 


Phe 


Gin 


Gly 


He 


lie 








-25 










-20 










-15 










ctt 


ttt 


tct 


gca 


ata 


gtt 


gga 


ttt 


ate 


tac 


ggg 


tac 


gtg 


get 


gaa 


cag 


267 


Leu 


Phe 


Ser 


Ala 


He 


Val 


Gly 


Phe 


He 


Tyr 


Gly 


Tyr 


Val 


Ala 


Glu 


Gin 






-10 










-5 










1 








5 




ttc 


ggg 


tgg 


act 


gtc 


tat 


ata 


gtt 


atg 


gee 


gga 


ttt 


get 


ttt 


tea 


tgt 


315 


Phe 


Gly 


Trp 


Thr 


Val 


Tyr 


He 


Val 


Met 


Ala 


Gly 


Phe 


Ala 


Phe 


Ser 


Cys 












10 










15 










20 






ttg 


ctg 


aca 


ctt 


cct 


cca 


tgg 


ccc 


ate 


tat 


cgc 


egg 


cat 


cct 


etc 


aag 


363 


Leu 


Leu 


Thr 


Leu 


Pro 


Pro 


Trp 


Pro 


He 


Tyr 


Arg 


Arg 


His 


Pro 


Leu 


Lys 










25 










30 










35 








tgg 


tta 


cct 


gtt 


caa 


gca 


cag 


acg 


aca 


aga 


aac 


cag 


ggg 


aaa 


gaa 


aaa 


411 


Trp 


Leu 


Pro 


Val 


Gin 


Ala 


Gin 


Thr 


Thr 


Arg 


Asn 


Gin 


Gly 


Lys 


Glu 


Lys 








40 










45 










50 










tta 


aga 


ggc 


atg 


eta 


aaa 


ata 


att 


gag 


gtt 


ttc 


atg 


att 


cag 


cac 


ctg 


459 


Leu 


Arg 


Gly 


Met 


Leu 


Lys 


He 


He 


Glu 


Val 


Phe 


Met 


He 


Gin 


His 


Leu 





55 60 65 

ctt ttg ttt ctg tgagatgagc taaattgett tcatacccca gataagagct 511 
Leu Leu Phe Leu 
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70 

aaaaccacct aatgctctta tggcacagct gtgtatagat ttagttctct ttatacttca 571 

tttctagccc agttgggttt tgatttatat aagtagttta gaccttctct tcataatctt 631 

gctctgagat ggggaacaga acacacaagt atgaagtttc tttcaggtgt aaataatgaa 691 

aaataaatgc ctcataaatg atagtacaat gtaactatca aagttttata attcattatg 751 

agttaaccat tttaatgttt ccaattaaac ctcatagtgc aaaaaaaaaa aaaaa 806 



<210> 48 

<211> 582 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 148 . .360 

<220> 

<221> sig_peptide 

<222> 148. .216 

<223> Von Heijne matrix 

score 5.918755252201 

SALMLLPCRPVLT / S V 

<220> 

<221> polyA_site 
<222> 566. .582 

<220> 

<221> polyA_signal 
<222> 544 . .549 

<400> 48 

aacgtcatct aggagcaccg agcagcttgg ctaaaagtaa gggtgtcgtg ctgatggccc 60 
tgtgcgcact gacccgckct sygctctctg aacctggcgc ccccgaccgt cgccgcccct 120 
gccccgagtc tgttccccgc cgcccag atg atg aac aat ggc etc etc caa cag 174 

Met Met Asn Asn Gly Leu Leu Gin Gin 
-20 -15 



ccc 


tct 


gec 


ttg 


atg 


ttg 


etc 


ccc 


tgc 


cgc 


cca 


gtt 


ctt 


act 


tct 


gtg 


222 


Pro 


Ser 


Ala 


Leu 


Met 
-10 


Leu 


Leu 


Pro 


Cys 


Arg 
-5 


Pro 


Val 


Leu 


Thr 


Ser 
1 


Val 




gec 


ctt 


aat 


gec 


aac 


ttt 


gtg 


tec 


tgg 


aag 


agt 


cgt 


ace 


aag 


tac 


ace 


270 


Ala 


Leu 


Asn 
5 


Ala 


Asn 


Phe 


Val 


Ser 
10 


Trp 


Lys 


Ser 


Arg 


Thr 
15 


Lys 


Tyr 


Thr 




att 


aca 


cca 


gtg 


aag 


atg 


agg 


aag 


tct 


ggg 


ggc 


cga 


gac 


cac 


aca 


ggt 


318 


He 


Thr 

20 


Pro 


Val 


Lys 


Met 


Arg 
25 


Lys 


Ser 


Gly 


Gly 


Arg 
30 


Asp 


His 


Thr 


Gly 




get 


gga 


aac 


gtg 


cgt 


age 


aac 


agt 


agg 


ccg 


agt 


ate 


caa 


cgt 






360 


Ala 


Gly 


Asn 


Val 


Arg 


Ser 


Asn 


Ser 


Arg 


Pro 


Ser 


He 


Gin 


Arg 








35 










40 










45 















tgatcataac aaaegggtea ttggcaaggc aggtegcaac cgctggctgg geaagaggee 420 

taacagtggg cggtggcacc gcaagggggg ctgggctggc cgaaagattc ggccactacc 480 

ccccatgaag agttacgtga agctgccttc tgcttctgcc caaagctgat atccctgtac 540 

tctaataaaa tgcccccccc ccctcaaaaa aaaaaaaaaa aa 582 



<210> 49 

<211> 583 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 149. .361 
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<220> 

<221> sig_peptide 

<222> 149. .217 

<223> Von Heijne matrix 

score 5.918755252201 

S ALMLLPCRPVLT / S V 



<220> 

<221> polyA_site 
<222> 567 . . 583 



<220> 

<221> polyA_signal 
<222> 545. . 550 



<400> 49 

aacgtcatct aggagcaccg agcagcttgg ctaaaagtaa gggtgtcgtg ctgatggccc 60 
tgtgcgcact gacccgcgct ctgcstctct gaacctggcg cccccgaccg tcgccgcccc 120 
tgccccgagt ctgttccccg ccgcccag atg atg aac aat ggc etc etc caa 172 

Met Met Asn Asn Gly Leu Leu Gin 
-20 

tgc cgc cca gtt ctt act tct 220 
Cys Arg Pro Val Leu Thr Ser 

-5 1 
tgg aag agt cgt acc aag tac 268 
Trp Lys Ser Arg Thr Lys Tyr 
15 

tct ggg ggc cga gac cac aca 316 
Ser Gly Gly Arg Asp His Thr 
30 

agg ccg agt ate caa cgt 361 
Arg Pro Ser lie Gin Arg 
45 

tgatcataac aaaegggtea ttggcaaggc aggtegcaac cgctggctgg geaagaggee 421 
taacagtggg cggtggcacc gcaagggggg ctgggctggc cgaaagattc ggccactacc 481 
ccccatgaag agttacgtga agctgccttc tgcttctgcc caaagctgat atccctgtac 541 
tctaataaaa tgcccccccc ccctcaaaaa aaaaaaaaaa aa 583 



cag 


ccc 


tct 


gec 


ttg 


atg 


ttg 


etc 


ccc 


Gin 


Pro 


Ser 


Ala 


Leu 


Met 


Leu 


Leu 


Pro 


-15 










-10 








gtg 


gec 


ctt 


aat 


gee 


aac 


ttt 


gtg 


tec 


Val 


Ala 


Leu 


Asn 


Ala 


Asn 


Phe 


Val 


Ser 








5 










10 


acc 


att 


aca 


cca 


gtg 


aag 


atg 


agg 


aag 


Thr 


He 


Thr 


Pro 


Val 


Lys 


Met 


Arg 


Lys 






20 










25 




ggt 


get 


gga 


aac 


gtg 


cgt 


age 


aac 


agt 


Gly 


Ala 


Gly 


Asn 


Val 


Arg 


Ser 


Asn 


Ser 




35 










40 







<210> 50 

<211> 746 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 83. .358 



<220> 

<221> sig_peptide 

<222> 83. . 127 

<223> Von Heijne matrix 

score 4.68058603039206 
VLAGSLLGPTSRS /AA 



<400> 50 

ctttttttcc tttgttccag ggctggagcg gctctgggct ccggaatcgc ccgcagccgg 60 
tactgeggga cccactgcgg at atg get gtc ttg get gga tec ctg ttg ggc 112 

Met Ala Val Leu Ala Gly Ser Leu Leu Gly 

-15 -10 
ccc acg agt agg teg gca gcg ttg ctg ggt ggc agg tgg etc cag ccc 160 
Pro Thr Ser Arg Ser Ala Ala Leu Leu Gly Gly Arg Trp Leu Gin Pro 
-5 1 5 10 

egg gec tgg ctg ggg ttc cca gac gee tgg ggc etc ccc acc ccg cag 208 
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Arg Ala Trp Leu Gly Phe Pro Asp Ala Trp Gly Leu Pro 

15 ~ 20 

cag gcc egg ggc aag get cgc ggg aat gag tat cag ccg 

Gin Ala Arg Gly Lys Ala Arg Gly Asn Glu Tyr Gin Pro 

30 35 40 

aaa cgc aag aac aag cac ggc tgg gtc egg cgc ctg age 

Lys Arg Lys Asn Lys His Gly Trp Val Arg Arg Leu Ser 

4 5 50 55 

ggc gts cag gtc ate ctt cgc cga atg etc aag ggc cgc 

Gly Val Gin Val lie Leu Arg Arg Met Leu Lys Gly Arg 
60 65 70 
age cat tgaggatege gaegcagteg gcgggaccct catggaagca 
Ser His 

ctcggacctt gcctggcgct atttttgeag ggagctgggg agcaggaa 

gagtgetetc catattgtgg ggttgaagtc tggatgggag ettgecaa 

gctttttaat taggaagcat ttcgaacctg cgcaacagac caaagaac 

atccgtgtac ccagtaccct gactaccgac tacctacaac ccgtccct 

gttcttttga agctgatctc aggcategga ttatttcttc tgtaaata 
tctctccaag atgagagctc attaaaagac aattacaa 



Thr Pro Gin 

25 

age aac ate 
Ser Asn lie 

acg cca gcc 
Thr Pro Ala 

aag teg ctg 
Lys Ser Leu 
7 5 

tcgccctcgc 

eg cctcggacct 
gt ccctttttag 
ag tacaaagaac 
gc cccatcctga 
tt tcagaatgta 



256 



304 



352 



408 

468 
528 
588 
648 
708 
746 



<210> 51 

<211> 731 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 73. . 348 



<220> 

<221> sig_peptide 

<222> 73. .117 

<223> Von Heijne matrix 

score 4.68058603039206 
seq VLAGSLLGPTSRS/AA 



<400> 51 

gttgttccag ggctggagcg gctctgggct ccggaatcgc ccgcagccgg tactgeggga 
cccactgcag at atg get gtc ttg get gga tec ctg ttg ggc ccc acg agt 
Met Ala Val Leu Ala Gly Ser Leu Leu Gly Pro Thr Ser 
-15 -10 -5 

gcg ttg ctg ggt ggc agg tgg etc car ccc egg gcc tgg 
Ala Leu Leu Gly Gly Arg Trp Leu Gin Pro Arg Ala Trp 

5 10 
cca gac gcc tgg ggc etc ccc ace ccg cag cag gcc egg 
Pro Asp Ala Trp Gly Leu Pro Thr Pro Gin Gin Ala Arg 

20 25 30 

cgc ggg aat gag tat cag ccg age aat ate aaa cgc aag 
Arg Gly Asn Glu Tyr Gin Pro Ser Asn lie Lys Arg Lys 

35 40 45 

ggc tgg gtc egg cgc ctg arc acg ccg gcc ggc gtg cag 
Gly Trp Val Arg Arg Leu Xaa Thr Pro Ala Gly Val Gin 
50 55 60 

gtc ate ctt cgc cga atg etc aag ggc cgc aag teg ctg age cat 
Val lie Leu Arg Arg Met Leu Lys Gly Arg Lys Ser Leu Ser His 

65 70 75 

tgaggatege gaegcagteg gcgggaccct catggaagca tcgccctcgc ctcggacctt 
gcctggcgct atttttgeag ggagctgggg agcaggaacg cctcggacct gagtgetetc 
catattgtgg ggttgaagtc tggatgggag ettgecaagt ccctttttag gctttttaat 
taggaagcat ttcgaacctg cgcaacagac caaagaacag tacaaagaac atccgtgtac 
ccagtaccct gactaccgac tacctacaac ccgtccctgc cccatcctga gttcttttga 
agctgatctc aggcategga ttatttcttc tgtaaatatt tcagaatgta tctctccaag 
atgagagctc attaaaagat aat 



agg 


teg 


gca 


Arg 


Ser 


Ala 






1 


ctg 


ggg 


ttc 


Leu 


Gly 


Phe 


15 






ggc 


aag 


get 


Gly 


Lys 


Ala 


aac 


aag 


cac 


Asn 


Lys 


His 



60 
111 



159 



207 



255 



303 



348 



408 
468 
528 
588 
648 
708 
731 
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<210> 52 

<211> 773 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 22. .297 



<220> 

<221> sig_peptide 

<222> 22. . 66 

<223> Von Heijne matrix 

score 4.68058603039206 
seq VLAGSLLGPTSRS/AA 



<220> 

<221> polyA_site 
<222> 758. .773 



<220> 

<221> polyA_signal 
<222> 740. .745 



<400> 52 

actgcgggac ccactgcgga t atg get gtc ttg get gga tec ctg ttg ggc 

Met Ala Val Leu Ala Gly Ser Leu Leu Gly 
-15 -10 
ccc acg agt agg teg gca gcg ttg ctg ggt ggc agg tgg etc cag ccc 
Pro Thr Ser Arg Ser Ala Ala Leu Leu Gly Gly Arg Trp Leu Gin Pro 
-5 1 5 10 

egg gee tgg ctg ggg ttc cca gac gee tgg ggc etc ccc ace ccg cag 
Arg Ala Trp Leu Gly Phe Pro Asp Ala Trp Gly Leu Pro Thr Pro Gin 

15 20 25 

cag gee egg ggc aag get cgc ggg aat gar tat cag ccg age aat ate 
Gin Ala Arg Gly Lys Ala Arg Gly Asn Glu Tyr Gin Pro Ser Asn lie 

30 ~ 35 40 

aaa cgc aag aac aag cac ggc tgg gtc egg cgc ctg arc acg ccg gee 
Lys Arg Lys Asn Lys His Gly Trp Val Arg Arg Leu Xaa Thr Pro Ala 

4 5 50 55 

ggc gtg mrg gtc ate ctt cgc cga atg etc aag ggc cgc aag teg ctg 
Gly Val Xaa Val lie Leu Arg Arg Met Leu Lys Gly Arg Lys Ser Leu 
60 65 70 75 

age cat tgaggatege gaegcagteg gcgggaccct catggaagca tcgccctcgc 
Ser His 

ctcggacctt gcctggcgct atttttgeag ggagctgggg agcaggaacg cctcggacct 
gagtgetetc catattgtgg gtttgaagtc tggatgggag ettgecaagt ccctttttag 
gctttttaat taggaagcat ttcgaacctg cgcaacagac caaagaacag tacaaagaac 
atccgtgtac ccagtaccct gactaccgac tacctacaac ccgtccctgc cccatcctga 
gttcttttga agctgatctc aggcategga ttatttcttc tgtaaatatt tcagaatgta 
tctctccaag atgagagctc attaaaagat aattacaaag cttatcacat ccaaaagaat 
tatcaataat tttgaaatat tattaaacgt gtaataaatg ttcaaagttc aaaaaaaaaa 
aaaaaa 



51 



99 



147 



195 



243 



291 



347 

407 
467 
527 
587 
647 
707 
767 
773 



<210> 53 

<211> 740 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 80. .355 



WO 02/083898 



PCT/IB01/00914 



45 



<220> 

<221> sig_peptide 

<222> 80. . 124 

<223> Von Heijne matrix 

score 6.4806443024348 
seq VLAGSLLXPTSRS/AA 



<400> 53 

ataatccttt gttccagggc tggagcggct ctgggctccg gaatcgcccg cagccggtac 
tgcgggaccc actgcggat atg get gtc ttg get gga tec ctg ttg grc ccc 

Met Ala Val Leu Ala Gly Ser Leu Leu Xaa Pro 
-15 -10 -5 

acg agt agg teg gca gcg ttg ctg ggt rgc agg tgg etc cag ccc egg 
Thr Ser Arg Ser Ala Ala Leu Leu Gly Xaa Arg Trp Leu Gin Pro Arg 

15 10 
gee tgg ctg ggg ttc cca gac gee tgg ggc etc ccc ace ccg cag cag 
Ala Trp Leu Gly Phe Pro Asp Ala Trp Gly Leu Pro Thr Pro Gin Gin 

15 20 25 

gee egg ggc aag get cgc ggg aat gag tat cag ccg age aac ate aaa 
Ala Arg Gly Lys Ala Arg Gly Asn Glu Tyr Gin Pro Ser Asn lie Lys 

30 35 40 

cgc aag aac aag cac ggc tgg gtc egg cgc ctg arc acg ccg gee ggc 
Arg Lys Asn Lys His Gly Trp Val Arg Arg Leu Xaa Thr Pro Ala Gly 
4 5 50 55 60 

kwr smg ktc ate ctt cgc cga atg etc aag ggc cgc aag teg ctg age 
Xaa Xaa Xaa lie Leu Arg Arg Met Leu Lys Gly Arg Lys Ser Leu Ser 

65 70 75 

cat tgaggatege gaegcagtsg seggggaace ctcatggaag catcgccctc 
His 

gcctcggacc ttgcctggcg etatttttge agggagctgg ggagcaggaa cgcctcggac 
ctgagtgctc tccatattgt ggggttgaag tctggatggg agettgecaa gtcccttttt 
aggcttttta attaggaagc atttcgaacc tgcgcaacag accaaagaac agtacaaaga 
acatccgtgt acccagtacc ctgactaccg actacctaca acccgtccct gccccatcct 
gagttctttt gaagctgatc tcaggcatcg gattatttct tctgtaaata tttcagaatg 
tatctctcca agatgagagc tcattaaaag acaat 



60 
112 



160 



208 



256 



304 



352 



405 

465 
525 
585 
645 
705 
740 



<210> 54 

<211> 638 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 137. .397 



<220> 

<221> sig_peptide 

<222> 137. .253 

<223> Von Heijne matrix 

score 4.61343345943784 
seq LQGARVLLTGANA/GV 



<220> 

<221> polyA__site 
<222> 623. . 638 



<220> 

<221> polyA_signal 
<222> 616. . 621 



<400> 54 

ggggtgcggt cggggacccg geaggaggeg gecgagaaga gaggacegtg ggggcgttcg 



60 
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cgtggctccc agcccgggac cccacccccg ctggacagtg ggggaaactg aggcctgagc 120 
gggcccacac aggacc atg aag gtg ctt etc etc aca ggg ctg ggg gec ctg 172 

Met Lys Val Leu Leu Leu Thr Gly Leu Gly Ala Leu 
-35 -30 

ttc ttc gee tat tat tgg gat gac aac ttc gac cca gec age etc cag 220 

Phe Phe Ala Tyr Tyr Trp Asp Asp Asn Phe Asp Pro Ala Ser Leu Gin 

-25 " * -20 -15 

gga gcg cga gtg ctg ctg aca ggg gee aac get ggt gtt ggt gag gag 268 

Gly Ala Arg Val Leu Leu Thr Gly Ala Asn Ala Gly Val Gly Glu Glu 

-10 -5 15 

ctg gee tat cac tac gcg cgt ctg ggc tec cac ctg gtg etc act gee 316 

Leu Ala Tyr His Tyr Ala Arg Leu Gly Ser His Leu Val Leu Thr Ala 

10 15 20 

cac act gag get etc ctg cag aag gca egg tgg etc acg ctt gta gtc 364 

His Thr Glu Ala Leu Leu Gin Lys Ala Arg Trp Leu Thr Leu Val Val 

25 30 35 

age act ttg gga ggc cga gga aag tgg ate acc tgaggtcagg agttcaagac 417 
Ser Thr Leu Gly Gly Arg Gly Lys Trp lie Thr 

40 45 

cagcctggcc aacatggega aacctggtct ctactaagaa gacaaaaatt aggecaggea 477 

tggtggctca tgcctgtaat cctagcactt tgggaggeca aggeggatgg atcacttgag 537 

gtcaggagtt caaaaccagc ctggccaaca tggtgaaacc ctgtctctac taaaaataca 597 

aaaaataaaa ataaaataaa taaacaaaaa aaaaaaaaaa a 638 



<210> 55 

<211> 577 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 18. .392 



<220> 

<221> sig_peptide 

<222> 18. . 62 

<223> Von Heijne matrix 

score 4.60228634566274 
seq LLTHNLLSSHVRG/VG 



<220> 

<221> polyA_site 
<222> 562. . 577 



<220> 

<221> polyA__signal 
<222> 545. .550 



<400> 55 

tttttttgtg eggegae atg aaa ctg ctt acc cac aat ctg ctg age teg 50 

Met Lys Leu Leu Thr His Asn Leu Leu Ser Ser 













-15 










-10 










-5 




cat 


gtg 


egg 


ggg 


gtg 


ggg 


tec 


cgt 


ggc 


ttc 


ccc 


ctg 


cgc 


etc 


cag 


gec 


98 


His 


Val 


Arg 


Gly 


Val 


Gly 


Ser 


Arg 


Gly 


Phe 


Pro 


Leu 


Arg 


Leu 


Gin 


Ala 












1 








5 










10 








acc 


gag 


gtc 


cgt 


ate 


tgc 


cct 


gtg 


gaa 


ttc 


aac 


ccc 


aac 


ttc 


gtg 


gcg 


146 


Thr 


Glu 


Val 


Arg 


He 


Cys 


Pro 


Val 


Glu 


Phe 


Asn 


Pro 


Asn 


Phe 


Val 


Ala 








15 










20 










25 










cgt 


atr 


ata 


cct 


aaa 


gtg 


gag 


tgg 


teg 


gcg 


ttc 


ctg 


gag 


gcg 


gec 


gat 


194 


Arg 


Xaa 


He 


Pro 


Lys 


Val 


Glu 


Trp 


Ser 


Ala 


Phe 


Leu 


Glu 


Ala 


Ala 


Asp 






30 










35 










40 












aac 


ttg 


cgt 


ctg 


ate 


cag 


gtg 


ccg 


aaa 


ggg 


ccg 


gtt 


gag 


gga 


tat 


gag 


242 


Asn 


Leu 


Arg 


Leu 


He 


Gin 


Val 


Pro 


Lys 


Gly 


Pro 


Val 


Glu 


Gly 


Tyr 


Glu 
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45 50 55 60 

gag aat gag gag ttt ctg agg acc atg cac cac ctg ctg ctg gag gtg 290 

Glu Asn Glu Glu Phe Leu Arg Thr Met His His Leu Leu Leu Glu Val 

65 70 75 

gaa gtg ata gag ggc acc ctg cag tgc ccg gaa tct gga cgt atg ttc 338 
Glu Val He Glu Gly Thr Leu Gin Cys Pro Glu Ser Gly Arg Met Phe 

80 85 90 

ccc ate age cgc ggg ate ccm amc atg ctg ctg agt gaa gag gaa act 386 
Pro He Ser Arg Gly He Pro Xaa Met Leu Leu Ser Glu Glu Glu Thr 

95 100 105 

gag agt tgattgtgsc aggegecagt ttttcttgtt atgactgttt atttttgttg 442 
Glu Ser 
110 

atctataccc tgtttccgaa ttctgccgtg tgtatcccca acccttgacc caatgacacc 502 
aaacacagtg tttttgagct eggtattata tatttttttc tcattaaagg tttaaaacca 562 
aaaaaaaaaa aaaaa 577 



<210> 56 

<211> 521 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 82. .360 



<220> 

<221> sig_peptide 

<222> 82. . 147 

<223> Von Heijne matrix 

score 4.53239781724629 
seq FFLLITIALGTKT/ES 



<220> 

<221> polyA_site 
<222> 506. .521 



<220> 

<221> polyA_signal 
<222> 486. .491 



<400> 56 

attttctcaa caattcctca ccgcaggagc ctctgaagct cccaccaggc cagctctcct 60 



cccacaacag cttcccacag c 


atg 


aag 


ate 


tec 


gtg 


get 


gee 


att 


ccc 


ttc 


111 














Met 


Lys 


He 
-20 


Ser 


Val 


Ala 


Ala 


He 
-15 


Pro 


Phe 




ttc 


etc 


etc 


ate 


acc 


ate 


gee 


eta 


ggg 


acc 


aag 


act 


gaa 


tec 


tec 


tea 


159 


Phe 


Leu 


Leu 
-10 


He 


Thr 


lie 


Ala 


Leu 
-5 


Gly 


Thr 


Lys 


Thr 


Glu 
1 


Ser 


Ser 


Ser 




egg 


gga 


cct 


tac 


cac 


ccc 


tea 


gag 


tgc 


tgc 


ttc 


acc 


tac 


act 


acc 


tac 


207 


Arg 


Gly 


Pro 


Tyr 


His 


Pro 


Ser 


Glu 


Cys 


Cys 


Phe 


Thr 


Tyr 


Thr 


Thr 


Tyr 




5 










10 










15 










20 




aag 


ate 


ccg 


cgt 


cag 


egg 


att 


atg 


gat 


tac 


tat 


gag 


acc 


aac 


age 


cag 


255 


Lys 


He 


Pro 


Arg 


Gin 
25 


Arg 


He 


Met 


Asp 


Tyr 
30 


Tyr 


Glu 


Thr 


Asn 


Ser 
35 


Gin 




tgc 


tec 


aag 


ccc 


gga 


att 


gtc 


ttc 


ate 


acc 


aaa 


agg 


ggc 


cat 


tec 


gty 


303 


Cys 


Ser 


Lys 


Pro 
40 


Gly 


He 


Val 


Phe 


He 
45 


Thr 


Lys 


Arg 


Gly 


His 
50 


Ser 


Val 




tgt 


acc 


aac 


ccc 


agt 


gac 


aag 


tgg 


gtc 


cag 


gac 


tat 


ate 


aag 


gac 


atg 


351 


Cys 


Thr 


Asn 
55 


Pro 


Ser 


Asp 


Lys 


Trp 
60 


Val 


Gin 


Asp 


Tyr 


He 
65 


Lys 


Asp 


Met 




aag 


gag 


aac 


tgagtgaccc agaaggggtg gcgaaggcac agctcagaga 






400 


Lys 


Glu 


Asn 
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70 

cataaagaga agatgccaag gccccctcct 
ccctcttgga gcttccctgc tttgaattaa 
a 
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ccacccaccg ctaactctca gccccagtca 460 
agaccactca tgcccaaaaa aaaaaaaaaa 520 

521 



<210> 57 
<211> 588 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 93. .308 



<220> 

<221> sig_peptide 

<222> 93. .218 

<223> Von Heijne matrix 

score 4.46006149913644 
seq LRLEVAAAPGTPA/QP 



<220> 

<221> polyA_site 
<222> 573. . 588 



<220> 

<221> polyA_signal 
<222> 558. .563 



<400> 57 

aaaagaaatt cccgggcccg gccagcctgt gccgctgtta tgaggagaag caaagcccct 60 



tgcagaagca cttgcgggtc tgcagcctac gc 


atg 


aat 


agg 


ttg 


gca 


ggt 


gtg 


113 




















Met 


Asn 


Arg 
-40 


Leu 


Ala 


Gly 


Val 




ggc 


tgg 


egg 


gtg 


gac 


tac 


acc 


ctg 


age 


tec 


age 


ctg 


ctg 


caa 


tec 


gtg 


161 


Gly 


Trp 


Arg 


Val 


Asp 


Tyr 


Thr 


Leu 


Ser 


Ser 


Ser 


Leu 


Leu 


Gin 


Ser 


Val 




-35 










-30 










-25 










-20 




gaa 


gag 


ccc 


atg 


gtg 


cac 


ctg 


egg 


ctg 


gag 


gtg 


gca 


get 


gec 


cca 


ggg 


209 


Glu 


Glu 


Pro 


Met 


Val 


His 


Leu 


Arg 


Leu 


Glu 


Val 


Ala 


Ala 


Ala 


Pro 


Gly 












-15 










-10 










-5 






acc 


cca 


gec 


cag 


cct 


gtt 


gec 


atg 


tec 


etc 


tea 


gca 


gac 


aag 


ttc 


cag 


257 


Thr 


Pro 


Ala 


Gin 


Pro 


Val 


Ala 


Met 


Ser 


Leu 


Ser 


Ala 


Asp 


Lys 


Phe 


Gin 










1 








5 










10 










gtc 


etc 


ctg 


gca 


gaa 


ctg 


aag 


cag 


gec 


cag 


acc 


ctg 


atg 


age 


tec 


ctg 


305 


Val 


Leu 


Leu 


Ala 


Glu 


Leu 


Lys 


Gin 


Ala 


Gin 


Thr 


Leu 


Met 


Ser 


Ser 


Leu 






15 










20 










25 













ggc tgaggagaag ggtgttccag gcctgtgtgg agccgccctg cccgtatgga 358 

Gly 

30 

gtcacgccct ctgaactget ettegggagg cagccctggt tetaggatge tgaggccctg 418 

gcccggactc tggcctccca gatccccagc tgcctcactt ctctcttgag aacttggctc 478 

agggctcctg aggacctttc ccagcattac cttcccttcc cttgaaaggc aattgttggc 538 

tgttttcata agcaggaaaa ataaacagaa gtacaaaaaa aaaaaaaaaa 588 



<210> 58 

<211> 521 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 113. .364 
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<220> 

<221> sig_peptide 

<222> 113. . 172 

<223> Von Heijne matrix 

score 4.37180298395146 

SLLLSLPPHQGLT/FS 



<220> 

<221> polyA__site 
<222> 500 . . 521 



<400> 58 

ttttttacat ggtgttccca cagctgggag gacacccaca tggtcggcgt gcaggatatt 60 

tcgctggacc ctagaaaagc caccacgacc tgtgggccat gatgctaccc ca atg get 118 

Met Ala 
-20 

get get get gtt cct tct ctt ctt ctt tct ctt cct cct cac cag ggg 166 

Ala Ala Ala Val Pro Ser Leu Leu Leu Ser Leu Pro Pro His Gin Gly 

-15 -10 -5 

etc act ttc tec aac aaa ata caa cct ttt gga get caa gga gtc ttg 214 

Leu Thr Phe Ser Asn Lys lie Gin Pro Phe Gly Ala Gin Gly Val Leu 

15 10 

cat ccg gaa cca gga ctg cga gac tgg ctg ctg cca acg tgc tec aga 262 

His Pro Glu Pro Gly Leu Arg Asp Trp Leu Leu Pro Thr Cys Ser Arg 
15 20 ~ 25 30 

caa ttg cga gtc gca ctg ccg gag aag ggg tec gag ggc agt ctg tgt 310 

Gin Leu Arg Val Ala Leu Pro Glu Lys Gly Ser Glu Gly Ser Leu Cys 

35 40 45 

caa acg cag ctg cca get act cca tgc ttc ctg cct teg aat acg gtg 358 

Gin Thr Gin Leu Pro Ala Thr Pro Cys Phe Leu Pro Ser Asn Thr Val 

50 55 60 

aga acg tgaagtcatg agetgetget aaggcatgtg gcaaccttga agagaaggtc 414 
Arg Thr 

aagagctacc agccaccaaa agaatgecag cacttcctgt gtctttgett tggattcatg 474 

agaaatatac gttcctattt gcttcaaaaa aaaaaaaaaa tgcgaaa 521 



<210> 59 

<211> 516 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 83. .244 



<220> 

<221> sig_peptide 

<222> 83. . 139 

<223> Von Heijne matrix 

score 4.24406450728052 
seq LLILSKVLDDTFQ/SV 



<220> 

<221> polyA_site 
<222> 501. .516 



<400> 59 

actgetatag cgatataact cagatgttct caggacagca agtttacatg tgaatagaga 60 
gggagggacg tgttaggaag ac atg aaa ttc aat ttt gta ctt ttg ata ctt 112 

Met Lys Phe Asn Phe Val Leu Leu lie Leu 
-15 -10 
tec aaa gtt eta gat gac act ttc caa agt gtc aag aaa tgg tta aat 160 
Ser Lys Val Leu Asp Asp Thr Phe Gin Ser Val Lys Lys Trp Leu Asn 
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-5 



tat ttt cag 
Tyr Phe Gin 
10 

get cat gec 
Ala His Ala 
25 

actttgggag 
catggtgaaa 
gcctgtagtc 
gagcttacag 
ctgttcaaaa 



ttt act tta aga aat 
Phe Thr Leu Arg Asn 
15 

tgt aat ccc age act 
Cys Asn Pro Ser Thr 
30 

tccgaggtgg gcagatcatg 
ccccgtctct actaaaaata 
ccagctactc gggaggctga 
tgagecgaga tcgcgccact 
aaaaaaaaaa aa 



50 

1 

ggt tta atg 
Gly Leu Met 

ggc tea cgc 
Gly Ser Arg 

aggtcaggag 
caaaaaaatt 
ggcaggagaa 
gcactccagc 



tgg cca ggt gcg gtg 
Trp Pro Gly Ala Val 
20 

ctg taatcccagc 

Leu 

35 

atcgagacca tcctggctaa 
agecaggcat ggtggcgggc 
tggcgtgaac ctgggaggcg 
ctgggcgaca gagegagact 



208 



254 



314 
374 
434 
494 
516 



<210> 60 

<211> 517 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 83. .244 



<220> 

<221> sig_peptide 

<222> 83. . 139 

<223> Von Heijne matrix 

score 4.24406450728052 
LLILSKVLDDTFQ/SV 



<220> 

<221> polyA_site 
<222> 502. .517 



<400> 60 

actgetatag cgatataact cagatgttct caggacagca 
gggagggacg tgttaggaag ac atg aaa ttc aat ttt 

Met Lys Phe Asn Phe 
-15 

tec aaa gtt eta gat gac act ttc caa agt gtc 
Ser Lys Val Leu Asp Asp Thr Phe Gin Ser Val 

-5 1 
tat ttt cag ttt act tta aga aat ggt tta atg 
Tyr Phe Gin Phe Thr Leu Arg Asn Gly Leu Met 

10 15 
get cat gee tgt aat ccc age act ggc tea cgc 
Ala His Ala Cys Asn Pro Ser Thr Gly Ser Arg 

25 30 
actttgggag tccgaggtgg gcagatcatg aggtcaggag 
catggtgaaa ccccgtytct actaaaaata caaaaaaatt 
gcctgtagtc ccagctactc gggaggctga ggcaggagaa 
gagcttacag tgagecgaga tcgcgccact gcactccagc 
ctgtttcaaa aaaaaaaaaa aaa 



agtttacatg tgaatagaga 
gta ctt ttg ata ctt 
Val Leu Leu lie Leu 
-10 

aag aaa tgg tta aat 
Lys Lys Trp Leu Asn 
5 

tgg cca ggt gcg gtg 
Trp Pro Gly Ala Val 
20 

ctg taatcccagc 
Leu 

35 

atcgagacca tcctggctaa 
agecaggcat ggtggcgggc 
tggcgtgaac ctgggaggcg 
ctgggcgaca gagegagact 



60 
112 



160 



208 



254 



314 
374 
434 
494 
517 



<210> 61 

<211> 518 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 84 . .245 



<220> 
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<221> sig_peptide 

<222> 84 . .140 

<223> Von Heijne matrix 

score 4.24406450728052 

LLILSKVLDDTFQ/SV 



<220> 

<221> polyA_site 
<222> 503. .518 



<400> 61 

aactgctata gcgatataac tcagatgttc tcaggacagc aagtttacat gtgaatagag 
agggagggac gtgttaggaa gac atg aaa ttc aat ttt gta ctt ttg ata ctt 

Met Lys Phe Asn Phe Val Leu Leu lie Leu 
-15 -10 
tec aaa gtt eta gat gac act ttc caa agt gtc aag aaa tgg tta aat 
Ser Lys Val Leu Asp Asp Thr Phe Gin Ser Val Lys Lys Trp Leu Asn 

-5 15 
tat ttt cag ttt act tta aga aat ggt tta atg tgg cca ggt gcg gtg 
Tyr Phe Gin Phe Thr Leu Arg Asn Gly Leu Met Trp Pro Gly Ala Val 

10 " 15 20 

get cat gee tgt aat ccc age act ggc tea cgc ctg taatcccagc 
Ala His Ala Cys Asn Pro Ser Thr Gly Ser Arg Leu 

25 30 35 

actttgggag tccgaggtgg gcagatcatg aggtcaggag atcgagacca tcctggctaa 
catggtgaaa ccccgtctct actaaaaata caaaaaaatt agecaggcat ggtggcgggc 
gcctgtagtc ccagctactc gggaggctga ggcaggagaa tggcgtgaac ctgggaggcg 
gagcttacag tgagecgaga tcgcgccact gcactccagc ctgggcgaca gagegagact 
ctgtttcaaa aaaaaaaaaa aaa 



60 
113 



161 



209 



255 



315 
375 
435 
495 
518 



<210> 62 

<211> 548 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 57 . .329 



<220> 

<221> sig_peptide 

<222> 57. . 113 

<223> Von Heijne matrix 

score 4.2427958051416 
seq FVLLLSEIVSISA/LS 



<220> 

<221> polyA_site 
<222> 533. . 548 



<220> 

<221> polyA_signal 
<222> 505. . 510 



<400> 62 

agttgtcttt ggtagttttt ttgeactaac ttcaggaacc agctcatgat ctcagg atg 59 

Met 

tat gga aaa ata ate ttt gta tta eta ttg tea gaa att gtg age ata 107 
Tyr Gly Lys lie lie Phe Val Leu Leu Leu Ser Glu lie Val Ser lie 

-15 -10 -5 

tea gca tta agt acc act gag gtg gca atg cac act tea acc tct tct 155 
Ser Ala Leu Ser Thr Thr Glu Val Ala Met His Thr Ser Thr Ser Ser 
15 10 
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52 



tea 


gtc 


aca 


aag 


agt 


tac 


ate 


tea 


tea 


cag 


aca 


aat 


gga 


gaa 


atg 


gga 


203 


Ser 


Val 


Thr 


Lys 


Ser 


Tyr 


He 


Ser 


Ser 


Gin 


Thr 


Asn 


Gly 


Glu 


Met 


Gly 




15 










20 










25 










30 




caa 


ctt 


gtc 


cat 


cgt 


ttc 


act 


gta 


cca 


get 


cct 


gta 


gtg 


ata 


ata 


etc 


251 


Gin 


Leu 


Val 


His 


Arg 


Phe 


Thr 


Val 


Pro 


Ala 


Pro 


Val 


Val 


He 


He 


Leu 












35 










40 










45 






att 


att 


ttg 


tgt 


gtg 


atg 


get 


ggt 


att 


att 


gga 


acg 


ate 


etc 


tta 


att 


299 


He 


He 


Leu 


Cys 


Val 


Met 


Ala 


Gly 


He 


He 


Gly 


Thr 


He 


Leu 


Leu 


He 










50 










55 










60 








tct 


tac 


agt 


att 


cgc 


cga 


ctg 


ata 


aag 


gca 


tgaggatgtg gcctgcatgc 


349 


Ser 


Tyr 


Ser 


He 


Arg 


Arg 


Leu 


He 


Lys 


Ala 

















65 70 

tgectgatet tgectagaac cggctgcacc tgctgttctc ttgtttatgc aaactggctg 409 

cacctgctat tectttgett atgcccctac ccctggctat cctaattccc tgttctcctg 469 

cctcactatt actgtattct ctacttctaa ataaaaataa aacaaaatac aaaccgttaa 529 

aataaaaaaa aaaaaaaaa 548 

<210> 63 

<211> 722 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 331. . 672 

<220> 

<221> sig_peptide 

<222> 331. . 426 

<223> Von Heijne matrix 

score 4.21873806574589 

seq LLGSGLFVFSLTA/ FN 

<220> 

<221> polyA_site 
<222> 707 . . 722 

<220> 

<221> polyA_signal 
<222> 688 . . 693 

<400> 63 

tttttcctcg tcctctcccg acagagctga cgtgtcctgg gttccaccgg gagegggcat 60 
ttccaccgga egggagggtt cggggtgtcc ggggctgggg aataegtagg ggttgccgcg 120 
cggtgtgggg agttggggcg tgtggctgca gtccegggag ttcttggagg gggtcggccc 180 
accgagcttc cggaccggct gatctgcccg tagcttgecg gagggagggc ggagectgae 240 
tctccgtccc ttctcccatc ccctccagtg gtgggtacgg gcacctcgct ggcgctctcc 300 
tccctcctgt ccctgctgct ctttgctggg atg cag atg tac age cgt cag ctg 354 

Met Gin Met Tyr Ser Arg Gin Leu 
-30 -25 
gee tec ace gag tgg etc ace ate cag ggc ggc ctg ctt ggt teg ggt 402 
Ala Ser Thr Glu Trp Leu Thr He Gin Gly Gly Leu Leu Gly Ser Gly 

-20 -15 -10 

etc ttc gtg ttc teg etc act gee ttc aat aat ctg gag aat ctt gtc 450 
Leu Phe Val Phe Ser Leu Thr Ala Phe Asn Asn Leu Glu Asn Leu Val 

-5 15 
ttt ggc aaa gga ttc caa gca aag ate ttc cct gag att etc ctg tgc 498 
Phe Gly Lys Gly Phe Gin Ala Lys He Phe Pro Glu He Leu Leu Cys 

10 "* " 15 20 

etc ctg ttg get etc ttt gca tct ggc etc ate cac cga gtc tgt gtc 546 
Leu Leu Leu Ala Leu Phe Ala Ser Gly Leu He His Arg Val Cys Val 
25 30 35 40 

ace ace tgc ttc ate ttc tec atg gtt ggt ctg tac tac ate aac aag 594 



WO 02/083898 

Thr Thr Cys Phe lie Phe Ser Met Val 
45 

aty tec tec acc ctg tac cag gca gca 
lie Ser Ser Thr Leu Tyr Gin Ala Ala 

60 65 
aag gtc aca ggc aag age aag aag aga 
Lys Val Thr Gly Lys Ser Lys Lys Arg 

75 80 
agttgattct ttgcaaaaaa aaaaaaaaaa 



PCT/IB01/00914 

53 

Gly Leu Tyr Tyr lie Asn Lys 
50 55 
get cca gtc etc aca cca gee 642 
Ala Pro Val Leu Thr Pro Ala 
70 

aac tgaccctgaa tgttcaataa 692 
Asn 

722 



<210> 64 

<211> 479 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 96. . 320 

<220> 

<221> sig_peptide 

<222> 96. .200 

<223> Von Heijne matrix 

score 4.14350049367799 
seq MAVAFVLSLGVAA/LY 



<220> 

<221> polyA_site 
<222> 464 . . 479 



<220> 

<221> polyA_signal 
<222> 437 . .442 

<400> 64 

attctgcgcc tgcgcgcgtc tacagcaegg ttcgtttttc ctttagtcag gaaggacgtt 60 
ggtgttgagc ataegtatea aggacagtaa ytacc atg get ccc gaa gtt ttg 113 

Met Ala Pro Glu Val Leu 























-35 










-30 




cca 


aaa 


cct 


egg 


atg 


cgt 


ggc 


ctt 


ctg 


gee 


agg 


cgt 


ctg 


cga 


aat 


cat 


161 


Pro 


Lys 


Pro 


Arg 


Met 
-25 


Arg 


Gly 


Leu 


Leu 


Ala 
-20 


Arg 


Arg 


Leu 


Arg 


Asn 
-15 


His 




atg 


get 


gta 


gca 


ttc 


gtg 


eta 


tec 


ctg 


ggg 


gtt 


gca 


get 


ttg 


tat 


aag 


209 


Met 


Ala 


Val 


Ala 
-10 


Phe 


Val 


Leu 


Ser 


Leu 
-5 


Gly 


Val 


Ala 


Ala 


Leu 
1 


Tyr 


Lys 




ttt 


cgt 


gtg 


get 


gat 


caa 


aga 


aag 


aac 


gca 


tac 


gca 


gat 


ttc 


tac 


aga 


257 


Phe 


Arg 
5 


Val 


Ala 


Asp 


Gin 


Arg 
10 


Lys 


Asn 


Ala 


Tyr 


Ala 
15 


Asp 


Phe 


Tyr 


Arg 




aac 


tac 


gat 


gtc 


atg 


aaa 


gat 


ttt 


gag 


gag 


atg 


agg 


aag 


get 


ggt 


ate 


305 


Asn 


Tyr 


Asp 


Val 


Met 


Lys 


Asp 


Phe 


Glu 


Glu 


Met 


Arg 


Lys 


Ala 


Gly 


He 




20 










25 










30 










35 





ttt cag agt gta aag taatcttgga atataaagaa tttcttcagg ttgaattacc 360 
Phe Gin Ser Val Lys 
40 

tagaagtttg tcactgactt gtgttcctga actatgacac atgaatatgt gggctaagaa 420 
atagttcctc ttgataaata aacaattaac aaatactttg gacaaaaaaa aaaaaaaaa 479 



<210> 65 

<211> 1627 

<212> DNA 

<213> Homo sapiens 



<220> 
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<221> CDS 
<222> 90. .413 



<220> 

<221> sig__peptide 

<222> 90. .209 

<223> Von Heijne matrix 

score 4.06476262296284 
seq VTAPSMVAPVTFA/SI 



<220> 

<221> polyA_site 
<222> 1612 . .1627 



<220> 

<221> polyA_signal 
<222> 1577. .1582 



<400> 65 

aatctctgcg aaagcttcca ggttcgctca tcgtaggtca ctggaaaaaa ttgtagcatt 
gctaaagagt atttcagaga aaattttgc atg caa aaa ttt agg aag atg agt 

Met Gin Lys Phe Arg Lys Met Ser 
-40 -35 
gaa act cat cat tct gta ate tct gtg aaa get tec agt ccc tgg eta 
Glu Thr His His Ser Val lie Ser Val Lys Ala Ser Ser Pro Trp Leu 

-30 -25 -20 

tct tct tea gtg act get cca tec atg gta gee cca gtc act ttt gca 
Ser Ser Ser Val Thr Ala Pro Ser Met Val Ala Pro Val Thr Phe Ala 

-15 -10 -5 

tct att gta gaa gaa gaa eta caa caa gaa gca get ctt att aga agt 
Ser lie Val Glu Glu Glu Leu Gin Gin Glu Ala Ala Leu lie Arg Ser 
15 10 15 

cga gaa aaa ccg ttg get ctg att cag att gag gag cat gee ata caa 
Arg Glu Lys Pro Leu Ala Leu lie Gin lie Glu Glu His Ala lie Gin 

20 25 30 

gat tta ttg gtt ttc tat gag gca ttt ggc aac cct gaa gag ttt gtc 
Asp Leu Leu Val Phe Tyr Glu Ala Phe Gly Asn Pro Glu Glu Phe Val 

35 40 45 

att gtt gaa agg aca ccg cag gga cca ctg gca gta cct atg tgg aat 
lie Val Glu Arg Thr Pro Gin Gly Pro Leu Ala Val Pro Met Trp Asn 

50 55 60 

aag cat gga tgc tagttcactg tggagttgag atgeatttta cataattatg 
Lys His Gly Cys 
65 

agtttgttca tataaagaaa agctgtggaa aagagtctta gagattttgt aatatcattc 
taaatagatt aagaaaagat ataatttctt tactgeagtt aaatcatata atgtttgtat 
gattaaaaat aaatttctca gaattgtgat tttagtaact ttatataaaa tgtgtgagac 
aaaaacttat taaggttaaa tagaattgtt tcttstgaat ratctaacaa aggaraatat 
aagtgattga atcataagat ataagggggg taaagtatta aaaataactt ttttgtttga 
taacttgaga atttagaaga ttttgccaag tatgtgttgt tgettgaett cttaaatatg 
gcattgatga atttaaagta ggagcatcag ttattacttc tgattcatta atggccagaa 
ttttgtgttt ggtgtaatag ttgtgtcacc attcttgttg ctttttaaaa atcaggctaa 
tcatgtggtc catgtctctt caaagcttga cctgcacaaa tgccatattt cyatttggac 
cacatattct ecattttgea ttgagcagta gagtacagtg gaaagggaat aagaatactg 
attattctga acagtttagt cccaagagaa tagegtttta aaaaagaaaa acaagatttg 
gagtcattgt gggttatttt tggtgggatg gaggatctta aaaatgecta attgtgagag 
aatcaattgc tgaaagtgtt aaaatttctg aaaataaatg cttaattaca tatacaggaa 
ttaaatagtt tggaagaggg ttggattatc attaccttta caatactgta taatcagaag 
ttctctgaac ctcaattgta tatctagaca taaaaattgt tttctgtata ggatgttgtt 
tggtttgttt ctgagtgttt aaattttgea aaaacaaatg ttaaatttgt gcttcagtac 
ctagataaat tggaaaggtt aatgttctag tttctggaag gtaagcctgg gagacacata 
agcaattcac tgctataatt tagttgatgt aaaatgaegg aaactggctc aatatgtcag 
gtttaactct gcccaaaagc agcagacatg taagcagatg tgcaataaaa aatgatcttg 



60 
113 



161 



209 



257 



305 



353 



401 



453 



513 
573 
633 
693 
753 
813 
873 
933 
993 
1053 
1113 
1173 
1233 
1293 
1353 
1413 
1473 
1533 
1593 
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atccatttca aaaaaaagaa aaaaaaaaaa aaaa 

<210> 66 
<211> 555 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 9. .389 

<220> 

<221> sig_peptide 
<222> 9. . 149 
<223> Von Heijne matrix 

score 3.92635967755835 

seq LSSDLQQVGGASA/RI 

<220> 

<221> polyA_site 
<222> 540. .555 

<220> 

<221> polyA_signal 
<222> 512. .517 

<400> 66 

tgggagtg atg ttc acg cct ctg aca gtg aaa tac gcg tac tac gac act 50 
Met Phe Thr Pro Leu Thr Val Lys Tyr Ala Tyr Tyr Asp Thr 
-45 -40 -35 



gaa 


cgc 


ate 


gga 


gtt 


gac 


ctg 


ate 


atg 


aag 


acc 


tgc 


ttt 


age 


ccc 


aac 


98 


Glu 


Arg 


He 


Gly 


Val 


Asp 


Leu 


He 


Met 


Lys 


Thr 


Cys 


Phe 


Ser 


Pro 


Asn 










-30 










-25 










-20 








aga 


gtg 


att 


gga 


etc 


tea 


agt 


gac 


ttg 


cag 


caa 


gta 


gga 


ggg 


gca 


tea 


146 


Arg 


Val 


He 


Gly 


Leu 


Ser 


Ser 


Asp 


Leu 


Gin 


Gin 


Val 


Gly 


Gly 


Ala 


Ser 








-15 










-10 










-5 










get 


cgc 


ate 


cag 


gat 


gee 


ctg 


agt 


aca 


gtg 


ttg 


caa 


tat 


gca 


gag 


gat 


194 


Ala 


Arg 


He 


Gin 


Asp 


Ala 


Leu 


Ser 


Thr 


Val 


Leu 


Gin 


Tyr 


Ala 


Glu 


Asp 






1 








5 










10 










15 




gta 


ctg 


tct 


gga 


aag 


gtg 


tea 


get 


gac 


aat 


act 


gtg 


ggc 


cgc 


ttc 


ctg 


242 


Val 


Leu 


Ser 


Gly 


Lys 


Val 


Ser 


Ala 


Asp 


Asn 


Thr 


Val 


Gly 


Arg 


Phe 


Leu 












20 










25 










30 






atg 


age 


ctg 


gtt 


aac 


caa 


gta 


ccg 


aaa 


ata 


gtt 


ccc 


gat 


gac 


ttt 


gag 


290 


Met 


Ser 


Leu 


Val 


Asn 


Gin 


Val 


Pro 


Lys 


He 


Val 


Pro 


Asp 


Asp 


Phe 


Glu 










35 










40 










45 








acc 


atg 


etc 


aac 


age 


aac 


ate 


aat 


gac 


ctt 


ttg 


atg 


gtg 


acc 


tac 


ctg 


338 


Thr 


Met 


Leu 


Asn 


Ser 


Asn 


He 


Asn 


Asp 


Leu 


Leu 


Met 


Val 


Thr 


Tyr 


Leu 








50 










55 










60 










gec 


aac 


etc 


aca 


cag 


tea 


cag 


att 


gca 


etc 


aat 


gaa 


aaa 


ctt 


gta 


aac 


386 


Ala 


Asn 


Leu 


Thr 


Gin 


Ser 


Gin 


He 


Ala 


Leu 


Asn 


Glu 


Lys 


Leu 


Val 


Asn 





65 70 75 

ctg tgaatggacc ccaagcagta cacttgetgg tctaggtatt aaccccagga 439 
Leu 
80 

ctcagaagtg aaggagaaat gggttttttg tggtcttgag tcacactgag atagtcagtt 499 
gtgtgtgact ctaataaacg gagcctacct tttgtaaatt aaaaaaaaaa aaaaaa 555 

<210> 67 

<211> 422 

<212> DNA 

<213> Homo sapiens 



PCT/IB01/00914 

1627 



<220> 
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<221> CDS 
<222> 24 . . 278 



<220> 

<221> sig_peptide 

<222> 24 . . 119 

<223> Von Heijne matrix 

score 3.85301646018147 
seq L AH I Q AL ISGIEA/QL 



<220> 

<221> polyA_site 
<222> 407 . . 422 



<220> 

<221> polyA_signal 
<222> 384 . .389 



<400> 67 

ttgagctgca gtcacagctg age atg aaa get gee ttg gaa gac aca ctg gca 53 

Met Lys Ala Ala Leu Glu Asp Thr Leu Ala 
-30 -25 



gaa 


acg 


gag 


gcg 


cgc 


ttt 


gga 


gee 


cag 


ctg 


gcg 


cat 


ate 


cag 


gcg 


ctg 


101 


Glu 


Thr 


Glu 


Ala 


Arg 


Phe 


Gly 


Ala 


Gin 


Leu 


Ala 


His 


He 


Gin 


Ala 


Leu 








-20 










-15 










-10 










ate 


age 


ggt 


att 


gaa 


gee 


cag 


ctg 


ggc 


gat 


gtg 


cga 


get 


gat 


agt 


gag 


149 


He 


Ser 


Gly 


He 


Glu 


Ala 


Gin 


Leu 


Gly 


Asp 


Val 


Arg 


Ala 


Asp 


Ser 


Glu 






-5 










1 








5 










10 




egg 


cag 


aat 


cag 


gag 


tac 


cag 


egg 


etc 


atg 


gac 


ate 


aag 


teg 


egg 


ctg 


197 


Arg 


Gin 


Asn 


Gin 


Glu 


Tyr 


Gin 


Arg 


Leu 


Met 


Asp 


He 


Lys 


Ser 


Arg 


Leu 












15 










20 










25 






gag 


cag 


gag 


att 


gee 


ace 


tac 


cgc 


age 


ctg 


etc 


gag 


gga 


cag 


gaa 


gat 


245 


Glu 


Gin 


Glu 


He 


Ala 


Thr 


Tyr 


Arg 


Ser 


Leu 


Leu 


Glu 


Gly 


Gin 


Glu 


Asp 










30 










35 










40 








cac 


tac 


aac 


aat 


ttg 


tct 


gee 


tec 


aag 


gtc 


etc 


tgaggcagca ggctctgggg 


298 


His 


Tyr 


Asn 


Asn 


Leu 


Ser 


Ala 


Ser 


Lys 


Val 


Leu 















45 50 

ettctgetgt cctttggagg gtgtcttctg ggtagaggga tgggaaggaa gggaccctta 358 

cccccggctc ttctcctgac ctgccaataa aaatttatgg tccaaggcaa aaaaaaaaaa 418 

aaaa 422 



<210> 68 

<211> 630 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 232. .534 



<220> 

<221> sig_peptide 

<222> 232. .306 

<223> Von Heijne matrix 

score 3.78142478107177 
seq AKTCLVLCSRVLS/VI 



<220> 

<221> polyA_site 
<222> 615. . 630 



<220> 

<221> polyA_signal 
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<222> 595. . 600 
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<400> 68 

tatcactgtt acgaaccaag gatttacaga tcactggcaa aaattctgag aactttcaca 60 
ccagtatact gtccaagccc attaagtggc atcacacctc tcttttatgt agctcagaca 120 
agacagtcta atatcttcaa aatactactg caatatggaa tcttagaaag agaaaaaaac 180 
cctatcaaca ttgtcttaac aatagtactc tacccttcga gagtaagagt a atg gtt 237 

Met Val 
-25 

gat cgt gaa ttg get gac ate cat gaa gat gec aaa aca tgt ttg gta 285 
Asp Arg Glu Leu Ala Asp lie His Glu Asp Ala Lys Thr Cys Leu Val 

-20 -15 -10 

eta tgt tec aga gtg ctt tct gtc att tea gtc aag gaa ata aag aca 333 
Leu Cys Ser Arg Val Leu Ser Val lie Ser Val Lys Glu lie Lys Thr 

-5 1 5 

cag ctg agt tta gga aga cat cca att att tea aat tgg ttt gat tac 381 
Gin Leu Ser Leu Gly Arg His Pro lie lie Ser Asn Trp Phe Asp Tyr 
10 15 20 25 

att cct tea aca aga tac aaa gat cca tgt gaa eta tta cat ctt tgc 429 
lie Pro Ser Thr Arg Tyr Lys Asp Pro Cys Glu Leu Leu His Leu Cys 

30 35 4 0 

aga eta ace ate agg aat caa eta tta ace aac aat atg etc cca gat 477 
Arg Leu Thr lie Arg Asn Gin Leu Leu Thr Asn Asn Met Leu Pro Asp 

4 5 50 55 

gga ata ttt tea ctt eta att cct get cgt eta caa aac tat ctg aat 525 
Gly lie Phe Ser Leu Leu lie Pro Ala Arg Leu Gin Asn Tyr Leu Asn 

60 65 70 

tta gaa ate taacatacgt cagtgtccta agttccttaa caatgettae 574 
Leu Glu lie 
75 

caatgtatgg cttagaagtt aataaaaatt cacttcatgc aaaaaaaaaa aaaaat 630 

<210> 69 

<211> 730 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 80. .283 

<220> 

<221> sig_peptide 

<222> 80 . . 220 

<223> Von Heijne matrix 

score 3.72491712201476 

seq PLLSLHSRGGSSS/ES 

<220> 

<221> polyA_site 
<222> 715. . 730 

<220> 

<221> polyA__signal 
<222> 697 . . 702 

<400> 69 

attttgegaa eggegagcag cggcggcggc geggagagae geageggagg ttttcctggt 60 
ttcggacccc ageggcegg atg gtg aaa tec tec ctg cag egg ate etc aat 112 

Met Val Lys Ser Ser Leu Gin Arg lie Leu Asn 
-45 -40 
age cac tgc ttc gee aga gag aag gaa ggg gat aaa ccc age gee ace 160 
Ser His Cys Phe Ala Arg Glu Lys Glu Gly Asp Lys Pro Ser Ala Thr 
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-35 










-30 






-25 










ate 


cac 


gec 


age 


cgc 


acc 


atg 


ccg 


etc 


eta age ctg 


cac 


age cgc 


ggc 


208 


He 


His 


Ala 


Ser 


Arg 


Thr 


Met 


Pro 


Leu 


Leu Ser Leu 


His 


Ser Arg 


Gly 




-20 










-15 








-10 






-5 




ggc 


age 


age 


agt 


gag 


agt 


tec 


agg 


gtc 


tec etc cac 


tgc 


tgt agt 


aac 


256 


Gly 


Ser 


Ser 


Ser 


Glu 
1 


Ser 


Ser 


Arg 


Val 
5 


Ser Leu His 


Cys 


Cys Ser 
10 


Asn 




ccg 


ggt 


ccg 


ggg 


cct 


egg 


tgg 


tgc 


tec 


tgatgcccct ■ 


cacccacccc 




303 


Pro 


Gly 


Pro 
15 


Gly 


Pro 


Arg 


Trp 


Cys 
20 


Ser 













tgaagatccc aggtgggcga ggaggaggag tagggccgcc teggggctgg gcatccggcc 363 

cctggggcca ccccttgtca gccgggtggg taggaaccgt agactcgctc atctcgcctg 423 

ggtttgtccg catgttgtaa tcgtgcaaat aaacgctcac tccgaattag cggtgtattt 483 

cttgaagttt aatattgtgt ttgtgatact gaagtatttg ctttaattct aaataaaaat 543 

ttatatttta cttttttatt gctggtttaa gatgattcag attatccttg tactttgagg 603 

agaagtttct tatttggagt cttttggaaa cagtcttagt cttttaactt ggaaagatga 663 

ggtattaatc ccctccattg ctctccaaaa gecaataaag tgattacacc caaaaaaaaa 723 

aaaaaaa 730 



<210> 70 

<211> 730 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 80. .283 



<220> 

<221> sig_peptide 

<222> 80. .220 

<223> Von Heijne matrix 

score 3.72491712201476 
seq PLLSLHSRGGSSS/ES 



<220> 

<221> polyA_site 
<222> 715. .730 



<220> 

<221> polyA_signal 
<222> 697 . .702 



<400> 70 

attttgegaa eggegagcag cggcggcggc geggagagae geageggagg ttttcctggt 60 
ttcggacccc ageggcegg atg gtg aaa tec tec ctg cag egg ate etc aat 112 

Met Val Lys Ser Ser Leu Gin Arg He Leu Asn 
-45 -40 
age cac tgc ttc gee aga gag aag gaa ggg gat aaa ccc age gec acc 160 
Ser His Cys Phe Ala Arg Glu Lys Glu Gly Asp Lys Pro Ser Ala Thr 

-35 " -30 -25 

ate cac gec age cgc acc atg ccg etc eta age ctg cac age cgc ggc 208 
He His Ala Ser Arg Thr Met Pro Leu Leu Ser Leu His Ser Arg Gly 
-20 -15 -10 -5 

ggc age age agt gag agt tec agg gtc tec etc cac tgc tgt agt aac 256 
Gly Ser Ser Ser Glu Ser Ser Arg Val Ser Leu His Cys Cys Ser Asn 

1 5 10 

ccg ggt ccg ggg cct egg tgg tgc tec tgatgcccct cacccacccc 303 
Pro Gly Pro Gly Pro Arg Trp Cys Ser 

15 20 
tgaagatccc aggtgggcga ggaggaggag tagggccgcc teggggctgg gcatccggcc 363 
cctggggcca ccccttgtca gccgggtggg taggaaccgt agactcgctc atctcgcctg 423 
ggtttgtccg catgttgtaa tcgtgcaaat aaacgctcac tccgaattag cggtgtattt 483 
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cttgaagttt aatattgtgt ttgtgatact gaagtatttg ctttaattct aaataaaaat 543 

ttatatttta cttttttatt gctggtttaa gatgattcag attatccttg tactttgagg 603 

agaagtttct tatttggagt cttttggaaa cagtcttagt cttttaactt ggaaagatga 663 

ggtattaatc ccctccattg ctctccaaaa gccaataaag tgattacacc caaaaaaaaa 723 

aaaaaaa 730 



<210> 71 

<211> 863 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 80. .283 



<220> 

<221> sig_peptide 

<222> 80. . 220 

<223> Von Heijne matrix 

score 3.72491712201476 
PLLSLHSRGGSSS/ES 



<220> 

<221> polyA_site 
<222> 844 . . 863 



<220> 

<221> polyA_signal 
<222> 827 . . 832 



<400> 71 

attttgcgaa cggcgagcag cggcggcggc gcggagagac gcagcggagg ttttcctggt 
ttcggacccc agcggccgg atg gtg aaa tec tec ctg cag egg ate etc aat 

Met Val Lys Ser Ser Leu Gin Arg lie Leu Asn 



-45 



-40 



age cac tgc ttc gee aga gag aag gaa ggg gat aaa ccc age gee ace 
Ser His Cys Phe Ala Arg Glu Lys Glu Gly Asp Lys Pro Ser Ala Thr 



-35 



-30 



-25 



ate cac gee age cgc acc atg ccg etc eta age ctg cac age cgc ggc 
lie His Ala Ser Arg Thr Met Pro Leu Leu Ser Leu His Ser Arg Gly 
-20 -15 -10 -5 

ggc age age agt gag agt tec agg gtc tec etc cac tgc tgt agt aac 
Gly Ser Ser Ser Glu Ser Ser Arg Val Ser Leu His Cys Cys Ser Asn 

15 10 
ccg ggt ccg ggg cct egg tgg tgc tec tgatgcccct cacccacccc 
Pro Gly Pro Gly Pro Arg Trp Cys Ser 

15 20 
tgaagatccc aggtgggcga gggaatagtc agagggatca caatctttca gctaacttat 
tctactccga tgatcggctg aatgtaacag aggaactaac gtccaacgac aagacgagga 
ttctcaacgt ctcagcctct acatcgagat cccgggcggc gcgctgcccg aggggagcaa 
ggacagcttt gcagttctcc tggagttcgc tgaggagcag ctgcgagccg accatgtctt 
catttgette cacaagaacc gcgaggacag agccgccttg ctccgaacct tcagcttttt 
gggctttgag attgtgagac cggggcatcc ccttgtcccc aagagacccg aegcttgett 
catggcctac aegttcgaga gagagtcttc gggagaggag gaggagtagg gccgcctcgg 
ggctgggcat ccggcccctg gggccacccc ttgtcagccg ggtgggtagg aacegtagae 
tcgctcatct cgcctgggtt tgtccgcatg ttgtaatcgt gcaaataaac gctcactccg 
aattaaaaaa aaaaaaaaaa 



60 
112 



160 



208 



256 



303 



363 
423 
483 
543 
603 
663 
723 
783 
843 
863 



<210> 72 

<211> 730 

<212> DNA 

<213> Homo sapiens 



WO 02/083898 



PCT/IB01/00914 



60 

<220> 
<221> CDS 
<222> 80. .283 



<220> 

<221> sig_peptide 

<222> 80. .220 

<223> Von Heijne matrix 

score 3.72491712201476 
PLLSLHSRGGSSS/ES 



<220> 

<221> polyA_site 
<222> 715. .730 



<220> 

<221> polyA_signal 
<222> 697. .702 



<400> 72 

attttgcgaa cggcgagcag cggcggcggc gcggagagac gcagcggagg ttttcctggt 
ttcggacccc agcggccgg atg gtg aaa tec tec ctg cag egg ate etc aat 

Met Val Lys Ser Ser Leu Gin Arg lie Leu Asn 
-45 -40 
age cac tgc ttc gee aga gag aag gaa ggg gat aaa ccc age gee ace 
Ser His Cys Phe Ala Arg Glu Lys Glu Gly Asp Lys Pro Ser Ala Thr 

-35 -30 -25 

ate cac gee age cgc acc atg ccg etc eta age ctg cac age cgc ggc 
lie His Ala Ser Arg Thr Met Pro Leu Leu Ser Leu His Ser Arg Gly 
-20 -15 -10 -5 

ggc age age agt gag agt tec agg gtc tec etc cay tgy tgt agt aas 
Gly Ser Ser Ser Glu Ser Ser Arg Val Ser Leu His Cys Cys Ser Xaa 

15 10 
ccg ggt ccg ggg cct egg tgg tgc tec tgatgcccct cacccacccc 
Pro Gly Pro Gly Pro Arg Trp Cys Ser 

15 20 
tgaagatccc aggtgggcga ggaggaggag tagggccgcc teggggctgg gcatccggcc 
cctggggcca ccccttgtca gccgggtggg taggaaccgt agactcgctc atctcgcctg 
ggtttgtccg catgttgtaa tcgtgcaaat aaacgctcac tccgaattag cggtgtattt 
cttgaagttt aatattgtgt ttgtgatact gaagtatttg ctttaattct aaataaaaat 
ttatatttta cttttttatt gctggtttaa gatgattcag attatccttg tactttgagg 
agaagtttct tatttggagt cttttggaaa cagtcttagt cttttaactt ggaaagatga 
ggtattaatc ccctccattg ctctccaaaa gecaataaag tgattacacc caaaaaaaaa 
aaaaaaa 



60 
112 



160 



208 



256 



303 



363 
423 
483 
543 
603 
663 
723 
730 



<210> 73 

<211> 879 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 9. . 395 



<220> 

<221> sig_peptide 
<222> 9.-56 

<223> Von Heijne matrix 

score 3.61791418904325 
LLLLLRALRRG PG / PG 



<220> 

<221> polyA_site 
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<222> 864 . . 879 
<400> 73 

aggccaac atg gcc gtg ctg ctg ctg ctg etc cgt gec etc cgc egg ggt 50 
Met Ala Val Leu Leu Leu Leu Leu Arg Ala Leu Arg Arg Gly 
-15 -10 -5 



cca 


ggc 


ccg 


ggt 


cct 


egg 


ccg 


ctg 


tgg 


ggc 


cca 


ggc 


ccg 


gcc 


tgg 


agt 


98 


Pro 


Gly 


Pro 
1 


Gly 


Pro 


Arg 


Pro 
5 


Leu 


Trp 


Gly 


Pro 


Gly 
10 


Pro 


Ala 


Trp 


Ser 




cca 


ggg 


ttc 


ccc 


gcc 


agg 


ccc 


ggg 


agg 


ggg 


egg 


ccg 


tac 


atg 


gcc 


age 


146 


Pro 


Gly 


Phe 


Pro 


Ala 


Arg 


Pro 


Gly 


Arg 


Gly 


Arg 


Pro 


Tyr 


Met 


Ala 


Ser 




15 










20 










25 










30 




agg 


cct 


ccg 


ggg 


gac 


etc 


gcc 


gag 


get 


gga 


ggc 


cga 


get 


ctg 


cag 


age 


194 


Arg 


Pro 


Pro 


Gly 


Asp 
35 


Leu 


Ala 


Glu 


Ala 


Gly 
40 


Gly 


Arg 


Ala 


Leu 


Gin 
45 


Ser 




tta 


caa 


ttg 


aga 


ctg 


eta 


ace 


cct 


acc 


ttt 


gaa 


ggg 


ate 


aac 


gga 


ttg 


242 


Leu 


Gin 


Leu 


Arg 
50 


Leu 


Leu 


Thr 


Pro 


Thr 
55 


Phe 


Glu 


Gly 


He 


Asn 
60 


Gly 


Leu 




ttg 


ttg 


aaa 


caa 


cat 


tta 


gtt 


cag 


aat 


cca 


gtc 


aga 


etc 


tgg 


caa 


ctt 


290 


Leu 


Leu 


Lys 
65 


Gin 


His 


Leu 


Val 


Gin 
70 


Asn 


Pro 


Val 


Arg 


Leu 
75 


Trp 


Gin 


Leu 




tta 


ggt 


ggt 


act 


ttc 


twt 


ttt 


aac 


acc 


tea 


agg 


ttg 


aag 


cak 


aag 


aat 


338 


Leu 


Gly 
80 


Gly 


Thr 


Phe 


Xaa 


Phe 
85 


Asn 


Thr 


Ser 


Arg 


Leu 
90 


Lys 


Xaa 


Lys 


Asn 




aag 


gag 


aag 


gat 


aag 


teg 


aag 


ggg 


aag 


gcg 


cct 


gaa 


gag 


gac 


gaa 


ggt 


386 


Lys 


Glu 


Lys 


Asp 


Lys 


Ser 


Lys 


Gly 


Lys 


Ala 


Pro 


Glu 


Glu 


Asp 


Glu 


Gly 




95 










100 










105 










110 





ata ttc ate tgatgttctt cagtcagtag ctgcctctgg atgtctttac 435 
He Phe lie 
atttctgttt tccttttagc 
gcagtggctc acacttgtaa 
ggcctatacc acatagegag 
tcacacctgt aatcccagca 
caacatggtg aaaccccgtc 
gectgtaate ccagctactc 
gaggttgctg tgagtggaga 
tccatctcaa aaaaaaaaaa 

<210> 74 
<211> 879 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 9. .395 

<220> 

<221> sig_peptide 
<222> 9. .56 
<223> Von Heijne matrix 

score 3.61791418904325 

LLLLLRALRRG PG / PG 

<220> 

<221> polyA_site 
<222> 864 .. 879 

<400> 74 

aggccaac atg gcc gtg ctg ctg ctg ctg etc cgt gcc etc cgc egg ggt 50 
Met Ala Val Leu Leu Leu Leu Leu Arg Ala Leu Arg Arg Gly 
-15 -10 ~ -5 

cca ggc ccg ggt cct egg ccg ctg tgg ggc cca ggc ccg gcc tgg agt 98 



aaggtgaaac cagtctggaa aatggggaga tgggccgggt 4 95 

tegaaacget ttgggaggcc caggtggaag gatcacttga 555 

accctgtctc actgeaaatt aaaaggctgg gcgtggtggc 615 

ctttgggagg ctgaggcagg cggatcacct gcaccctggc 675 

tttactaaaa atagaaaatt ageegggegt gatggcacac 735 

gggaggctga ggcaggagaa ttgettgaac ctgggaggtg 795 

teatgecatt gcactccagc ctgagcaaca agagcaaaac 855 

aaaa 879 
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Pro 


Gly 


Pro 
1 


Gly 


Pro 


Arg 


Pro 
5 


Leu 


Trp 


Gly 


Pro 


Gly 
10 


Pro 


Ala 


Trp 


Ser 




cca 


ggg 


ttc 


ccc 


gcc 


agg 


ccc 


ggg 


agg 


ggg 


egg 


ccg 


tac 


atg 


gcc 


age 


146 


Pro 


Gly 


Phe 


Pro 


Ala 


Arg 


Pro 


Gly 


Arg 


Gly 


Arg 


Pro 


Tyr 


Met 


Ala 


Ser 




15 










20 










25 










30 




agg 


cct 


ccg 


ggg 


gac 


etc 


gcc 


gag 


get 


gga 


ggc 


cga 


get 


ctg 


cag 


age 


194 


Arg 


Pro 


Pro 


Gly 


Asp 
35 


Leu 


Ala 


Glu 


Ala 


Gly 
40 


Gly 


Arg 


Ala 


Leu 


Gin 
45 


Ser 




tta 


caa 


ttg 


aga 


ctg 


eta 


acc 


cct 


acc 


ttt 


gaa 


ggg 


ate 


aac 


gga 


ttg 


242 


Leu 


Gin 


Leu 


Arg 
50 


Leu 


Leu 


Thr 


Pro 


Thr 
55 


Phe 


Glu 


Gly 


He 


Asn 
60 


Gly 


Leu 




ttg 


ttg 


aaa 


caa 


cat 


tta 


gtt 


cag 


aat 


cca 


gtc 


aga 


etc 


tgg 


caa 


ctt 


290 


Leu 


Leu 


Lys 
65 


Gin 


His 


Leu 


Val 


Gin 
70 


Asn 


Pro 


Val 


Arg 


Leu 

75 


Trp 


Gin 


Leu 




tta 


ggt 


ggt 


act 


ttc 


tat 


ttt 


aac 


acc 


tea 


agg 


ttg 


aag 


cag 


aag 


aat 


338 


Leu 


Gly 
80 


Gly 


Thr 


Phe 


Tyr 


Phe 
85 


Asn 


Thr 


Ser 


Arg 


Leu 
90 


Lys 


Gin 


Lys 


Asn 




aag 


gag 


aag 


gat 


aag 


teg 


aag 


ggg 


aag 


gcg 


cct 


gaa 


gag 


gac 


gaa 


ggt 


386 


Lys 


Glu 


Lys 


Asp 


Lys 


Ser 


Lys 


Gly 


Lys 


Ala 


Pro 


Glu 


Glu 


Asp 


Glu 


Gly 




95 










100 










105 










110 





ata ttc ate tgatgttctt cagtcagtag ctgcctctgg atgtctttac 435 
He Phe He 
rtttctgttt weettttage 
gcagtggctc acacttgtaa 
ggcctatacc acatagctag 
tcacacctgt aatcccagca 
caacatggtg aaaccccgtc 
gectgtaate ccagctactc 
gaggttgctg tgagtggaga 
tccatcccaa aaaaaaaaaa 

<210> 75 
<211> 464 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 16. .342 

<220> 

<221> sig_peptide 
<222> 16. .114 
<223> Von Heijne matrix 

score 3.66786754931606 

seq LAVFQMLKSMCAG/QR 

<220> 

<221> polyA_site 
<222> 446. .464 

<220> 

<221> polyA_signal 
<222> 412. .417 

<400> 75 

gaatgegect gcgtt atg gcg gcc tec ggc gcc cca agg ate ctg gtg gac 51 
Met Ala Ala Ser Gly Ala Pro Arg He Leu Val Asp 
-30 -25 
ctg ctg aag ctg aac gtg gcc ccc etc gcc gtc ttc cag atg etc aag 99 
Leu Leu Lys Leu Asn Val Ala Pro Leu Ala Val Phe Gin Met Leu Lys 

-20 -15 -10 

tec atg tgt gcc ggg cag agg eta gcg age gag ccc cag gac cct gcg 147 



aaggtgaaac cagtctggam aatggggaga tgggccgggt 495 

tegaaacget ttgggaggcc caggtggaag gatcacttga 555 

accctgtctc actgeaaatt aaaaggctgg gcgtggtggc 615 

ctttgggagg ctgaggcagg cggatcacct gcaccctggc 675 

tttactaaaa atagaaaatt ageegggegt gatggcacac 735 

gggaggctga ggcaggagaa ttgettgaac ctgggaggtg 795 

teatgecatt gcactccagc ctgagcaaca agagcaaaac 855 

aaaa 879 
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ace taggatgggg cagagacttg ttgcatcttt gtccccagca aaggctacat 392 
Thr 

gttacctcct tcaattgata ataaaccttt ctgagatgea gagggtccag gtcaaaaaaa 452 

aaaaaaaaaa aa 464 

<210> 76 

<211> 624 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 59. .367 

<220> 

<221> sig_peptide 

<222> 59. . 166 

<223> Von Heijne matrix 

score 3.66643500261364 

seq RALSTFLFGSIRG/AA 

<220> 

<221> polyA_site 
<222> 609. . 624 

<220> 

<221> polyA_signal 
<222> 588 . . 593 

<400> 76 

agtgcgcaga cgcaggggtc ggcgccgggt gagagcgtgc ggceggatte accacaac 58 
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agg cac aag cag aga cag atg tagacccttt ccctccagag tcacgcacat 
Arg His Lys Gin Arg Gin Met 
65 

actcgtcatc gcatcacttg ggagaatggt tgtatcttat ggaaggaatt atcacatcaa 
ggagtcaggg gaaagtgact ggaagcaaac gccctaaaag ttacccatca cgtttcagtg 
taaatgagta actatagaag acattgcgtt atcttatttc maaaacgttc caactaaaaa 
acattttcct attaaaatag accttccgaa caaaaaaaaa aaaaaaa 



397 



457 
517 
577 
624 



<210> 77 
<211> 1158 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 58. .315 



<220> 

<221> sig_peptide 

<222> 58 . . 165 

<223> Von Heijne matrix 

score 3.68684848854945 
seq SVSLSGFTVGTLS/ET 



<220> 

<221> polyA_site 
<222> 1143. . 1158 



<400> 77 

ctgtgggaga ggccaggatg ctgcattagg cacaggataa cctgggaacc caggcac 
atg ggt cct get etc cga agt ctg caa gtc aag aag gga aca gag cac 
Met Gly Pro Ala Leu Arg Ser Leu Gin Val Lys Lys Gly Thr Glu His 

-35 -30 -25 

gec gac cct etc cct ttc ccc tct gtc tct ctt agt ggc ttt aca gtg 
Ala Asp Pro Leu Pro Phe Pro Ser Val Ser Leu Ser Gly Phe Thr Val 
-20 * -15 -10 -5 

ggt acc ctg tea gaa acc age act ggg ggc cct gee acc ccc aca tgg 
Gly Thr Leu Ser Glu Thr Ser Thr Gly Gly Pro Ala Thr Pro Thr Trp 

15 10 
aag gag tgt cct ate tgt aag gag cgc ttt cct get gag agt gac aag 
Lys Glu Cys Pro lie Cys Lys Glu Arg Phe Pro Ala Glu Ser Asp Lys 

15 20 25 

gat gee ctg gag gac cac atg gat gga cac ttc ttt ttc age acc cag 
Asp Ala Leu Glu Asp His Met Asp Gly His Phe Phe Phe Ser Thr Gin 

30 35 40 

gac ccc ttc acc ttt gag tgatcttact ccctcgtaca tgcacaaata 
Asp Pro Phe Thr Phe Glu 
45 50 

cacactcatg cacacacaca ctcacacaca tgcatacact taggtttcat gcccattttc 
tatcacactg ggctccatga tattctgttc cctaagaact gcttctgtgt gccctgtttt 
catcccaaga tttctcactt catcctctcc tacctggctc ttttgtccca gggaggggtc 
ctgttcggaa gcagtggctg aatttatccc ctgaaagtgg ttttggagga acegggatgg 
aggaggcett cccctgtggg aatagaatcg tccactccta gccctggttg cttctgatac 
acagccactg cacacacaca ctcacactca cactcccttg tytgatgece caaagecaat 
tcctggggca ccctaccctc tcttatttgg agtttccgtt ggtttacctg agttttctct 
ggggtctgea cagaggcagc agcatggaca tcatggcctc tcaggtccct tttggttctc 
agtttcattg gttcctcttt ctgttccccc attgacttct gtgccccacc etagectttt 
ccataacctt aggtattcag tttggagggg ttttttgtat ttttgaggat tcctgtattc 
tgtatcctct cctcgcatct cctcacatgg aaagaaataa tgtatttgtg ccttctgtga 
ggaatggggg gaacaagtgg tcccaggtat ccccatttcc aaggcccccc tccctctcca 
ggtcccccca cagcaataaa agcttccccc tgatatccat ccctttgtag tttgaacaaa 
tatatttata tgatatgaaa aaaaaaaaaa aaa 
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405 
465 
525 
585 
645 
705 
765 
825 
885 
945 
1005 
1065 
1125 
1158 
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<210> 78 

<211> 641 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 187 . . 342 



<220> 

<221> sig_peptide 

<222> 187. .276 

<223> Von Heijne matrix 

score 3.57432920610252 
seq VKLLLKMVSSATT/ER 



<220> 

<221> polyA_site 
<222> 626. . 641 



<400> 78 

attctaaatc aataaatata gcttacccac agacttttta tgggatgtaa agtatgtatt 60 
acaacaggta aatccatttc tctgttttgg gaaagcaggt taatgttttg tttgttttgt 120 
tgctgcatat aatgttgcac tatatatata ttctaatgca gttggtctcc taattctaca 180 
ggataa atg tct age tgt ggc att gtt gga tea tea gtt teg ttt cag 228 

Met Ser Ser Cys Gly lie Val Gly Ser Ser Val Ser Phe Gin 

-30 -25 -20 

tta gat get gtg aaa ttg etc ttg aaa atg gtg tec tct gec acc aca 276 
Leu Asp Ala Val Lys Leu Leu Leu Lys Met Val Ser Ser Ala Thr Thr 

-15 ~ -10 -5 

gaa egg tgt tgt aat gga agt gec aat ttt cat aaa aac ttg tgt gca 324 
Glu Arg Cys Cys Asn Gly Ser Ala Asn Phe His Lys Asn Leu Cys Ala 
1 ' 5 10 15 

aca ggt att aag aat ttt tgaggtcggg egcagtaget cacgcctgta 372 
Thr Gly lie Lys Asn Phe 
2 0 

atcccagcac tttgggaggc ctaggcaggt ggatcacctg aggtcaggag ttccagacca 432 
gcctgaccaa catggtgaaa tcccatctcc actaaataca aaaaattagc tgggcgtggt 4 92 
ggcatacacc tgtaatccca gctacttggg aggctaagac aggagaattg cttgaaccca 552 
ggaggeggag gttgcagtga gecgagattg caccactaca ctccagcctg ggtgatgagc 612 
gaaactccat ctcaaaaaaa aaaaaaaaa 641 



<210> 79 

<211> 660 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 2 . . 658 



<220> 

<221> sig_peptide 

<222> 2. . 121 

<223> Von Heijne matrix 

score 6.4143157541603 
ITMSATVVDAVNA/AP 



<400> 79 

t cct awt cga aaa att acg aga ttg get tgg att ctg teg gat gga ctt 4 9 

Pro Xaa Arg Lys lie Thr Arg Leu Ala Trp He Leu Ser Asp Gly Leu 
-40 -35 -30 -25 

ggg get age tgc ggc ggg get gga gga ggc cag ata acc atg tea gee 97 
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<210> 80 

<211> 647 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 1. . 645 

<220> 

<221> sig_peptide 

<222> 1. . 108 

<223> Von Heijne matrix 

score 5.52542399647426 
I TMSATVXDAVNA/AP 
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<221> sig_peptide 


























<222> 1. 


. .108 




























<223> Von Heijne matrix 


























score 


6.4143157541603 
























seq I TMSATVVDAVNA/AP 






















<400> 81 






























att 


acg 


aga 


ttg 


get 


tgg 


att 


ctg 


teg 


gat 


gga 


ctt 


ggg 


get 


age 


tgc 


48 


lie 


Thr 


Arg 


Leu 


Ala 


Trp 


He 


Leu 


Ser 


Asp 


Gly 


Leu 


Gly 


Ala 


Ser 


Cys 






-35 










-30 










-25 












ggc 


ggg 


get 


gga 


gga 


ggc 


cag 


ata 


ace 


atg 


tea 


gee 


aca 


gtt 


gta 


gat 


96 


Gly 


Gly Ala 


Gly 


Gly 


Gly 


Gin 


He 


Thr 


Met 


Ser 


Ala 


Thr 


Val 


Val 


Asp 




-20 










-15 










-10 










-5 




gca 


gtt 


aat 


get 


gca 


ccc 


eta 


teg 


ggg 


tec 


aaa 


gaa 


atg 


agt 


ttg 


gaa 


144 


Ala 


Val 


Asn 


Ala 


Ala 


Pro 


Leu 


Ser 


Gly 


Ser 


Lys 


Glu 


Met 


Ser 


Leu 


Glu 












1 








5 










10 








gaa 


cca 


aag 


aag 


atg 


ace 


aga 


gag 


gac 


tgg 


aga 


aag 


aag 


aag 


gag 


eta 


192 
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Glu 


Pro 


Lys 


Lys 


Met 


Thr 


Arg 


Glu 


Asp 


Trp 


Arg 


Lys 


Lys 


Lys 


Glu 


Leu 








15 










20 










25 










gaa 


gaa 


cag 


cga 


aaa 


ttg 


ggc 


aat 


get 


cct 


gca 


gaa 


gtt 


gat 


gaa 


gaa 


240 


Glu 


Glu 


Gin 


Arg 


Lys 


Leu 


Gly 


Asn 


Ala 


Pro 


Ala 


Glu 


Val 


Asp 


Glu 


Glu 






30 










35 










40 












gga 


aaa 


gac 


ate 


aac 


ccc 


cat 


att 


cct 


cag 


tat 


att 


tct 


tea 


gtg 


cca 


288 


Gly 


Lys 


Asp 


He 


Asn 


Pro 


His 


He 


Pro 


Gin 


Tyr 


He 


Ser 


Ser 


Val 


Pro 




45 










50 










55 










60 




tgg 


tat 


att 


gat 


cct 


tea 


aaa 


aga 


cct 


act 


tta 


aaa 


cac 


cag 


aga 


cca 


336 


Trp 


Tyr 


He 


Asp 


Pro 


Ser 


Lys 


Arg 


Pro 


Thr 


Leu 


Lys 


His 


Gin 


Arg 


Pro 












65 










70 










75 






caa 


cca 


gaa 


aaa 


caa 


aag 


cag 


ttc 


age 


tea 


tct 


gga 


gaa 


tgg 


tac 


aag 


384 


Gin 


Pro 


Glu 


Lys 


Gin 


Lys 


Gin 


Phe 


Ser 


Ser 


Ser 


Gly 


Glu 


Trp 


Tyr 


Lys 










80 










85 










90 








agg 


ggt 


gta 


aaa 


gag 


aat 


tec 


ata 


att 


act 


aag 


tac 


cgc 


aaa 


gga 


gca 


432 


Arg 


Gly 


Val 


Lys 


Glu 


Asn 


Ser 


He 


He 


Thr 


Lys 


Tyr 


Arg 


Lys 


Gly 


Ala 








95 










100 










105 










tgt 


gaa 


aat 


tgt 


ggg 


gec 


atg 


aca 


cac 


aaa 


aag 


raa 


gac 


tgc 


ttt 


gag 


480 


Cys 


Glu 


Asn 


Cys 


Gly 


Ala 


Met 


Thr 


His 


Lys 


Lys 


Xaa 


Asp 


Cys 


Phe 


Glu 






110 










115 










120 












aga 


cct 


agg 


cga 


gtt 


gga 


gee 


aaa 


ttt 


aca 


ggt 


act 


aat 


ata 


get 


cca 


528 


Arg 


Pro 


Arg 


Arg 


Val 


Gly 


Ala 


Lys 


Phe 


Thr 


' Gly 


Thr 


Asn 


He 


Ala 


Pro 




125 










130 










135 










140 




gat 


gaa 


cat 


gtc 


cag 


cct 


caa 


ctg 


atg 


ttt 


gac 


tat 


gat 


ggg 


aag 


agg 


576 


Asp 


Glu 


His 


Val 


Gin 


Pro 


Gin 


Leu 


Met 


Phe 


Asp 


Tyr 


Asp 


Gly 


Lys 


Arg 












145 










150 










155 






gat 


egg 


tgg 


aat 


ggc 


tac 


aat 


cca 


gaa 


gaa 


cac 


atg 


aaa 


att 


gtt 


gaa 


624 


Asp 


Arg 


Trp 


Asn 


Gly 


Tyr 


Asn 


Pro 


Glu 


Glu 


His 


Met 


Lys 


He 


Val 


Glu 










160 










165 










170 








gag 


tat 


gec 


aaa 


gtt 


gat 


t 




















643 


Glu 


Tyr 


Ala 


Lys 


Val 


Asp 
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<210> 82 

<211> 1200 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 29. .469 

<220> 

<221> sig_peptide 

<222> 29. . 73 

<223> Von Heijne matrix 

score 4.5958301326157 

seq VLVSXCSARLLQQ/EE 

<220> 

<221> polyA_site 
<222> 1185. . 1200 

<400> 82 

acttggegag tgagacgctg atgggagg atg gac gta ctg gtg tct rag tgc 52 

Met Asp Val Leu Val Ser Xaa Cys 
-15 -10 
tec gcg egg ctg ctg cag cag gaa gaa gag att aaa tct ctg act get 100 
Ser Ala Arg Leu Leu Gin Gin Glu Glu Glu He Lys Ser Leu Thr Ala 

-5 15 
gaa att gac egg ttg aaa aac tgt ggc tgt tta gga get tct cca aat 148 
Glu He Asp Arg Leu Lys Asn Cys Gly Cys Leu Gly Ala Ser Pro Asn 
10 15 " ~ 20 25 



WO 02/083898 



PCT/IB01/00914 



69 



cga ctg 
Arg Leu 
4 0 

act aaa 
Thr Lys 
55 

cat gca 
His Ala 

gtg aca 
Val Thr 



agt get atg 
Ser Ala Met 



ttg gag cag tta caa gaa gaa aat tta aaa tta aag tat 
Leu Glu Gin Leu Gin Glu Glu Asn Leu Lys Leu Lys Tyr 

30 35 
att ctt cga aag agt ctt cag gca gaa agg aac aaa cca 
lie Leu Arg Lys Ser Leu Gin Ala Glu Arg Asn Lys Pro 

4 5 50 
atg att aac att att age cgc eta caa gag gtc ttt ggt 
Met lie Asn lie lie Ser Arg Leu Gin Glu Val Phe Gly 

60 65 70 

aag get gca tat cca gat ttg gaa aat cct cct ctg eta 
Lys Ala Ala Tyr Pro Asp Leu Glu Asn Pro Pro Leu Leu 

75 8 0 8 5 

agt cag cag gee aag ttt ggg gac tat cag tgt aat 
Ser Gin Gin Ala Lys Phe Gly Asp Tyr Gin Cys Asn 
90 95 100 

att tct cag gtg atg tat tgt cat gac tct tgg 
lie Ser Gin Val Met Tyr Cys His Asp Ser Trp 

110 115 
aag tat tat tat cat cat tgc cat tta cag aaa 
Lys Tyr Tyr Tyr His His Cys His Leu Gin Lys 

125 130 
tccttagtga aaaggacatt tgccacagtt tgaaaaactt gagaaaggag ttgggggggt 
atatgtttta acttttttag gcacaatttt taaggtttgg ttaaatttta tatgtattct 
caatatttaa gggcaatcat tggtactctt ttgtttaggt atttccctcc tgctgtgtcc 
aggaktgctg tgtggtggtr atgagtgctg ggaggtgaaa aattaaaata agecatttae 
cagtcagcat cccaattaaa tatttgatgt aactgtgatc tttgagccag gcttatatat 
tcattttcaa gcagaggagt tccccatttt aaatagaggc attgtctgat gtgtttatgg 
ttaactgeat ctggcttggg tctttctgtt ttcctttctt tgctgaattg gaaggggtta 
ctctgaagag tccaggtctt acagtgtggt ttatttctca agtgtgaata ttgcacacct 
tcatggcttg aaaattagaa atgtaaatat gctggaacca gacttcataa ctgtagactt 
tgttatgtca tcattaaaaa gttcgaggtg ttgtaatccc agctactagg gaggctgagg 
caggagaatt gcttgaaccc aggggeggag gttacagtga gecgagatea caccactgca 
ctccagcctg ggcaacagag cgagactcca tctctaaaaa aaaaaaaaaa a 



aat 
Asn 

aat 
Asn 

att 
He 

cca 
Pro 

ggt 
Gly 
105 
ttt 
Phe 



ctg ttt gat ttt 
Leu Phe Asp Phe 
120 

taatactatt acaagttgta 



196 



244 



292 



340 



388 



436 



489 



549 
609 
669 
729 
789 
849 
909 
969 
1029 
1089 
1149 
1200 



<210> 83 
<211> 351 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 1. .351 



<220> 

<221> sig_peptide 

<222> 1. . 129 

<223> Von Heijne matrix 

score 4.10872570449087 
LGQVAAAAAAANA/AS 



<220> 

<221> polyA__site 
<222> 336. . 351 



<220> 

<221> polyA_signal 
<222> 306. . 311 



<400> 83 

ctt tct cca get caa gcg tct ggt att tgg tgt agg gac age get ttt 48 

Leu Ser Pro Ala Gin Ala Ser Gly He Trp Cys Arg Asp Ser Ala Phe 

-40 -35 -30 

tec ggg tgg age gag cgt gga tec agt teg egg egg ggt tgt ttg ggt 96 
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Ser Gly Trp Ser Glu Arg 
-25 

caa gtt get get get get 
Gin Val Ala Ala Ala Ala 
-10 

tea gcg aac age ctg gga 
Ser Ala Asn Ser Leu Gly 
10 

acg age cgc ccg tec cag 
Thr Ser Arg Pro Ser Gin 
25 

ttc ccc ccg teg ccg ctg 
Phe Pro Pro Ser Pro Leu 
40 

tgt tac tgc ttc eta eta 
Cys Tyr Cys Phe Leu Leu 
55 

aaa aaa aaa aaa aaa 
Lys Lys Lys Lys Lys 
70 



70 

Gly Ser Ser Ser Arg Arg 
-2 0 

gcg get gcg aat get get 
Ala Ala Ala Asn Ala Ala 
-5 1 
teg ggt ggt cgc tgc aag 
Ser Gly Gly Arg Cys Lys 
15 

teg egg ccc cga tec ctg 
Ser Arg Pro Arg Ser Leu 
30 

ccg cct tct cct gca gga 
Pro Pro Ser Pro Ala Gly 
45 

ata aat get gac ctg ate 
lie Asn Ala Asp Leu lie 
60 65 
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Gly Cys Leu Gly 
-15 

tct cct cct cct 144 
Ser Pro Pro Pro 
5 

ctg agg get ccg 192 
Leu Arg Ala Pro 
20 

ccc cag gee egg 240 
Pro Gin Ala Arg 
35 

acg agt tgc aca 288 

Thr Ser Cys Thr 

50 

aaa tgg age cca 336 
Lys Trp Ser Pro 

351 



<210> 84 

<211> 438 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 15. .278 



<220> 

<221> sig_peptide 

<222> 15 . . 122 

<223> Von Heijne matrix 

score 3.79790641648006 
seq I LMRDFS PSG I FG/ AF 



<220> 

<221> polyA_site 
<222> 423. .438 



<220> 

<221> polyA_signal 
<222> 396. .401 



<400> 84 



accaggactc caaa 


atg 


gcg 


tea 


gtt 


gta 










Met 


Ala 


Ser 


Val 


Val 












-35 








ctg 


gag 


gtc 


aaa 


ctg 


ggg 


gag 


ctg 


cca 


Leu 


Glu 


Val 


Lys 


Leu 


Gly 


Glu 


Leu 


Pro 










-20 










ttc 


agt 


cct 


agt 


ggc 


att 


ttc 


gga 


gcg 


Phe 


Ser 


Pro 


Ser 


Gly 


He 


Phe 


Gly 


Ala 








-5 










1 


tac 


tac 


aac 


aag 


tac 


ate 


aat 


gtg 


aag 


Tyr 


Tyr 


Asn 


Lys 


Tyr 


He 


Asn 


Val 


Lys 




10 










15 






acc 


atg 


gtg 


ctg 


gca 


tgc 


tac 


gtg 


etc 


Thr 


Met 


Val 


Leu 


Ala 


Cys 


Tyr 


Val 


Leu 


25 










30 








aag 


cat 


etc 


aag 


cac 


gag 


egg 


etc 


cgc 


Lys 


His 


Leu 


Lys 


His 


Glu 


Arg 


Leu 


Arg 



cca gtg aag gac aag aaa ctt 50 
Pro Val Lys Asp Lys Lys Leu 
-30 -25 
age tgg ate ttg atg egg gac 98 
Ser Trp He Leu Met Arg Asp 
-15 -10 
ttt caa aga ggt tac tac egg 146 
Phe Gin Arg Gly Tyr Tyr Arg 
5 

aag ggg age ate teg ggg att 194 
Lys Gly Ser lie Ser Gly He 
20 

ttt age tac tec ttt tec tac 242 
Phe Ser Tyr Ser Phe Ser Tyr 

35 40 
aaa tac cac tgaagaggac 288 
Lys Tyr His 
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45 50 

acactctgca cccccccacc ccacgacctt ggcccgagcc cctccgtgag gaacacaatc 348 

tcaatcgttg ctgaatcctt tcatatccta ataggaatta acctccaaat aaaacatgac 408 
tggtacgtgt gctgaaaaaa aaaaaaaaaa 438 

<210> 85 

<211> 419 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 74 . .259 

<220> 

<221> sig_peptide 

<222> 74 . .199 

<223> Von Heijne matrix 

score 3.56683458636822 

seq CXXSCCSCCPVGC/AK 

<220> 

<221> polyA_site 
<222> 404 . .419 

<220> 

<221> polyA_signal 
<222> 388. .393 

<400> 85 

cactccgcct tccacgtgca cccactgcct cttcccttct cgcttgggaa ctctagtctc 60 
gcctcgggtt gca atg gac ccc aac tgc tec tgt gec get gca ggt gtc 109 
Met Asp Pro Asn Cys Ser Cys Ala Ala Ala Gly Val 















-40 










-35 












tec 


tgc 


ace 


tgc 


gee 


age 


tec 


tgc 


aag 


tgc 


aaa 


gag 


tgc 


aaa 


tgc 


acc 


157 


Ser 


Cys 


Thr 


Cys 


Ala 


Ser 


Ser 


Cys 


Lys 


Cys 


Lys 


Glu 


Cys 


Lys 


Cys 


Thr 




-30 










-25 










-20 










-15 




tec 


tgc 


aar 


rag 


age 


tgc 


tgc 


tec 


tgc 


tgc 


cct 


gtg 


ggc 


tgt 


gee 


aag 


205 


Ser 


Cys 


Lys 


Xaa 


Ser 


Cys 


Cys 


Ser 


Cys 


Cys 


Pro 


Val 


Gly 


Cys 


Ala 


Lys 












-10 










-5 










1 






tgt 


gee 


cag 


ggc 


tgc 


ate 


tgc 


aaa 


ggg 


gca 


teg 


gag 


aag 


tgc 


age 


tgc 


253 


Cys 


Ala 


Gin 


Gly 


Cys 


He 


Cys 


Lys 


Gly 


Ala 


Ser 


Glu 


Lys 


Cys 


Ser 


Cys 








5 










10 










15 










tgc 


gee 


tgatgtcggg . 


acagccctgc tcccaagtac aaatagagtg 


accegtaaaa 


309 



Cys Ala 
20 

tccaggattt tttgtttttt gctacaatct tgaccccttt gctacattcc tttttttctg 369 
tgaaatatgt gaataataat taaacactta gaccaaaaaa aaaaaaaaaa 419 

<210> 86 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 1 . . 396 

<220> 

<221> polyA_site 
<222> 381. .396 



<400> 86 
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etc 


ttc 


ccg 


get 


cca 


get 


ccg 


ccg 


cca 


get 


cca 


gee 


ttt 


get 


ccm 


cct 


48 


Leu 


Phe 


Pro 


Ala 


Pro 


Ala 


Pro 


Pro 


Pro 


Ala 


Pro 


Ala 


Phe 


Ala 


Pro 


Pro 




1 








5 










10 










15 






ccc 


aaa 


gtc 


ccc 


tec 


ccg 


gag 


egg 


age 


gca 


cct 


agg 


gtc 


cct 


ctt 


ccg 


96 


Pro 


Lys 


Val 


Pro 


Ser 


Pro 


Glu 


Arg 


Ser 


Ala 


Pro 


Arg 


Val 


Pro 


Leu 


Pro 










20 










25 










30 








tec 


ccc 


cag 


ccc 


age 


tac 


ccr 


ttc 


aga 


cca 


gca 


gee 


teg 


ggg 


ggc 


ace 


144 


Ser 


Pro 


Gin 


Pro 


Ser 


Tyr 


Pro 


Phe 


Arg 


Pro 


Ala 


Ala 


Ser 


Gly 


Gly 


Thr 








35 










40 










45 










ccc 


ccg 


cca 


gee 


tgc 


etc 


cct 


ccc 


get 


cag 


ccc 


tgc 


cag 


ggt 


tec 


cca 


192 


Pro 


Pro 


Pro 


Ala 


Cys 


Leu 


Pro 


Pro 


Ala 


Gin 


Pro 


Cys 


Gin 


Gly 


Ser 


Pro 






50 










55 










60 












gec 


atg 


aat 


etc 


ttc 


cga 


ttc 


ctg 


gga 


gac 


etc 


tec 


cac 


etc 


etc 


gee 


240 


Ala 


Met 


Asn 


Leu 


Phe 


Arg 


Phe 


Leu 


Gly Asp 


Leu 


Ser 


His 


Leu 


Leu 


Ala 




65 










70 










75 










80 




ate 


ate 


ttg 


eta 


ctg 


etc 


aaa 


ate 


tgg 


aag 


tec 


cgc 


teg 


tgc 


gee 


gee 


288 


He 


He 


Leu 


Leu 


Leu 


Leu 


Lys 


He 


Trp 


Lys 


Ser 


Arg 


Ser 


Cys 


Ala 


Ala 












85 










90 










95 






cac 


ccc 


cag 


ctt 


cct 


etc 


tec 


ttc 


tgt 


ctg 


tct 


gtc 


tgt 


ctg 


tct 


gtc 


336 


His 


Pro 


Gin 


Leu 


Pro 


Leu 


Ser 


Phe 


Cys 


Leu 


Ser 


Val 


Cys 


Leu 


Ser 


Val 










100 










105 










110 








tct 


etc 


tct 


etc 


tct 


gkc 


tct 


etc 


tct 


etc 


tct 


ttc 


tct 


gtc 


tea 


aaa 


384 


Ser 


Leu 


Ser 


Leu 


Ser 


Xaa 


Ser 


Leu 


Ser 


Leu 


Ser 


Phe 


Ser 


Val 


Ser 


Lys 








115 










120 










125 










aaa 


aaa 


aaa 


aaa 


























396 


Lys 


Lys 


Lys 


Lys 





























130 



<210> 87 

<211> 493 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 3. .491 

<400> 87 

aa ccc cac cgc eta act etc cca act ctg cag tec gtg ace ttc agg 47 
Pro His Arg Leu Thr Leu Pro Thr Leu Gin Ser Val Thr Phe Arg 
15 10 15 



tgc 


cct 


tea 


agg 


tea 


gaa 


aaa 


etc 


ggc 


aag 


aat 


atg 


gtt 


tct 


agt 


ttt 


95 


Cys 


Pro 


Ser 


Arg 


Ser 


Glu 


Lys 


Leu 


Gly 


Lys 


Asn 


Met 


Val 


Ser 


Ser 


Phe 












20 










25 










30 






agg 


gtt 


tct 


gaa 


eta 


caa 


gta 


tta 


eta 


ggc 


ttt 


get 


gga 


egg 


aat 


aaa 


143 


Arg 


Val 


Ser 


Glu 


Leu 


Gin 


Val 


Leu 


Leu 


Gly 


Phe 


Ala 


Gly 


Arg 


Asn 


Lys 










35 










40 










45 








agt 


gga 


cgc 


aag 


cat 


gac 


etc 


ctg 


atg 


agg 


gcg 


ctg 


cat 


tta 


ttg 


aag 


191 


Ser 


Gly 


Arg 


Lys 


His 


Asp 


Leu 


Leu 


Met 


Arg 


Ala 


Leu 


His 


Leu 


Leu 


Lys 








50 










55 










60 










age 


ggc 


tgc 


age 


cct 


gcg 


gtt 


cag 


att 


aaa 


ate 


cga 


gaa 


ttg 


tat 


aga 


239 


Ser 


Gly 


Cys 


Ser 


Pro 


Ala 


Val 


Gin 


He 


Lys 


He 


Arg 


Glu 


Leu 


Tyr 


Arg 






65 










70 










75 












cgc 


cga 


tat 


cca 


cga 


act 


ctt 


gaa 


gga 


ctt 


tct 


gat 


tta 


tec 


aca 


ate 


287 


Arg 


Arg 


Tyr 


Pro 


Arg 


Thr 


Leu 


Glu 


Gly 


Leu 


Ser 


Asp 


Leu 


Ser 


Thr 


He 




80 










85 










90 










95 




aaa 


tea 


teg 


gtt 


ttc 


agt 


ttg 


gat 


ggt 


ggc 


tea 


tea 


cct 


gta 


gaa 


cct 


335 


Lys 


Ser 


Ser 


Val 


Phe 


Ser 


Leu 


Asp 


Gly 


Gly 


Ser 


Ser 


Pro 


Val 


Glu 


Pro 










100 










105 










110 






gac 


ttg 


gee 


gtg 


get 


gga 


ate 


cac 


teg 


ttg 


cct 


tec 


act 


tea 


gtt 


aca 


383 


Asp 


Leu 


Ala 


Val 


Ala 


Gly 


He 


His 


Ser 


Leu 


Pro 


Ser 


Thr 


Ser 


Val 


Thr 








115 










120 










125 








cct 


cac 


tea 


cca 


tec 


tct 


cct 


gtt 


ggt 


tct 


gtg 


ctg 


ctt 


caa 


gat 


act 


431 
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Pro His Ser Pro 
130 

aag ccc aca ttt 
Lys Pro Thr Phe 
145 

cat cct gat gtg 
His Pro Asp Val 
160 

<210> 88 
<211> 500 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 3. . 500 

<400> 88 

aa ccc cac cgc eta act etc cca act ctg cag tec gtg acc ttc agg 47 
Pro His Arg Leu Thr Leu Pro Thr Leu Gin Ser Val Thr Phe Arg 
15 10 15 



tgc 


cct 


tea 


agg 


tea 


gaa 


aaa 


etc 


ggc 


aag 


aat 


atg 


gtt 


tct 


agt 


ttt 


95 


Cys 


Pro 


Ser 


Arg 


Ser 


Glu 


Lys 


Leu 


Gly 


Lys 


Asn 


Met 


Val 


Ser 


Ser 


Phe 












20 










25 










30 






agg 


gtt 


tct 


gaa 


eta 


caa 


gta 


tta 


eta 


ggc 


ttt 


get 


gga 


egg 


aat 


aaa 


143 


Arg 


Val 


Ser 


Glu 


Leu 


Gin 


Val 


Leu 


Leu 


Gly 


Phe 


Ala 


Gly 


Arg 


Asn 


Lys 










35 










40 










45 








agt 


gga 


cgc 


aag 


cat 


gac 


etc 


ctg 


atg 


agg 


gcg 


ctg 


cat 


tta 


ttg 


aag 


191 


Ser 


Gly 


Arg 


Lys 


His 


Asp 


Leu 


Leu 


Met 


Arg 


Ala 


Leu 


His 


Leu 


Leu 


Lys 








50 










55 










60 










age 


ggc 


tgc 


age 


cct 


gcg 


gtt 


cag 


att 


aaa 


ate 


cga 


gaa 


ttg 


tat 


aga 


239 


Ser 


Gly 


Cys 


Ser 


Pro 


Ala 


Val 


Gin 


He 


Lys 


He 


Arg 


Glu 


Leu 


Tyr 


Arg 






65 










70 










75 












cgc 


cga 


tat 


cca 


cga 


act 


ctt 


gaa 


gga 


ctt 


tct 


gat 


tta 


tec 


aca 


ate 


287 


Arg 


Arg 


Tyr 


Pro 


Arg 


Thr 


Leu 


Glu 


Gly 


Leu 


Ser 


Asp 


Leu 


Ser 


Thr 


He 




80 










85 










90 










95 




aaa 


tea 


teg 


gtt 


ttc 


agt 


ttg 


gat 


ggt 


ggc 


tea 


tea 


cct 


gta 


gaa 


cct 


335 


Lys 


Ser 


Ser 


Val 


Phe 


Ser 


Leu 


Asp 


Gly 


Gly 


Ser 


Ser 


Pro 


Val 


Glu 


Pro 












100 










105 










110 






gac 


ttg 


gee 


gtg 


get 


gga 


ate 


cac 


teg 


ttg 


cct 


tec 


act 


tea 


gtt 


aca 


383 


Asp 


Leu 


Ala 


Val 


Ala 


Gly 


He 


His 


Ser 


Leu 


Pro 


Ser 


Thr 


Ser 


Val 


Thr 










115 










120 










125 








cct 


cac 


tea 


cca 


tec 


tct 


cct 


gtt 


ggt 


tct 


gtg 


ctg 


ctt 


caa 


gat 


act 


431 


Pro 


His 


Ser 


Pro 


Ser 


Ser 


Pro 


Val 


Gly 


Ser 


Val 


Leu 


Leu 


Gin 


Asp 


Thr 








130 










135 










140 










aag 


ccc 


aca 


ttt 


gag 


atg 


cag 


cag 


cca 


tct 


ccc 


cca 


att 


cct 


cct 


gtc 


479 


Lys 


Pro 


Thr 


Phe 


Glu 


Met 


Gin 


Gin 


Pro 


Ser 


Pro 


Pro 


He 


Pro 


Pro 


Val 






145 










150 










155 












cat 


cct 


gat 


gtg 


cag 


tta 


aaa 




















500 


His 


Pro 


Asp 


Val 


Gin 


Leu 


Lys 























160 165 

<210> 89 

<211> 544 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 141 . .494 

<220> 



Ser Ser Pro Val Gly 
135 

gag atg cag cag cca 
Glu Met Gin Gin Pro 
150 

ca 



73 

Ser Val Leu Leu Gin 
140 

tct ccc cca att cct 
Ser Pro Pro lie Pro 
155 
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Asp Thr 

cct gtc 479 
Pro Val 

493 
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<221> polyA_site 
<222> 529. . 544 



<220> 

<221> polyA_signal 
<222> 508. .513 



<400> 89 

agtctcgcga taactgcgca ggcgcggacc aaagcgatct cttctgagga tccggcaaga 60 
tggcagaagt atagcagaag aagaagcgga ccttccgcaa gttcacctac cgcggcgtgg 120 
acctcgacca gctgctggac atg tec tac gag cag ctg atg cag ctg tac agt 173 

Met Ser Tyr Glu Gin Leu Met Gin Leu Tyr Ser 
15 10 



gcg 


cgc 


cag 


egg 


egg 


egg 


ctg 


aac 


egg 


ggc 


ctg 


egg 


egg 


aag 


cag 


cac 


221 


Ala 


Arg 


Gin 


Arg 
15 


Arg 


Arg 


Leu 


Asn 


Arg 
20 


Gly 


Leu 


Arg 


Arg 


Lys 
25 


Gin 


His 




tec 


ctg 


ctg 


aag 


cgc 


ctg 


cgc 


aag 


gee 


aag 


aag 


gag 


gcg 


ccg 


ccc 


atg 


269 


Ser 


Leu 


Leu 

30 


Lys 


Arg 


Leu 


Arg 


Lys 
35 


Ala 


Lys 


Lys 


Glu 


Ala 
40 


Pro 


Pro 


Met 




gag 


aag 


ccg 


gaa 


gtg 


gtg 


aag 


acg 


cac 


ctg 


egg 


gac 


atg 


ate 


ate 


eta 


317 


Glu 


Lys 
45 


Pro 


Glu 


Val 


Val 


Lys 
50 


Thr 


His 


Leu 


Arg 


Asp 
55 


Met 


He 


He 


Leu 




ccc 


gag 


atg 


gtg 


ggc 


age 


atg 


gtg 


ggc 


gtc 


tac 


aac 


ggc 


aag 


acc 


ttc 


365 


Pro 


Glu 


Met 


Val 


Gly 


Ser 


Met 


Val 


Gly 


Val 


Tyr 


Asn 


Gly 


Lys 


Thr 


Phe 




60 










65 










70 










75 




aac 


cag 


gtg 


gag 


ate 


aag 


ccc 


gag 


atg 


ate 


ggc 


cac 


tac 


ctg 


ggc 


gag 


413 


Asn 


Gin 


Val 


Glu 


He 
80 


Lys 


Pro 


Glu 


Met 


He 

85 


Gly 


His 


Tyr 


Leu 


Gly 
90 


Glu 




ttc 


tec 


ate 


acc 


tac 


aag 


ccc 


gta 


aag 


cat 


ggc 


egg 


ccc 


ggc 


ate 


ggg 


461 


Phe 


Ser 


He 


Thr 
95 


Tyr 


Lys 


Pro 


Val 


Lys 
100 


His 


Gly 


Arg 


Pro 


Gly 
105 


He 


Gly 




gec 


ace 


cac 


tec 


tec 


cgc 


ttc 


ate 


cct 


etc 


aag 


taatggctca gctaataaag 


514 


Ala 


Thr 


His 
110 


Ser 


Ser 


Arg 


Phe 


He 
115 


Pro 


Leu 


Lys 















gcgcacatga ctccaaaaaa aaaaaaaaaa 544 



<210> 90 

<211> 304 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 2. .304 



<400> 90 

a cgt acg gca ggg egg ggt egg ccc gca gtg gee age tgg gag eta cga 4 9 

Arg Thr Ala Gly Arg Gly Arg Pro Ala Val Ala Ser Trp Glu Leu Arg 
15 10 15 



gcg 


egg 


ast 


tgc 


gca 


gaa 


gac 


ccc 


cat 


cag 


ggt 


gcg 


ggg 


tgc 


agt 


tgc 


97 


Ala 


Arg 


Xaa 


Cys 


Ala 


Glu 


Asp 


Pro 


His 


Gin 


Gly 


Ala 


Gly 


Cys 


Ser 


Cys 










20 










25 










30 








ggc 


tec 


agg 


gee 


atg 


gcg 


gag 


gag 


cag 


ggc 


egg 


gaa 


egg 


gac 


teg 


gtt 


145 


Gly 


Ser 


Arg 


Ala 


Met 


Ala 


Glu 


Glu 


Gin 


Gly 


Arg 


Glu 


Arg 


Asp 


Ser 


Val 








35 










40 










45 










ccc 


aag 


ccg 


teg 


gtg 


ctg 


ttc 


etc 


cac 


cca 


gac 


ctg 


ggc 


gtg 


ggc 


ggc 


193 


Pro 


Lys 


Pro 


Ser 


Val 


Leu 


Phe 


Leu 


His 


Pro 


Asp 


Leu 


Gly 


Val 


Gly 


Gly 






50 










55 










60 












get 


gag 


egg 


ctg 


gtg 


etc 


ttg 


cct 


ttt 


ccc 


act 


gag 


aga 


agg 


ctg 


etc 


241 


Ala 


Glu 


Arg 


Leu 


Val 


Leu 


Leu 


Pro 


Phe 


Pro 


Thr 


Glu 


Arg 


Arg 


Leu 


Leu 
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70 










75 










80 




ttt 


tgt 


act 


gee 


ccc 


cgc 


tea 


tta 


aac 


age 


etc 


ccc 


eta 


aaa 


aaa 


aaa 


289 


Phe 


Cys 


Thr 


Ala 


Pro 


Arg 


Ser 


Leu 


Asn 


Ser 


Leu 


Pro 


Leu 


Lys 


Lys 


Lys 
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75 



85 



90 



95 



aaa awt gcr aaa age 
Lys Xaa Ala Lys Ser 
100 
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304 



<210> 91 

<211> 302 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 3. .302 



<220> 

<221> polyA__site 
<222> 281. .302 



<220> 

<221> polyA_signal 
<222> 230. .235 



<400> 91 

ac gta egg cag ggc ggg gtc ggc ccg cag tgg cca get ggg age tac 47 
Val Arg Gin Gly Gly Val Gly Pro Gin Trp Pro Ala Gly Ser Tyr 
15 10 15 

gag cgc gga get tgc gca gaa gac ccc cat cag ggt gcg ggg tgc agt 95 
Glu Arg Gly Ala Cys Ala Glu Asp Pro His Gin Gly Ala Gly Cys Ser 

20 25 30 

tgc ggc tec agg gee atg gcg gag gag cag ggc egg gaa egg gac teg 143 
Cys Gly Ser Arg Ala Met Ala Glu Glu Gin Gly Arg Glu Arg Asp Ser 

35 40 45 

gtt ccc aag ccg teg gtg ctg ttc etc cac cca gac ctg ggc gtg ggc 191 
Val Pro Lys Pro Ser Val Leu Phe Leu His Pro Asp Leu Gly Val Gly 

50 55 60 

ggc get gag egg ctg gtg etc ttg cct ttt ccc act gag aga agg ctg 239 
Gly Ala Glu Arg Leu Val Leu Leu Pro Phe Pro Thr Glu Arg Arg Leu 

65 70 75 

etc ttt tgt act gee ccc cgc tea tta aac age etc ccc eta aaa aaa 287 
Leu Phe Cys Thr Ala Pro Arg Ser Leu Asn Ser Leu Pro Leu Lys Lys 
80 85 90 95 

aaa aaa aat gcg aaa 302 
Lys Lys Asn Ala Lys 
100 



<210> 92 

<211> 596 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 154 . .594 



<400> 92 

atagtgagcg gcagacagag gggtggccag tttgttgttt tacctaaaag gtgttttcca 60 
catggagtgg gactgagagg cacaggtggc tgaggggacc cgcctgggac gtgaggegea 120 
ggtcctcggc cccttcatct gtcagctgea gca atg gaa tac gtg ctg gaa gtg 174 

Met Glu Tyr Val Leu Glu Val 

1 5 

aag aac tct ccg egg cac etc ctg aag caa ttc aca gtg tgt gac gtt 222 
Lys Asn Ser Pro Arg His Leu Leu Lys Gin Phe Thr Val Cys Asp Val 
10 15 20 
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cct 


ctg 


tat 


gac 


att 


tgt 


gac 


tac 


aac 


gtc 


tec 


aga 


gac 


cga 


tgc 


cag 


270 


Pro 


Leu 


Tyr 


Asp 


He 


Cys 


Asp 


Tyr 


Asn 


Val 


Ser 


Arg 


Asp 


Arg 


Cys 


Gin 






25 










30 










35 












gag 


etc 


ggg 


tgc 


tgc 


tty 


tac 


gaa 


ggc 


gty 


tgc 


tac 


aag 


aaa 


gcg 


gtt 


318 


Glu 


Leu 


Gly 


Cys 


Cys 


Phe 


Tyr 


Glu 


Gly 


Val 


Cys 


Tyr 


Lys 


Lys 


Ala 


Val 




40 










45 










50 










55 




ccc 


att 


tac 


ate 


cac 


gtg 


ttc 


tct 


gcc 


ttg 


att 


gtg 


ate 


aaa 


get 


ggg 


366 


Pro 


He 


Tyr 


He 


His 


Val 


Phe 


Ser 


Ala 


Leu 


He 


Val 


He 


Lys 


Ala 


Gly 












60 










65 










70 






gcc 


ttc 


gtc 


ate 


ace 


ate 


ate 


tac 


aga 


gtc 


att 


cag 


gag 


age 


agg 


aaa 


414 


Ala 


Phe 


Val 


He 


Thr 


He 


He 


Tyr 


Arg 


Val 


He 


Gin 


Glu 


Ser 


Arg 


Lys 










75 










80 










85 








gaa 


aag 


gsc 


ate 


cct 


gtg 


tat 


gtc 


gcg 


ctg 


cca 


cag 


aag 


tec 


age 


gaa 


462 


Glu 


Lys 


Xaa 


He 


Pro 


Val 


Tyr 


Val 


Ala 


Leu 


Pro 


Gin 


Lys 


Ser 


Ser 


Glu 








90 










95 










100 










aag 


gcg 


gag 


ttg 


gcc 


tea 


tec 


age 


age 


aag 


tta 


ggg 


ctg 


aag 


cct 


gcg 


510 


Lys 


Ala 


Glu 


Leu 


Ala 


Ser 


Ser 


Ser 


Ser 


Lys 


Leu 


Gly 


Leu 


Lys 


Pro 


Ala 






105 










110 










115 












agt 


cct 


ggg 


cct 


cca 


agt 


get 


ggg 


ccc 


teg 


atg 


aag 


agt 


gac 


gag 


gat 


558 


Ser 


Pro 


Gly 


Pro 


Pro 


Ser 


Ala 


Gly 


Pro 


Ser 


Met 


Lys 


Ser 


Asp 


Glu 


Asp 




120 










125 










130 










135 




aag 


gat 


gat 


gta 


aca 


ggg 


aca 


ata 


aca 


gaa 


gcc 


gaa 


ga 








596 


Lys 


Asp 


Asp 


Val 


Thr 


Gly 


Thr 


He 


Thr 


Glu 


Ala 


Glu 













140 145 

<210> 93 

<211> 596 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 154 . .594 

<400> 93 

atagtgagcg gcagacagag gggtggccag tttgttgttt tacctaaaag gtgttttcca 60 
catggagtgg gactgagagg cacaggtggc tgaggggacc cgcctgggac gtgaggegea 120 
ggtcctcggc cccttcatct gtcagctgea gca atg gaa tac gtg ctg gaa gtg 174 

Met Glu Tyr Val Leu Glu Val 
1 5 



aag 


aac 


tct 


ccg 


egg 


cac 


etc 


ctg 


aag 


caa 


ttc 


aca 


gtg 


tgt 


gac 


gtt 


222 


Lys 


Asn 


Ser 


Pro 


Arg 


His 


Leu 


Leu 


Lys 


Gin 


Phe 


Thr 


Val 


Cys 


Asp 


Val 








10 










15 










20 










cct 


ctg 


tat 


gac 


att 


tgt 


gac 


tac 


aac 


gtc 


tec 


aga 


gac 


cga 


tgc 


cag 


270 


Pro 


Leu 


Tyr 


Asp 


He 


Cys 


Asp 


Tyr 


Asn 


Val 


Ser 


Arg 


Asp 


Arg 


Cys 


Gin 






25 










30 










35 












gag 


etc 


ggg 


tgc 


tgc 


ttc 


tac 


gaa 


ggc 


gtc 


tgc 


tac 


aag 


aaa 


gcg 


gtt 


318 


Glu 


Leu 


Gly 


Cys 


Cys 


Phe 


Tyr 


Glu 


Gly 


Val 


Cys 


Tyr 


Lys 


Lys 


Ala 


Val 




40 










45 










50 










55 




ccc 


att 


tac 


ate 


cac 


gtg 


ttc 


tct 


gcc 


ttg 


att 


gtg 


ate 


ata 


get 


ggg 


366 


Pro 


He 


Tyr 


He 


His 


Val 


Phe 


Ser 


Ala 


Leu 


He 


Val 


He 


lie 


Ala 


Gly 












60 










65 










70 






gcc 


ttc 


gtc 


ate 


ace 


ate 


ate 


tac 


aga 


gtc 


att 


cag 


gag 


age 


agg 


aaa 


414 


Ala 


Phe 


Val 


He 


Thr 


He 


He 


Tyr 


Arg 


Val 


He 


Gin 


Glu 


Ser 


Arg 


Lys 










75 










80 










85 








gaa 


aag 


gcc 


ate 


cct 


gtg 


tat 


gtc 


gcg 


ctg 


cca 


cag 


aag 


tec 


age 


gaa 


4 62 


Glu 


Lys 


Ala 


He 


Pro 


Val 


Tyr 


Val 


Ala 


Leu 


Pro 


Gin 


Lys 


Ser 


Ser 


Glu 








90 










95 










100 










aag 


gcg 


gag 


ttg 


gcc 


tea 


tec 


age 


age 


aag 


tta 


ggg 


ctg 


aag 


cct 


gcg 


510 


Lys 


Ala 


Glu 


Leu 


Ala 


Ser 


Ser 


Ser 


Ser 


Lys 


Leu 


Gly 


Leu 


Lys 


Pro 


Ala 




105 










110 










115 












agt 


cct 


ggg 


cct 


cca 


agt 


get 


ggg 


ccc 


teg 


atg 


aag 


agt 


gac 


gag 


gat 


558 


Ser 


Pro 


Gly 


Pro 


Pro 


Ser 


Ala 


Gly 


Pro 


Ser 


Met 


Lys 


Ser 


Asp 


Glu 


Asp 
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120 125 
aag gat gat gta aca ggg aca ata aca 
Lys Asp Asp Val Thr Gly Thr lie Thr 
140 
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130 135 
gaa gcc gaa ga 596 
Glu Ala Glu 
145 



<210> 94 

<211> 1413 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 24 . .470 



<220> 

<221> polyA_site 
<222> 1398 . . 1413 



<400> 94 

gggttcccgt tccccgcgga gcc atg egg tac aac gag aag gag ctg cag get 

Met Arg Tyr Asn Glu Lys Glu Leu Gin Ala 
15 10 
ctg tec egg cag ccg gcc gag atg gcg gcc gag ctg ggc atg agg ggc 
Leu Ser Arg Gin Pro Ala Glu Met Ala Ala Glu Leu Gly Met Arg Gly 

15 20 25 

ccc aag aag ggc age gtg ctg aag egg egg ctg gtg aag ctg gtg gtg 
Pro Lys Lys Gly Ser Val Leu Lys Arg Arg Leu Val Lys Leu Val Val 

30 35 4 0 

aat ttc etc ttc tac ttt egg aca gac gag gcc gag ccc gtc gga gcc 
Asn Phe Leu Phe Tyr Phe Arg Thr Asp Glu Ala Glu Pro Val Gly Ala 

45 50 55 

ctg ctg ctg gag cgc tgc aga gtc gtc egg gaa gag ccc ggc ace ttc 
Leu Leu Leu Glu Arg Cys Arg Val Val Arg Glu Glu Pro Gly Thr Phe 

60 65 70 

tec ate age ttc att gag gac cct gag agg aag tat cac ttt gag tgc 
Ser lie Ser Phe lie Glu Asp Pro Glu Arg Lys Tyr His Phe Glu Cys 
75 80 85 90 

age age gag gag cag tgt cag gag tgg atg gag get ctg cgt egg gcc 
Ser Ser Glu Glu Gin Cys Gin Glu Trp Met Glu Ala Leu Arg Arg Ala 

95 100 105 

age tac gag ttc atg egg aga age etc ate ttc tac agg aac gaa ate 
Ser Tyr Glu Phe Met Arg Arg Ser Leu lie Phe Tyr Arg Asn Glu lie 

110 115 120 

egg aag gtg acg ggc aag gac ccc ctg gaa cag ttc ggc ata tec gag 
Arg Lys Val Thr Gly Lys Asp Pro Leu Glu Gin Phe Gly lie Ser Glu 

125 130 135 

gag gcc agg ttc cag ctg agt ggc ttg cag gcg tgagegcagg gcacggtggt 
Glu Ala Arg Phe Gin Leu Ser Gly Leu Gin Ala 

140 145 
cagcgtgcag egggaeggga ctggccctgc ccagccatga ategcttgge catgcctgga 
tctgttttgt tttggttttt ggtttttggg tcagggtttc actgtgttgc ccaggctaga 
gtgcagtggt gccacagctc actgtgacct tgaccttctg gactcaagtg atcctcctgc 
ctcagcttcc caagtagegg ggatcacagg catgagtege cacacccggc catcacacct 
ggattttcag tgggaggttt ttggtttgga gacatccaaa gcctgaagcc aggtgggtgt 
gggcaggggc tgcattttat gaaactgccc ageaagctge gctccctggg gccccaggat 
ccacctaact ggcttcagct tttgtcatat gggtaaaaat taaagatgtc attgaactac 
tgtcttgttt atgagaccat tcagtggtga actgtttctg gctgataggt tatgagatat 
gtaaagcttt ctagtactct taaaataact aaatggagta ttatatatca attcatatca 
ttgactttat tattttagta gtatgectat agaaaatatt atggactcag agtgtcataa 
aatcactctt aagaatccat gcagcaggcc aggcacagtg gctcacacct gtaatgcctg 
cactttggaa ggecgagaca ggeggatcac ttgaggtcag gagtttgaaa ccagccaggc 
caacacagtg aaaccctgtc tctactaaaa atacaaaagg ttagceggge atggtggcag 
gcgcctgtaa tcccagctac tcaggaggct gaggcaggag aattgcttga aegcaggagg 



53 



101 



149 



197 



245 



293 



341 



389 



437 



490 



550 
610 
670 
730 
790 
850 
910 
970 
1030 
1090 
1150 
1210 
1270 
1330 
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caaaggttgc agtgagctga gatcacgcca ctgcactcca gcctgggcaa cagacctcga 1390 
ctccatcaaa aaaaaaaaaa aaa 1413 



<210> 95 

<211> 519 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 211. .519 



<220> 

<221> polyA__site 
<222> 504 . .519 



<400> 95 

agattcttaa ctacgtgtct tacagcaaag ttgcccatgg gaacaaaggc tgtagggtgt 60 
cattgatctt gtaatacaac acatatttga tgggtgaatc ttattagcac tcctttgaaa 120 
atgttccagt atggagtcag ttgctgataa agaggaagtt cacagctagg aatccgaagt 180 
taagatttct caattgccgc ctgggcactc atg cct gta gtc cca gca ctt ggg 234 

Met Pro Val Val Pro Ala Leu Gly 

1 5 



agg 


cca 


agg 


tgg 


gca 


gat 


cac 


ctg 


agg 


teg 


gga 


gtt 


egg 


gac 


cag 


cct 


282 


Arg 


Pro 
10 


Arg 


Trp 


Ala 


Asp 


His 
15 


Leu 


Arg 


Ser 


Gly 


Val 
20 


Arg 


Asp 


Gin 


Pro 




ggc 


caa 


cct 


ggt 


gaa 


gcc 


ccc 


cca 


tct 


eta 


eta 


aaa 


ata 


caa 


aaa 


ttg 


330 


Gly 


Gin 


Pro 


Gly 


Glu 


Ala 


Pro 


Pro 


Ser 


Leu 


Leu 


Lys 


He 


Gin 


Lys 


Leu 




25 










30 










35 










40 




gcc 


ggg 


tat 


ggt 


ggt 


ggc 


tgc 


ctg 


tgg 


tec 


cag 


eta 


etc 


ggg 


agg 


ctg 


378 


Ala 


Gly 


Tyr 


Gly 


Gly 
45 


Gly 


Cys 


Leu 


Trp 


Ser 
50 


Gin 


Leu 


Leu 


Gly 


Arg 
55 


Leu 




agg 


egg 


gag 


aat 


cac 


ttg 


age 


ccg 


gga 


ggc 


gga 


ggt 


tgc 


agt 


gag 


ccg 


426 


Arg 


Arg 


Glu 


Asn 
60 


His 


Leu 


Ser 


Pro 


Gly 
65 


Gly 


Gly 


Gly 


Cys 


Ser 
70 


Glu 


Pro 




aga 


ttg 


tgc 


cac 


tgc 


act 


cca 


gcc 


tgg 


gtg 


aca 


gag 


caa 


gac 


tec 


ate 


474 


Arg 


Leu 


Cys 
75 


His 


Cys 


Thr 


Pro 


Ala 
80 


Trp 


Val 


Thr 


Glu 


Gin 
85 


Asp 


Ser 


He 




tea 


aaa 


ata 


gaa 


aag 


att 


tct 


caa 


ttg 


cca 


aaa 


aaa 


aaa 


aaa 


aaa 




519 


Ser 


Lys 
90 


He 


Glu 


Lys 


He 


Ser 
95 


Gin 


Leu 


Pro 


Lys 


Lys 
100 


Lys 


Lys 


Lys 







<210> 96 

<211> 347 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 73 . .306 



<220> 

<221> polyA_site 
<222> 332. .347 



<220> 

<221> polyA_signal 
<222> 305. .310 



<400> 96 

actccgcctt ccacgtgcac ccactgcctc ttcccttctc gcttgggaac tctagtctcg 60 
cctcgggttg ca atg gac ccc aac tgc tec tgt gcc get ggt gtc tec tgc 111 
Met Asp Pro Asn Cys Ser Cys Ala Ala Gly Val Ser Cys 
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79 









1 








5 










10 








acc 


tgc 


gcc 


age 


tec 


tgc 


aag 


tgc 


aaa 


gag 


tgc 


aaa 


tgc 


acc 


tec 


tgc 


159 


Thr 


Cys 
15 


Ala 


Ser 


Ser 


Cys 


Lys 
20 


Cys 


Lys 


Glu 


Cys 


Lys 
25 


Cys 


Thr 


Ser 


Cys 




aag 


aag 


age 


tgc 


tgc 


tec 


tgg 


tgc 


act 


ggc 


ccc 


acc 


etc 


gtg 


gac 


acc 


207 


Lys 


Lys 


Ser 


Cys 


Cys 


Ser 


Trp 


Cys 


Thr 


Gly 


Pro 


Thr 


Leu 


Val 


Asp 


Thr 




30 










35 










40 










45 




tgc 


cct 


gcc 


ctg 


cca 


cct 


gtc 


tgt 


ctg 


tec 


caa 


aga 


agt 


tct 


ggt 


atg 


255 


Cys 


Pro 


Ala 


Leu 


Pro 
50 


Pro 


Val 


Cys 


Leu 


Ser 
55 


Gin 


Arg 


Ser 


Ser 


Gly 
60 


Met 




aac 


ttg 


agg 


aca 


cat 


gtc 


cag 


tgg 


gag 


gtg 


aga 


cca 


cct 


etc 


aat 


att 


303 


Asn 


Leu 


Arg 


Thr 

65 


His 


Val 


Gin 


Trp 


Glu 
70 


Val 


Arg 


Pro 


Pro 


Leu 
75 


Asn 


He 





caa taaagctget gagaatctag cctcgaaaaa aaaaaaaaaa a 347 
Gin 



<210> 97 

<211> 477 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 120. .395 



<220> 

<221> polyA_site 
<222> 462. .477 



<400> 97 

gcttctttct agtgtggacc agaggtgggg gcaatgagta aacatcttca ctgtaagggt 60 
tttgggactg cccctcaggg cttgaggtct eggcategtg tctcagtcag tccttggcg 119 



atg 


tec 


cct 


cag 


acg 


cac 


tea 


cag 


act 


tgc 


ata 


cgt 


aat 


tta 


gta 


aca 


167 


Met 


Ser 


Pro 


Gin 


Thr 


His 


Ser 


Gin 


Thr 


Cys 


He 


Arg 


Asn 


Leu 


Val 


Thr 




1 








5 










10 










15 






tgc 


ate 


aac 


tat 


cct 


aga 


aca 


tct 


aca 


ggg 


tgc 


aaa 


gga 


acc 


acc 


act 


215 


Cys 


He 


Asn 


Tyr 
20 


Pro 


Arg 


Thr 


Ser 


Thr 
25 


Gly 


Cys 


Lys 


Gly 


Thr 

30 


Thr 


Thr 




cag 


aga 


ate 


atg 


gag 


cca 


gtg 


gag 


tta 


gaa 


gtg 


gaa 


ggg 


aca 


gaa 


caa 


263 


Gin 


Arg 


He 
35 


Met 


Glu 


Pro 


Val 


Glu 
40 


Leu 


Glu 


Val 


Glu 


Gly 
45 


Thr 


Glu 


Gin 




gac 


aat 


get 


aaa 


acc 


tgt 


ggt 


tea 


eta 


gga 


agg 


ggg 


aat 


gaa 


aac 


acc 


311 


Asp 


Asn 
50 


Ala 


Lys 


Thr 


Cys 


Gly 
55 


Ser 


Leu 


Gly 


Arg 


Gly 
60 


Asn 


Glu 


Asn 


Thr 




atg 


etc 


cga 


ggt 


gga 


ttc 


age 


atg 


aac 


aca 


act 


gtg 


ggg 


caa 


gga 


att 


359 


Met 


Leu 


Arg 


Gly 


Gly 


Phe 


Ser 


Met 


Asn 


Thr 


Thr 


Val 


Gly 


Gin 


Gly 


He 




65 










70 










75 










80 




tec 


aag 


caa 


aca 


cac 


cac 


act 


agt 


acc 


act 


tct 


tec 


taaacagaaa 




405 


Ser 


Lys 


Gin 


Thr 


His 
85 


His 


Thr 


Ser 


Thr 


Thr 
90 


Ser 


Ser 













agggaaagag catccttcga sttccccctc cctccatcag caccgggttt tcctccaaaa 465 
aaaaaaaaaa aa 477 



<210> 98 

<211> 363 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 117 . .269 



<220> 
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80 

<221> polyA_site 
<222> 348 . . 363 



<220> 

<221> polyA_signal 
<222> 329. .334 



<400> 98 

cctttctcgt tccccggcca tcttatcggc tgctgttggt tgggggccgt cccgctccta 60 
aggcaggaag atggtggccg caaagaagac ggaaatctga aatagagtac tatgct atg 119 

Met 
1 

ttg get aaa act ggt gtc cat cac tac agt ggc aat aat att gaa ctg 167 
Leu Ala Lys Thr Gly Val His His Tyr Ser Gly Asn Asn lie Glu Leu 

5 10 15 

ggc aca gca tgc gga aaa tac tac aga gtg tgc aca ctg get ate att 215 
Gly Thr Ala Cys Gly Lys Tyr Tyr Arg Val Cys Thr Leu Ala lie lie 

20 25 30 

gat cca ggt gac tct gac ate att aga age atg cca gaa cag act ggt 263 
Asp Pro Gly Asp Ser Asp lie lie Arg Ser Met Pro Glu Gin Thr Gly 

35 40 45 

gaa aag taaacctttt cacctacaaa atttcacctg caaaccttaa acctgeaaaa 319 
Glu Lys 
50 

ttttccttta ataaaatttg cttgttctaa aaaaaaaaaa aaaa 363 



<210> 99 

<211> 606 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 115. .411 



<220> 

<221> polyA_site 
<222> 591. . 606 



<220> 

<221> polyA_signal 
<222> 573. . 578 



<400> 99 

aagaaagggg tgaggcctaa gggacaatca ggatgttttt cagagagaag tgtggatgct 60 
ggacaggaag aaccacagat accagatacg ggtactgttg taactctgtt ctcc atg 117 

Met 
1 



aaa 


aaa 


aag 


gaa 


gaa 


aca 


aca 


ctt 


tea 


gag 


atg 


gag 


cct 


gtt 


gag 


cca 


165 


Lys 


Lys 


Lys 


Glu 
5 


Glu 


Thr 


Thr 


Leu 


Ser 
10 


Glu 


Met 


Glu 


Pro 


Val 
15 


Glu 


Pro 




cag 


tac 


caa 


eta 


gtc 


aat 


get 


gaa 


teg 


act 


tct 


ccc 


ttt 


eta 


cat 


tgc 


213 


Gin 


Tyr 


Gin 
20 


Leu 


Val 


Asn 


Ala 


Glu 

25 


Ser 


Thr 


Ser 


Pro 


Phe 
30 


Leu 


His 


Cys 




ctg 


aga 


gaa 


gtc 


att 


ggg 


gaa 


tac 


tct 


gta 


cac 


gaa 


ttt 


tea 


ctg 


ttg 


261 


Leu 


Arg 
35 


Glu 


Val 


He 


Gly 


Glu 
40 


Tyr 


Ser 


Val 


His 


Glu 
45 


Phe 


Ser 


Leu 


Leu 




ggg 


aaa 


aca 


gag 


agt 


caa 


ggg 


att 


gga 


ttg 


tgg 


att 


gca 


ttg 


gtg 


gtt 


309 


Gly 


Lys 


Thr 


Glu 


Ser 


Gin 


Gly 


He 


Gly 


Leu 


Trp 


He 


Ala 


Leu 


Val 


Val 




50 










55 










60 










65 




ttc 


etc 


agt 


ttc 


etc 


ate 


ttc 


tec 


aca 


agt 


ttc 


tac 


ata 


teg 


aat 


gca 


357 


Phe 


Leu 


Ser 


Phe 


Leu 
70 


He 


Phe 


Ser 


Thr 


Ser 
75 


Phe 


Tyr 


He 


Ser 


Asn 
80 


Ala 
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81 

gag cag ccc ttc ttc aaa gaa cct cct acg gaa get get aag gaa etc 405 
Glu Gin Pro Phe Phe Lys Glu Pro Pro Thr Glu Ala Ala Lys Glu Leu 

8 5 " 90 95 

agt ctg tagctctgeg tggagccatg tgtaaacact gaactgagac ctgccacctc 461 
Ser Leu 

ctactaccta agggeccatt ttcatctgat atcatccccc agaaacaaac tcatgatgac 521 
ttccatgttt tttttagatt agatacatgg agaattttcc tttcccttag aattaaaatc 581 
ctgeattcta aaaaaaaaaa aaaat 606 



<210> 100 
<211> 648 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 125. .508 



<220> 

<221> polyA_site 
<222> 633. . 648 



<220> 

<221> polyA_signal 
<222> 610. . 615 



<400> 100 

agetgectgg gtaaggecca agatggctgt cttcgcctta gtactcgtgt gaagttggca 60 
gggacggttc ctgtcatctt cttgggctta tttggtgtgc tgttgaaggg gggagactag 120 
agaa atg gca ggg aac etc tta tec ggg gca ggt agg cgc ctg tgg gac 169 
Met Ala Gly Asn Leu Leu Ser Gly Ala Gly Arg Arg Leu Trp Asp 





1 








5 










10 










15 




tgg 


gtg 


cct 


ctg 


gcg 


tgc 


aga 


age 


ttc 


tct 


ctt 


ggt 


gtg 


cct 


aga 


ttg 


217 


Trp 


Val 


Pro 


Leu 


Ala 


Cys 


Arg 


Ser 


Phe 


Ser 


Leu 


Gly 


Val 


Pro 


Arg 


Leu 












20 










25 










30 






ate 


ggt 


ata 


agg 


etc 


act 


etc 


ccg 


ccc 


ccc 


aaa 


gtg 


gtt 


gat 


cgt 


tgg 


265 


He 


Gly 


He 


Arg 


Leu 


Thr 


Leu 


Pro 


Pro 


Pro 


Lys 


Val 


Val 


Asp 


Arg 


Trp 










35 










40 










45 








aac 


gag 


aaa 


agg 


gec 


atg 


ttc 


gga 


gtg 


tat 


gac 


aac 


ate 


ggg 


ate 


ctg 


313 


Asn 


Glu 


Lys 


Arg 


Ala 


Met 


Phe 


Gly 


Val 


Tyr 


Asp 


Asn 


He 


Gly 


He 


Leu 








50 










55 










60 










gga 


aac 


ttt 


gaa 


aag 


cac 


ccc 


aaa 


gaa 


ctg 


ate 


agg 


ggg 


ccc 


ata 


tgg 


361 


Gly 


Asn 


Phe 


Glu 


Lys 


His 


Pro 


Lys 


Glu 


Leu 


He 


Arg 


Gly 


Pro 


He 


Trp 






65 










70 










75 












ctt 


cga 


ggt 


tgg 


aaa 


ggg 


aat 


gaa 


ttg 


caa 


cgt 


tgt 


ate 


cga 


aag 


agg 


409 


Leu 


Arg 


Gly 


Trp 


Lys 


Gly 


Asn 


Glu 


Leu 


Gin 


Arg 


Cys 


He 


Arg 


Lys 


Arg 




80 










85 










90 










95 




aaa 


atg 


gtt 


gga 


agt 


aga 


atg 


ttc 


get 


gat 


gac 


ctg 


cac 


aac 


ctt 


aat 


457 


Lys 


Met 


Val 


Gly 


Ser 


Arg 


Met 


Phe 


Ala 


Asp 


Asp 


Leu 


His 


Asn 


Leu 


Asn 












100 










105 










110 






aaa 


cgc 


ate 


cgc 


tat 


etc 


tac 


aaa 


cac 


ttt 


aac 


cga 


cat 


ggg 


aag 


ttt 


505 


Lys 


Arg 


He 


Arg 


Tyr 


Leu 


Tyr 


Lys 


His 


Phe 


Asn 


Arg 


His 


Gly 


Lys 


Phe 





115 120 125 



cga tagaagagaa agctgagaac ttcggaaaag gctcatctgt caccctggag 558 
Arg 

aagggaaact gtacttttcc ctgtgaggaa aeggctttgt attttctctg taataaaatg 618 

gggcttcttt ggacaaaaaa aaaaaaaaaa 648 



<210> 101 
<211> 491 
<212> DNA 

<213> Homo sapiens 
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<220> 
<221> CDS 
<222> 50. .205 



<220> 

<221> polyA__site 
<222> 469. .491 



<400> 101 
actctttcct 



aca ctg gca 
Thr Leu Ala 
5 

gtt tat gaa 
Val Tyr Glu 
20 

tec tgg ate 
Ser Trp lie 



tggctgetea ggtcataggt gctgtgatct agagacaaa atg ttc ctt 

Met Phe Leu 
1 

ttc ctt ctg aat 
Phe Leu Leu Asn 



gat act tgc aaa eta agg gga atg age 
Asp Thr Cys Lys Leu Arg Gly Met Ser 

10 15 
gga gag gee act gtg tea tct gtc tta 
Gly Glu Ala Thr Val Ser Ser Val Leu 

25 30 
att gtg gga aat gaa aga tac ttc gat 
lie Val Gly Asn Glu Arg Tyr Phe Asp 
40 J ~ 45 

cat tgatccaatg ccaactccaa gactggaacg tegcaatgat 
His 

eggaaatttg 
cttctgaaag 
aaaataatcc 
aaacatctta 



gag eta ttg gaa 
Glu Leu Leu Glu 
35 

gga ate age age 
Gly lie Ser Ser 
50 

agttccaagg 



aegtaattet tttcgacaca gttttacagg tetggatgee cattttaatt 
catgcctcct ccttctggct tcaggactcc atctccagcc tcttgatcta 
ccaaaccaaa aaattagata ctatttcctc aaaattaggt attttaatca 
acataaatac attattatca cccaaaaaaa aaaaaaaatg cgaaaa 



58 



106 



154 



202 



255 

315 
375 
435 
491 



<210> 102 
<211> 565 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 90. .341 



<220> 

<221> polyA_site 
<222> 534 . .565 



<220> 

<221> polyA_signal 
<222> 515. . 520 



<400> 102 

gttgagggag tcgggccgcg actgtggtcg tttttatacc ttcccgcgcg gacgccggcg 60 

113 



161 

209 
257 
305 
351 



ctgccaacgg aagggeggge tctgctgcc 


atg 


aag 


gtg 


aag 


att 


aag 


tgc 


tgg 


















Met 
1 


Lys 


Val 


Lys 


He 
5 


Lys 


Cys 


Trp 


aac 


ggc 


gtg 


gee 


act 


tgg 


etc 


tgg 


gtg 


gee 


aac 


gat 


gag 


aac 


tgt 


ggc 


Asn 


Gly 


Val 


Ala 


Thr 


Trp 


Leu 


Trp 


Val 


Ala 


Asn 


Asp 


Glu 


Asn 


Cys 


Gly 




10 










15 










20 










ate 


tgc 


agg 


atg 


gca 


ttt 


aac 


gga 


tgc 


tgc 


cct 


gac 


tgc 


aag 


gtg 


ccc 


lie 


Cys 


Arg 


Met 


Ala 


Phe 


Asn 


Gly 


Cys 


Cys 


Pro 


Asp 


Cys 


Lys 


Val 


Pro 


25 










30 










35 










40 


ggc 


gac 


gac 


tgc 


ccg 


ctg 


gtg 


tgg 


ggc 


cag 


tgc 


tec 


cac 


tgc 


ttc 


cac 


Gly Asp 


Asp 


Cys 


Pro 


Leu 


Val 


Trp 


Gly 


Gin 


Cys 


Ser 


His 


Cys 


Phe 


His 










45 










50 










55 




atg 


cat 


tgc 


ate 


etc 


aag 


tgg 


ctg 


cac 


gca 


cag 


cag 


gtg 


cag 


cag 


cac 


Met 


His 


Cys 


He 


Leu 


Lys 


Trp 


Leu 


His 


Ala 


Gin 


Gin 


Val 


Gin 


Gin 


His 








60 










65 










70 






tgc 


ccc 


atg 


tgc 


cgc 


cag 


gaa 


tgg 


aag 


ttc 


aag 


gag 


tgaggecega 
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Cys Pro Met Cys Arg Gin Glu Trp Lys Phe Lys 

75 80 
cctggctctc gctggagggg catcctgaga ctccttcctc 
tggggacagc gcccctgagc tgcaacaagg tggaaacaag 
tgccatcact atgttgacac ttttatccaa taagtgaaaa 
ttaaaaaaaa aaaaaaagaa aaaaaaaaaa aaaa 



PCT/IB01/00914 



Glu 



atgctggcgc cgatggctgc 
ggctggagct gcgtttgttt 
ctcattaaac tactcaaatc 



411 
471 
531 
565 



<210> 103 
<211> 661 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 169. .420 



<220> 

<221> polyA_site 
<222> 614 . . 661 



<220> 

<221> polyA_signal 
<222> 594 . .599 



<400> 103 

gagactcggc gggcgctgtt gagggagtcg gcccgcgact gtggtcgttt ttataccttc 60 
ccgcgcggac gccggcctgc caacggaagg gcggagacgg agtttcgtca tgttggccag 120 
gcccatttga gatctttgaa gatatcctca acgtgaggct ctgctgcc atg aag gtg 177 

Met Lys Val 



aag 


att 


aag 


tgc 


tgg 


aac 


ggc 


gtg 


gec 


act 


tgg 


etc 


tgg 


1 

gtg 


gee 


aac 


225 


Lys 


He 


Lys 


Cys 


Trp 


Asn 


Gly 


Val 


Ala 


Thr 


Trp 


Leu 


Trp 


Val 


Ala 


Asn 






5 










10 










15 












gat 


gag 


aac 


tgt 


ggc 


ate 


tgc 


agg 


atg 


gca 


ttt 


aac 


gga 


tgc 


tgc 


cct 


273 


Asp 


Glu 


Asn 


Cys 


Gly 


He 


Cys 


Arg 


Met 


Ala 


Phe 


Asn 


Gly 


Cys 


Cys 


Pro 




20 










25 










30 










35 




gac 


tgc 


aag 


gtg 


ccc 


ggc 


gac 


gac 


tgc 


ccg 


ctg 


gtg 


tgg 


ggc 


cag 


tgc 


321 


Asp 


Cys 


Lys 


Val 


Pro 


Gly 


Asp 


Asp 


Cys 


Pro 


Leu 


Val 


Trp 


Gly 


Gin 


Cys 












40 










45 










50 






tec 


cac 


tgc 


ttc 


cac 


atg 


cat 


tgc 


ate 


etc 


aag 


tgg 


ctg 


cac 


gca 


cag 


369 


Ser 


His 


Cys 


Phe 


His 


Met 


His 


Cys 


He 


Leu 


Lys 


Trp 


Leu 


His 


Ala 


Gin 










55 










60 










65 








cag 


gtg 


cag 


cag 


cac 


tgc 


ccc 


atg 


tgc 


cgc 


cag 


gaa 


tgg 


aag 


ttc 


aag 


417 


Gin 


Val 


Gin 


Gin 


His 


Cys 


Pro 


Met 


Cys 


Arg 


Gin 


Glu 


Trp 


Lys 


Phe 


Lys 





70 75 80 



gag tgaggecega cctggctctc gctggagggg catcctgaga ctccttcctc 470 
Glu 

atgctggcgc cgatggctgc tggggacagc gcccctgagc tgcaacaagg tggaaacaag 530 

ggctggagct gcgtttgttt tgccatcact atgttgacac ttttatccaa taagtgaaaa 590 

ctcattaaac tactcaaatc ttgaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaagaaaaa 650 

aaaaaaaaaa a 661 



<210> 104 
<211> 609 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 144 . . 395 



<220> 

<221> polyA_site 



WO 02/083898 

<222> 594 . . 609 



84 



PCT/IB01/00914 



<220> 

<221> polyA_signal 
<222> 569. .574 

<400> 104 

gtttttatac cttcccgcgc ggacgccggc gctgccaacg gaagggcggg tagggcggtg 60 
cgtgattaga gacggagttt cgtcatgttg gccaggccca tttgagatct ttgaagatat 120 
cctcaacgtg aggctctgct gcc atg aag gtg aag att aag tgc tgg aac ggc 173 

Met Lys Val Lys lie Lys Cys Trp Asn Gly 
1 5 10 



gtg 


gcc 


act 


tgg 


etc 


tgg 


gtg 


gcc 


aac 


gat 


gag 


aac 


tgt 


ggc 


ate 


tgc 


221 


Val 


Ala 


Thr 


Trp 


Leu 


Trp 


Val 


Ala 


Asn 


Asp 


Glu 


Asn 


Cys 


Gly 


He 


Cys 












15 










20 










25 






agg 


atg 


gca 


ttt 


aac 


gga 


tgc 


tgc 


cct 


gac 


tgc 


aag 


gtg 


ccc 


ggc 


gac 


269 


Arg 


Met 


Ala 


Phe 


Asn 


Gly 


Cys 


Cys 


Pro 


Asp 


Cys 


Lys 


Val 


Pro 


Gly 


Asp 










30 










35 










40 








gac 


tgc 


ccg 


ctg 


gtg 


tgg 


ggc 


cag 


tgc 


tec 


cac 


tgc 


ttc 


cac 


atg 


cat 


317 


Asp 


Cys 


Pro 


Leu 


Val 


Trp 


Gly 


Gin 


Cys 


Ser 


His 


Cys 


Phe 


His 


Met 


His 








45 










50 










55 










tgc 


ate 


etc 


aag 


tgg 


ctg 


cac 


gca 


cag 


cag 


gtg 


cag 


cag 


cac 


tgc 


ccc 


365 


Cys 


lie 


Leu 


Lys 


Trp 


Leu 


His 


Ala 


Gin 


Gin 


Val 


Gin 


Gin 


His 


Cys 


Pro 






60 










65 










70 












atg 


tgc 


cgc 


cag 


gaa 


tgg 


aag 


ttc 


aag 


gag 


tgaggecega < 


ectggctctc 


415 


Met 


Cys 


Arg 


Gin 


Glu 


Trp 


Lys 


Phe 


Lys 


Glu 
















75 










80 

























gctggagggg catcctgaga ctccttcctc atgctggcgc cgatggctgc tggggacagc 475 

gcccctgagc tgcaacaagg tggaaacaag ggctggagct gcgtttgttt tgccatcact 535 

atgttgacac ttttatccaa taagtgaaaa ctcattaaac tactcaaatc ttgaaaagaa 595 

aaaaaaaaaa aaaa 609 

<210> 105 
<211> 635 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 177 . .428 

<220> 

<221> polyA__site 
<222> 620. . 635 

<220> 

<221> polyA_signal 
<222> 602. . 607 

<400> 105 

aagactegge gggcgctgtt gagggagtcg ggccgcgact gtggtcgttt ttataccttc 60 
ccgcgcggac gccggcgctg ecaaeggaag ggegggtagg gcgagacgga gtttegtcat 120 
gttggccagg cccatttgag atctttgata tatcctcaac gtgaggctct getgee atg 179 

Met 

1 

aag gtg aag att aag tgc tgg aac ggc gtg gcc act tgg etc tgg gtg 227 
Lys Val Lys He Lys Cys Trp Asn Gly Val Ala Thr Trp Leu Trp Val 

5 10 15 

gcc aac gat gag aac tgt ggc ate tgc agg atg gca ttt aac gga tgc 275 
Ala Asn Asp Glu Asn Cys Gly lie Cys Arg Met Ala Phe Asn Gly Cys 

20 25 30 

tgc cct gac tgc aag gtg ccc ggc gac gac tgc ccg ctg gtg tgg ggc 323 
Cys Pro Asp Cys Lys Val Pro Gly Asp Asp Cys Pro Leu Val Trp Gly 
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35 40 45 

cag tgc tec cac tgc ttc cac atg cat tgc ate etc aag tgg ctg cac 371 
Gin Cys Ser His Cys Phe His Met His Cys lie Leu Lys Trp Leu His 
50 55 60 65 

gca cag cag gtg cag cag cac tgc ccc atg tgc cgc cag gaa tgg aag 419 
Ala Gin Gin Val Gin Gin His Cys Pro Met Cys Arg Gin Glu Trp Lys 

70 75 80 

ttc aag gag tgaggecega cctggctctc gctggagggg catcctgaga 4 68 

Phe Lys Glu 

ctccttcctc atgctggcgc cgatggctgc tggggacagc gcccctgagc tgcaacaagg 528 
tggaaacaag ggctggagct gcgtttgttt tgccatcact atgttgacac ttttatccaa 588 
taagtgaaaa ctcattaaac tactcaaatc caaaaaaaaa aaaaaaa 635 



<210> 106 
<211> 573 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 114 . . 365 



<220> 

<221> polyA_site 
<222> 558 . . 573 



<220> 

<221> polyA_signal 
<222> 539. .544 



<400> 106 

ggcgtgcgag acteggeggg cgctgttgag ggagteggge cgcgactgtg gtcgttttta 60 
taccttcccg cgcggacgcc ggcgctgcca aeggaaggge gggctctget gee atg 116 

Met 
1 



aag 


gtg 


aag 


att 


aag 


tgc 


tgg 


aac 


ggc 


gtg 


gee 


act 


tgg 


etc 


tgg 


gtg 


164 


Lys 


Val 


Lys 


He 


Lys 


Cys 


Trp 


Asn 


Gly 


Val 


Ala 


Thr 


Trp 


Leu 


Trp 


Val 










5 










10 










15 








gee 


aac 


gat 


gag 


aac 


tgt 


ggc 


ate 


tgc 


agg 


atg 


gca 


ttt 


aac 


gga 


tgc 


212 


Ala 


Asn 


Asp 


Glu 


Asn 


Cys 


Gly 


He 


Cys 


Arg 


Met 


Ala 


Phe 


Asn 


Gly 


Cys 








20 










25 










30 










tgc 


cct 


gac 


tgc 


aag 


gtg 


ccc 


ggc 


gac 


gac 


tgc 


ccg 


ctg 


gtg 


tgg 


ggc 


260 


Cys 


Pro 


Asp 


Cys 


Lys 


Val 


Pro 


Gly 


Asp 


Asp 


Cys 


Pro 


Leu 


Val 


Trp 


Gly 






35 










40 










45 












cag 


tgc 


tec 


cac 


tgc 


ttc 


cac 


atg 


cat 


tgc 


ate 


etc 


aag 


tgg 


ctg 


cac 


308 


Gin 


Cys 


Ser 


His 


Cys 


Phe 


His 


Met 


His 


Cys 


He 


Leu 


Lys 


Trp 


Leu 


His 




50 










55 










60 










65 




gca 


cag 


cag 


gtg 


cag 


cag 


cac 


tgc 


ccc 


atg 


tgc 


cgc 


cag 


gaa 


tgg 


aag 


356 


Ala 


Gin 


Gin 


Val 


Gin 


Gin 


His 


Cys 


Pro 


Met 


Cys 


Arg 


Gin 


Glu 


Trp 


Lys 





70 75 80 



ttc aag gag tgaggecega cctggctctc gctggagggg catcctgaga 405 
Phe Lys Glu 

ctccttcctc atgctggcgc cgatggctgc tggggacagc gcccctgagc tgcaacaagg 4 65 

tggaaacaag ggctggagct gcgtttgttt tgccatcact atgttgacac ttttatccaa 525 

taagtgaaaa ctcattaaac tactcaaatc ctaaaaaaaa aaaaaaaa 573 



<210> 107 
<211> 689 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
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<222> 44 . . 283 
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<220> 

<221> polyA_site 
<222> 674 . . 689 



<220> 

<221> polyA_signal 
<222> 644 . . 649 



<400> 107 

tttcgccgcc aaagcatcca gcagccccct gctccggccc 



acc cag acc ccc aca aag get cct gag gaa cct 
Thr Gin Thr Pro Thr Lys Ala Pro Glu Glu Pro 
5 10 15 

gac tac aac acg gtg cag act gtg ggc atg act 
Asp Tyr Asn Thr Val Gin Thr Val Gly Met Thr 

25 30 
ttc ctg ctg ggt ate etc ate gtc ate age aag 
Phe Leu Leu Gly lie Leu lie Val lie Ser Lys 

40 45 
aag gcg gac tec agg tct gag age cca acc tgc 
Lys Ala Asp Ser Arg Ser Glu Ser Pro Thr Cys 

55 ~ 60 

gag ctt ccc tct tea gee cct ggt ggc ggc ggc 
Glu Leu Pro Ser Ser Ala Pro Gly Gly Gly Gly 

70 75 
ccgaggaaac tccgctgccg accctgcctg agegegggag 
ggtggggacc cagccgcgcg ccgggagcgc tccccggaat 
aggctggagc cgctgcaccc tgccgtccct ctccaggcct 
agagcccgtc tgcaccccag acccagggcc tcaggcctcc 
ccatcccggc ccagcacccc cagcatcccc gtgtatggcc 
atccccgaag atccgtcccc ctggcccctc agtgtccatg 
catttggttt tttcctctgc aaaaaaaaaa aaaaaa 



age atg gcg acc ccg 
Met Ala Thr Pro 
1 

gac cca ttt tac tat 
Asp Pro Phe Tyr Tyr 
20 

ctg gca acc ate ttg 
Leu Ala Thr lie Leu 
35 

aag gtg aag tgc agg 
Lys Val Lys Cys Arg 
50 

aaa tec tgt aag tct 
Lys Ser Cys Lys Ser 
65 

gtg taacaccttc 

Val 

80 

cctgaggacc gggtggaggc 
gagccgcccc acccacccca 
tggcaatgac gatcccccaa 
agctcctggg ateegggagt 
cccctgcacc tccttgtctc 
tcttgagctt aataaatgtg 



55 



103 



151 



199 



247 



293 



353 
413 
473 
533 
593 
653 
689 



<210> 108 
<211> 561 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 99. .479 



<220> 

<221> polyA_site 
<222> 546. .561 



<220> 

<221> polyA_signal 
<222> 526. .531 



<400> 108 

tttcgaaggg aagggagece cctataaaac agectacagt ggacagtctg gteggcagag 60 
ccgcaggtca gtcgtgaaga gggagctcta ttgccacc atg agt ttc tec ggc aag 116 

Met Ser Phe Ser Gly Lys 
1 5 

tac caa ctg cag age cag gaa aac ttt gaa gee ttc atg aag gca ate 164 
Tyr Gin Leu Gin Ser Gin Glu Asn Phe Glu Ala Phe Met Lys Ala lie 

10 15 20 

ggt ctg ccg gaa gag etc ate cag aag ggg aag gat ate aag ggg gtg 212 
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Gly 


Leu 


Pro 


Glu 


Glu 


Leu 


He 


Gin 


Lys 


Gly 


Lys 


Asp 


He 


Lys 


Gly 


Val 








25 










30 










35 










teg 


gaa 


ate 


gtg 


cag 


aat 


ggg 


aag 


cac 


ttc 


aag 


ttc 


acc 


ate 


acc 


get 


260 


Ser 


Glu 


He 


Val 


Gin 


Asn 


Gly 


Lys 


His 


Phe 


Lys 


Phe 


Thr 


He 


Thr 


Ala 






40 










45 










50 












ggg 


tec 


aaa 


gtg 


ate 


caa 


aac 


gaa 


ttc 


acg 


gtg 


ggg 


gag 


gaa 


tgt 


gag 


308 


Gly 


Ser 


Lys 


Val 


He 


Gin 


Asn 


Glu 


Phe 


Thr 


Val 


Gly 


Glu 


Glu 


Cys 


Glu 




55 










60 










65 










70 




ctg 


gag 


aca 


atg 


aca 


ggg 


gag 


aaa 


gtc 


aag 


aca 


gtg 


gtt 


cag 


ttg 


gaa 


356 


Leu 


Glu 


Thr 


Met 


Thr 


Gly 


Glu 


Lys 


Val 


Lys 


Thr 


Val 


Val 


Gin 


Leu 


Glu 












75 










80 










85 






ggt 


gac 


aat 


aaa 


ctg 


gtg 


aca 


act 


ttc 


aaa 


aac 


ate 


aag 


tct 


gtg 


acc 


404 


Gly 


Asp 


Asn 


Lys 


Leu 


Val 


Thr 


Thr 


Phe 


Lys 


Asn 


He 


Lys 


Ser 


Val 


Thr 










90 










95 










100 








gaa 


etc 


aac 


ggc 


gac 


ata 


ate 


ace 


aat 


acc 


atg 


aca 


ttg 


ggt 


gac 


att 


452 


Glu 


Leu 


Asn 


Gly 


Asp 


He 


He 


Thr 


Asn 


Thr 


Met 


Thr 


Leu 


Gly 


Asp 


He 








105 










110 










115 










gtc 


ttc 


aag 


aga 


ate 


age 


aag 


aga 


att 


taaacaagtc tgcatttcat 




499 


Val 


Phe 


Lys 


Arg 


He 


Ser 


Lys 


Arg 


He 



















120 125 
attattttag tgtgtaaaat taatgtaata aagtgaactt tgttttaaaa aaaaaaaaaa 559 
aa 561 



<210> 109 
<211> 738 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 1. .261 



<220> 

<221> polyA_site 
<222> 724 . .738 



<220> 

<221> polyA_signal 
<222> 712. .717 



<400> 109 






























gac 


gca 


gta 


gee 


gtg 


ggg 


aag 


ttt 


teg 


caa 


tgg 


ccg 


ccg 


gaa 


egg 


teg 


48 


Asp 


Ala 


Val 


Ala 


Val 


Gly 


Lys 


Phe 


Ser 


Gin 


Trp 


Pro 


Pro 


Glu 


Arg 


Ser 




1 








5 










10 










15 






ccg 


gee 


gat 


get 


etc 


tgc 


gac 


gga 


aag 


aaa 


gee 


gee 


gga 


ccc 


act 


tec 


96 


Pro 


Ala 


Asp 


Ala 


Leu 


Cys 


Asp 


Gly 


Lys 


Lys 


Ala 


Ala 


Gly 


Pro 


Thr 


Ser 










20 










25 










30 








ggt 


gcg 


ttt 


tac 


aat 


gaa 


etc 


egg 


gca 


etc 


gag 


teg 


etc 


ccc 


gga 


aac 


144 


Gly Ala 


Phe 


Tyr 


Asn 


Glu 


Leu 


Arg 


Ala 


Leu 


Glu 


Ser 


Leu 


Pro 


Gly 


Asn 








35 










40 










45 










tat 


agt 


tct 


get 


tec 


acc 


gtt 


gtc 


eta 


ctt 


tgg 


ata 


gca 


tea 


aga 


caa 


192 


Tyr 


Ser 


Ser 


Ala 


Ser 


Thr 


Val 


Val 


Leu 


Leu 


Trp 


He 


Ala 


Ser 


Arg 


Gin 






50 










55 










60 












aaa 


tct 


cag 


tct 


teg 


agg 


aac 


tea 


cag 


tgt 


age 


aga 


aaa 


gac 


aaa 


gga 


240 


Lys 


Ser 


Gin 


Ser 


Ser 


Arg 


Asn 


Ser 


Gin 


Cys 


Ser 


Arg 


Lys 


Asp 


Lys 


Gly 




65 










70 










75 










80 




ggt 


aaa 


cag 


gag 


gag 


cat 


gaa 


taattcatct tgatgtaatc atgeaagact 


291 


Gly 


Lys 


Gin 


Glu 


Glu 


His 


Glu 























85 



cctaagaaga tgttaggaga ccctgtcatc attcttttgg cctctagtgt cctttatctt 351 

catgtttgct ggcttccttt tactactatg gaaatacaac atatgeaaaa gtagtgtttt 411 

cgctgtttaa aaacatcgtc agcagcatca tcatcattaa ctgcaccagg aagecatgae 471 

tcctttctgg acaaggtgea cttgcctgtc atccttccat ctctttcttc ctccttgaag 531 



WO 02/083898 



PCT/IB01/00914 



88 



ttcacaatac ccagtccagg agcattgcag agaatgctag gtgtgtctgc agacttgact 591 

ttaaaagaaa acaagttcct tcaagtgagt tcatcatgat gggaatgata atttgtttga 651 

gagaatgtgt gttctttggc aaaggaactc ctgttatacc atgacagtaa taatagcagt 711 

aataaaatca ccaaaaaaaa aaaaaaa 738 



<210> 110 
<211> 584 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 88 . .375 



<220> 

<221> polyA_site 
<222> 567. .584 



<220> 

<221> polyA_signal 
<222> 546. .551 



<400> 110 

gagtcagtct cgccagctgc cggtctttcg ggggctccgt aactttctat ccgtccgcgt 60 
cagcgcyttg ccaccctcat ctccaat atg cct ggt ccg acc ccc agt ggc act 114 

Met Pro Gly Pro Thr Pro Ser Gly Thr 

1 5 



aac 


gtg 


gga 


tec 


tea 


ggg 


cgc 


tct 


ccc 


age 


aaa 


gca 


gtg 


gec 


gec 


egg 


162 


Asn 


Val 


Gly 


Ser 


Ser 


Gly 


Arg 


Ser 


Pro 


Ser 


Lys 


Ala 


Val 


Ala 


Ala 


Arg 




10 










15 










20 










25 




gcg 


gcg 


gga 


tec 


act 


gtc 


egg 


cag 


agg 


aaa 


aat 


gee 


age 


tgt 


ggg 


aca 


210 


Ala 


Ala 


Gly 


Ser 


Thr 


Val 


Arg 


Gin 


Arg 


Lys 


Asn 


Ala 


Ser 


Cys 


Gly 


Thr 












30 










35 










40 






agg 


agt 


gca 


ggc 


cgc 


aca 


acc 


teg 


gca 


ggc 


acc 


ggg 


ggg 


atg 


tgg 


cga 


258 


Arg 


Ser 


Ala 


Gly 


Arg 


Thr 


Thr 


Ser 


Ala 


Gly 


Thr 


Gly 


Gly 


Met 


Trp 


Arg 










45 










50 










55 








ttc 


tac 


aca 


gaa 


gat 


tea 


cct 


ggg 


etc 


aaa 


gtt 


ggc 


cct 


gtt 


cca 


gta 


306 


Phe 


Tyr 


Thr 


Glu 


Asp 


Ser 


Pro 


Gly 


Leu 


Lys 


Val 


Gly 


Pro 


Val 


Pro 


Val 








60 










65 










70 










ttg 


gtt 


atg 


agt 


ctt 


ctg 


ttc 


ate 


get 


tct 


gta 


ttt 


atg 


ttg 


cac 


att 


354 


Leu 


Val 


Met 


Ser 


Leu 


Leu 


Phe 


lie 


Ala 


Ser 


Val 


Phe 


Met 


Leu 


His 


lie 






75 










80 










85 












tgg 


ggc 


aag 


tac 


act 


cgt 


teg 


tagattcagt tacatccatc tgtcatctga 


405 


Trp 


Gly 


Lys 


Tyr 


Thr 


Arg 


Ser 























90 95 

agaaggagga aaaaacccaa catttcttgg accaaaagta tagtgactat ctgttcatga 465 

gagaaatttt ctgtaagctt gctgttttac aggggattta tcaataattg attttgagga 525 

atcagttttt ttctatggct aataaacttt ttaattcact tataaaaaaa aaaaaaaaa 584 



<210> 111 
<211> 588 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 16. . 183 



<220> 

<221> polyA__site 
<222> 289. . 588 



<220> 
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<221> polyA_signal 
<222> 267 . . 272 



<400> 111 
tttattttgc 



gga aat act 
Gly Asn Thr 
15 

ttt cga aga 
Phe Arg Arg 
30 

aaa tat gaa 
Lys Tyr Glu 
45 

aaatatttcc 
tgctgttgac 
agaaaaggag 
tcagcaactg 
cttttaggac 
gtggaagagt 
aaattgttgt 



cttct atg tea tct tgg age tta atg tgc atg caa gat atg 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 
1 5 10 

aaa gca aga eta etc tat gaa gee aat ctt cca gag aac 
Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 

20 25 
cca cag aca gat caa gca gtg gaa ttt ttc ate aga gat 
Pro Gin Thr Asp Gin Ala Val Glu Phe Phe lie Arg Asp 

35 40 
aag aag aaa tac tac gaa aaa atg cca tagctattac 
Lys Lys Lys Tyr Tyr Glu Lys Met Pro 
50 55 
tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 
aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 
ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 
gagectaaaa aaagtaccag ccctaaaaaa getgeggage cactgtggat 
ttgtttggag aaagattgag aaggaaaaga gtgaaaggca gttagttgaa 
tgaccttcat gtactgcttt ccaaaaaatt gatcctagtt gttattgaat 
ctgttttaag aaaaaaaaaa aaaaa 



51 



99 



147 



193 



253 
313 
373 
433 
493 
553 
588 



<210> 112 
<211> 589 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 16. . 183 



<220> 

<221> polyA_site 
<222> 289. . 589 



<220> 

<221> polyA_signal 
<222> 267. .272 



<400> 112 

tttattttgc cttct atg tea tct tgg age tta atg tgc atg caa gat atg 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 
15 10 
gga aat act aaa gca aga eta etc tat gaa gee aat ctt cca gag aac 
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 

15 20 25 

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc ate aga gat 
Phe Arg Arg Pro Gin Thr Asp Gin Ala Val Glu Phe Phe lie Arg Asp 

30 35 40 

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro 
4 5 50 55 

aaatatttcc tcctctgatg ctcctcttca gccttttgta tcctctcctt ctctgcaagc 
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 
agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 
tcagcaactg gagectaaaa aaagtaccag ccctaaaaaa getgeggage ccactgtgga 
tcttttagga cttgtttgga gaaagattga gaaggaaaag agtgaaaggc agttagttga 
agtggaagag ttgaccttca tgtactgett tccaaaaaat tgatcctagt tgttattgaa 
taaattgttg tctgttttaa gaaaaaaaaa aaaaaa 



51 



99 



147 



193 



253 
313 
373 
433 
493 
553 
589 



<210> 113 



WO 02/083898 
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<211> 588 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 16. . 183 



<220> 

<221> polyA_site 
<222> 289. .588 



<220> 

<221> polyA_signal 
<222> 267. .272 



<400> 113 

tttattttgc cttct atg tea tct tgg age tta atg tgc atg caa gat atg 51 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 
15 10 
gga aat act aaa gca aga eta etc tat gaa gee aat ctt cca gag aac 99 
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 

15 ~ 20 25 

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc ate aga gat 147 
Phe Arg Arg Pro Gin Thr Asp Gin Ala Val Glu Phe Phe lie Arg Asp 

30 35 40 

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 193 
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro 
45 50 55 

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 253 
tgctgttgac aaaaataaat tggagaaaga ■ aaaggaaaaa aaaaaggaag agaaaaagag 313 
agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 373 
tcagcaactg gagectaaaa aaagtaccag ccctaaaaaa getgeggase cactgtggat 433 
cttttaggac ttgtttggag aaagattgag aaggaaaaga gtgaaaggca gttagttgaa 493 
gtggaagagt tgaccttcat gtactgcttt ccaaaaaatt gatcctagtt gttattgaat 553 
aaattgttgt ctgttttaag aaaaaaaaaa aaaaa 588 



<210> 114 
<211> 590 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 16. . 183 



<220> 

<221> polyA_site 
<222> 289. .590 



<220> 

<221> polyA_signal 
<222> 267 . . 272 



<400> 114 

tttattttgc cttct atg tea tct tgg age tta atg tgc atg caa gat atg 51 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 
15 10 
gga aat act aaa gca aga eta etc tat gaa gec aat ctt cca gag aac 99 
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 

15 20 25 

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc ate aga gat 147 
Phe Arg Arg Pro Gin Thr Asp Gin Ala Val Glu Phe Phe lie Arg Asp 
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30 35 
aaa tat gaa aag aag aaa tac tac gaa aaa atg 
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met 
45 50 55 

aaatatttcc tcctctgatg ctcctcttca gcctttggta 
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa 
agaaaaggag ccagaaaagc cggcaaaacc acttacagct 
tcagcaactg gagcctaaaa aaagtaccag ccctaaaaaa 
tcttttagga cttgtttgga gaaagattga gaaggaaaag 
agtggaagag ttgaccttca tgtactgctt tccaaaaaat 
taaattgttg tctgttttaa gaaaaaaaaa aaaaaat 



40 

cca tagctattac 
Pro 



PCT/IB01/00914 



193 



tcctctcctt 
aaaaaggaag 
gaaaagctgc 
gctgcggagc 
agtgaaaggc 
tgatcctagt 



ctctgcaagc 
agaaaaagag 
agaagaaaga 
ccactgtgga 
agttagttga 
tgttattgaa 



253 
313 
373 
433 
493 
553 
590 



<210> 115 
<211> 595 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 16. . 183 



<220> 

<221> polyA_site 
<222> 572. .595 



<220> 

<221> polyA_signal 
<222> 552. .557 



<400> 115 

tttattttgc cttct atg tea tct tgg age tta atg tgc atg caa gat atg 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 
1 5 10 

gga aat act aaa gca aga eta etc tat gaa gee aat ctt cca gag aac 
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 

15 20 25 

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc ate aga gat 
Phe Arg Arg Pro Gin Thr Asp Gin Ala Val Glu Phe Phe lie Arg Asp 

30 35 40 

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro 
45 50 55 

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 
agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 
tcagcaactg gagcctaaaa aaagtaccag ccctaaaaaa gctgcggagc ccactgtgga 
tcttttagga cttgtttgga gaaagattga gaaggaaaag agtgaaaggc agttagttga 
agtggaagag ttgaccttca tgtactgctt tccaaaaaat tgatcctagt tgttattgaa 
taaattgttg tctgttttaa gaaaaaaaaa aaaaaatgcg aa 



51 



99 



147 



193 



253 
313 
373 
433 
493 
553 
595 



<210> 116 
<211> 588 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 16. . 183 



<220> 

<221> polyA__site 
<222> 289. . 588 



WO 02/083898 
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92 

<220> 

<221> polyA_signal 
<222> 267 . .272 



<400> 116 

tttattttgc cttct atg tea tct tgg age tta atg tgc atg caa gat atg 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 
15 10 
gga aat act aaa gca aga eta etc tat gaa gee aat ctt cca gag aac 
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 

15 20 2 5 

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc ate aga gat 
Phe Arg Arg Pro Gin Thr Asp Gin Ala Val Glu Phe Phe lie Arg Asp 

30 35 40 

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro 
45 50 55 

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 
agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 
tcagcaactg gagectaaaa aaagtaccag ccctaaaaaa getgeggase cactgtggat 
cttttaggac ttgtttggag aaagattgag aaggaaaaga gtgaaaggca gttagttgaa 
gtggaagagt tgaccttcat gtactgcttt ccaaaaaatt gatcctagtt gttattgaat 
aaattgttgt ctgttttaag aaaaaaaaaa aaaaa 



51 



99 



147 



193 



253 
313 
373 
433 
493 
553 
588 



<210> 117 
<211> 588 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 16. . 183 



<220> 

<221> polyA_site 
<222> 289. . 588 



<220> 

<221> polyA_signal 
<222> 267. .272 



<400> 117 

tttattttgc cttct atg tea tct tgg age tta atg tgc atg caa gat atg 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 
1 5 10 

gga aat act aaa gca aga eta etc tat gaa gee aat ctt cca gag aac 
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 

15 20 25 

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc ate aga gat 
Phe Arg Arg Pro Gin Thr Asp Gin Ala Val Glu Phe Phe lie Arg Asp 

30 35 40 

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro 
45 50 55 

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 
agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 
tcagcaactg gagectaaaa aaagtaccag ccctaaaaaa getgeggage cactgtggat 
cttttaggac ttgtttggag aaagattgag aaggaaaaga gtgaaaggca gttagttgaa 
gtggaagagt tgaccttcat gtactgcttt ccaaaaaatt gatcctagtt gttattgaat 
aaattgttgt ctgttttaag aaaaaaaaaa aaaaa 



51 



99 



147 



193 



253 
313 
373 
433 
493 
553 
588 
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<210> 118 
<211> 588 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 16. . 183 



<220> 

<221> polyA_site 
<222> 289. .588 



<220> 

<221> polyA_signal 
<222> 267. .272 



<400> 118 

tttattttgc cttct atg tea tct tgg age tta atg tgc atg caa gat atg 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 
1 ^ 5 10 

gga aat act aaa gca aga eta etc tat gaa gee aat ctt cca gag aac 

Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 
15 20 25 

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc ate aga gat 

Phe Arg Arg Pro Gin Thr Asp Gin Ala Val Glu Phe Phe lie Arg Asp 
30 35 40 

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 

Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro 

45 ^ ~ 50 ~ 55 

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 
agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 
tcagcaactg gagectaaaa aaagtaccag ccctaaaaaa getgeggage cactgtggat 
cttttaggac ttgtttggag aaagattgag aaggaaaaga gtgaaaggca gttagttgaa 
gtggaagagt tgaccttcat gtactgcttt ccaaaaaatt gatcctagtt gttattgaat 
aaattgttgt ctgttttaag aaaaaaaaaa aaaaa 



51 



99 



147 



193 



253 
313 
373 
433 
493 
553 
588 



<210> 119 
<211> 595 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 16. . 183 



<220> 

<221> polyA_site 
<222> 571. .595 



<220> 

<221> polyA_signal 
<222> 551. .556 



<400> 119 

tttattttgc cttct atg tea tct tgg age tta atg tgc atg caa gat atg 51 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 
15 10 

gga aat act aaa gca aga eta etc tat gaa gee aat ctt cca gag aac 99 

Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 
15 20 25 

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc ate aga gat 147 
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Phe Arg Arg Pro Gin Thr Asp Gin Ala Val Glu 

30 35 
aaa tat gaa aag aag aaa tac tac gaa aaa atg 
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met 
45 50 55 

aaatatttcc tcctctgatg ctcctcttca gcctttggta 
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa 
agaaaaggag ccagaaaagc cggcaaaacc acttacagct 
tcagcaactg gagcctaaaa aaagtaccag ccctaaaaaa 
cttttaggac ttgtttggag aaagattgag aaggaaaaga 
gtggaagagt tgaccttcat gtactgcttt ccaaaaaatt 
aaattgttgt ctgttttaag aaaaaaaaaa aaaaatgcga 



Phe Phe lie Arg Asp 
4 0 

cca tagctattac 
Pro 



tcctctcctt 
aaaaaggaag 
gaaaagctgc 
gctgcggagc 
gtgaaaggca 
gatcctagtt 
aa 



ctctgcaagc 
agaaaaagag 
agaagaaaga 
cactgtggat 
gttagttgaa 
gttattgaat 



193 



253 
313 
373 
433 
493 
553 
595 



<210> 120 
<211> 601 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 16. . 183 



<220> 

<221> polyA_site 
<222> 572. . 601 



<220> 

<221> polyA_signal 
<222> 552. . 557 



<400> 120 

tttattttgc cttct atg tea tct tgg age tta atg tgc atg caa gat atg 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 
1 5 10 

gga aat act aaa gca aga eta etc tat gaa gec aat ctt cca gag aac 
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 

15 " 20 ' 25 

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc ate aga gat 
Phe Arg Arg Pro Gin Thr Asp Gin Ala Val Glu Phe Phe lie Arg Asp 

30 35 40 

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro 
45 50 55 

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 
agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 
tcagcaactg gagcctaaaa aaagtaccag ccctaaaaaa gctgcggagc ccactgtgga 
tcttttagga cttgtttgga gaaagattga gaaggaaaag agtgaaaggc agttagttga 
agtggaagag ttgaccttca tgtactgett tccaaaaaat tgatcctagt tgttattgaa 
taaattgttg tctgttttaa gaaaaaaaaa aaaaaatgcg aaaagctt 



51 



99 



147 



193 



253 
313 
373 
433 
493 
553 
601 



<210> 121 
<211> 589 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 16. . 183 



<220> 

<221> polyA__site 
<222> 289. . 589 
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<220> 

<221> polyA_signal 
<222> 267. .272 



<400> 121 

tttattttgc cttct atg tea tct tgg age tta atg tgc atg caa gat atg 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 
15 10 
gga aat act aaa gca aga eta etc tat gaa gee aat ctt cca gag aac 
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 

15 20 " 25 

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc ate aga gat 
Phe Arg Arg Pro Gin Thr Asp Gin Ala Val Glu Phe Phe lie Arg Asp 

30 35 4 0 

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro 
45 50 55 

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 
agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 
tcagcaactg gagectaaaa aaagtaccag ccctaaaaaa getgeggage ccactgtgga 
tctttttgga cttgtttgga gaaagattga gaaggaaaag agtgaaaggc agttagttga 
agtggaagag ttgaccttca tgtactgett tccaaaaaat tgatcctagt tgttattgaa 
taaattgttg tctgttttaa gaaaaaaaaa aaaaaa 



51 



99 



147 



193 



253 
313 
373 
433 
493 
553 
589 



<210> 122 
<211> 595 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 16. . 183 



<220> 

<221> polyA_site 
<222> 571. .595 



<220> 

<221> polyA_signal 
<222> 551. .556 



<400> 122 
tttattttgc 



gga aat act 
Gly Asn Thr 
15 

ttt cga aga 
Phe Arg Arg 
30 

aaa tat gaa 
Lys Tyr Glu 
45 

aaatatttcc 
tgctgttgac 
agaaaaggag 
tcagcaactg 
cttttaggac 
gtggaagagt 
aaattgttgt 



cttct atg tea tct tgg age tta atg tgc atg caa gat atg 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 
15 10 
aaa gca aga eta etc tat gaa gee aat ctt cca gag aac 
Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 

20 25 
cca cag aca gat caa gca gtg gaa ttt ttc ate aga gat 
Pro Gin Thr Asp Gin Ala Val Glu Phe Phe lie Arg Asp 

35 40 
aag aag aaa tac tac gaa aaa atg cca tagctattac 
Lys Lys Lys Tyr Tyr Glu Lys Met Pro 
50 ~ 55 
tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 
aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 
ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 
gagectaaaa aaagtaccag ccctaaaaaa getgeggase cactgtggat 
ttgtttggag aaagattgag aaggaaaaga gtgaaaggca gttagttgaa 
tgaccttcat gtactgcttt ccaaaaaatt gatcctagtt gttattgaat 
ctgttttaag aaaaaaaaaa aaaaatgega aa 



51 



99 



147 



193 



253 
313 
373 
433 
493 
553 
595 
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<210> 123 
<211> 588 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 16. .183 



<220> 

<221> polyA_site 
<222> 289. . 588 



<220> 

<221> polyA__signal 
<222> 267. .272 



<400> 123 

tttattttgc cttct atg tea tct tgg age tta atg tgc atg caa gat atg 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 
15 10 

gga aat act aaa gca aga eta etc tat gaa gee aat ctt cca gag aac 

Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 
15 20 25 

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc ate aga gat 

Phe Arg Arg Pro Gin Thr Asp Gin Ala Val Glu Phe Phe lie Arg Asp 
30 35 40 

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 

Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro 

45 ~ 50 " 55 

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 
agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 
tcagcaactg gagectaaaa aaagtaccag ccctaaaaaa getgeggage cactgtggat 
cttttaggac ttgtttggag aaagattgag aaggaaaaga gtgaaaggca gttagttgaa 
gtggaagagt tgaccttcat gtactgcttt ccaaaaaatt gatcctagtt gttattgaat 
aaattgttgt ctgttttaag aaaaaaaaaa aaaaa 



51 



99 



147 



193 



253 
313 
373 
433 
493 
553 
588 



<210> 124 
<211> 588 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 16. . 183 



<220> 

<221> polyA_site 
<222> 289. .588 



<220> 

<221> polyA_signal 
<222> 267. .272 



<400> 124 

tttattttgc cttct atg tea tct tgg age tta atg tgc atg caa gat atg 51 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 
15 10 
gga aat act aaa gca aga eta etc tat gaa gee aat ctt cca gag aac 99 
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 
15 ~ 20 25 
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ttt cga aga cca cag aca gat caa gca gtg gaa 
Phe Arg Arg Pro Gin Thr Asp Gin Ala Val Glu 

30 35 
aaa tat gaa aag aag aaa tac tac gaa aaa atg 
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met 
45 50 55 

aaatatttcc tcctctgatg ctcctcttca gcctttggta 
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa 
agaaaaggag ccagaaaagc cggcaaaacc acttacagct 
tcagcaactg gagcctaaaa aaagtaccag ccctaaaaaa 
cttttaggac ttgtttggag aaagattgag aaggaaaaga 
gtggaagagt tgaccttcat gtactgcttt ccaaaaaatt 
aaattgttgt ctgttttaag aaaaaaaaaa aaaaa 



ttt ttc ate aga gat 
Phe Phe lie Arg Asp 
40 

cca tagctattac 
Pro 



tcctctcctt 
aaaaaggaag 
gaaaagctgc 
getgeggage 
gtgaaaggca 
gatcctagtt 



ctctgcaagc 
agaaaaagag 
agaagaaaga 
cactgtggat 
gttagttgaa 
gttattgaat 



147 



193 



253 
313 
373 
433 
493 
553 
588 



<210> 125 
<211> 589 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 16. .183 



<220> 

<221> polyA_site 
<222> 289. .589 



<220> 

<221> polyA_signal 
<222> 267 . .272 



<400> 125 

tttattttgc cttct atg tea tct tgg age tta atg tgc atg caa gat atg 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 
15 10 
gga aat act aaa gca aga eta etc tat gaa gcc aat ctt cca gag aac 
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 

15 ~ 20 25 

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc ate aga gat 
Phe Arg Arg Pro Gin Thr Asp Gin Ala Val Glu Phe Phe lie Arg Asp 

30 35 40 

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro 
45 ~ 50 55 

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 
agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 
tcagcaactg gagcctaaaa aaagtaccag ccctaaaaaa getgeggage ccactgtgga 
tcttttagga cttgtttgga gaaagattga gaaggaaaag agtgaaaggc agttagttga 
agtggaagag ttgaccttca tgtactgett tccaaaaaat tgatcctagt tgttattgaa 
taaattgttg tctgttttaa gaaaaaaaaa aaaaaa 



51 



99 



147 



193 



253 
313 
373 
433 
493 
553 
589 



<210> 126 
<211> 663 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 86. .451 



<220> 

<221> polyA_site 
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<222> 622. . 663 
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<220> 

<221> polyA_signal 
<222> 597 . . 602 



<400> 126 

acacatccaa gcttaagacg gtgaggtcag cttcacattc tcaggaactc tccttctttg 
ggccacggaa ttaacccgag caggc atg gag gcc tct get etc ace tea tea 

Met Glu Ala Ser Ala Leu Thr Ser Ser 
1 5 
gca gtg ace agt gtg gcc aaa gtg gtc agg gtg gcc tct ggc tct gcc 
Ala Val Thr Ser Val Ala Lys Val Val Arg Val Ala Ser Gly Ser Ala 
10 15 20 25 

gta gtt ttg ccc ctg gcc agg att get aca gtt gtg att gga gga gtt 
Val Val Leu Pro Leu Ala Arg He Ala Thr Val Val He Gly Gly Val 

30 35 40 

gtg gcc atg gcg get gtg ccc atg gtg etc agt gcc atg ggc ttc act 
Val Ala Met Ala Ala Val Pro Met Val Leu Ser Ala Met Gly Phe Thr 

45 50 55 

gcg gcg gga ate gcc teg tec tec ata gca gcc aag atg atg tec gcg 
Ala Ala Gly He Ala Ser Ser Ser He Ala Ala Lys Met Met Ser Ala 

60 65 70 

gcg gcc att gcc aat ggg ggt gga gtt gcc teg ggc age ctt gtg get 
Ala Ala He Ala Asn Gly Gly Gly Val Ala Ser Gly Ser Leu Val Ala 

75 80 85 

act ctg cag tea ctg gga gca act gga etc tec gga ttg ace aag ttc 
Thr Leu Gin Ser Leu Gly Ala Thr Gly Leu Ser Gly Leu Thr Lys Phe 
90 95 100 105 

ate ctg ggc tec att ggg tct gmc att gcg get gtc att gcg agg ttc 
He Leu Gly Ser He Gly Ser Xaa He Ala Ala Val He Ala Arg Phe 

110 115 120 

tac tagctccctg cccctcgccc tgcagagaag agaaccatgc caggggagaa 
Tyr 

ggcacccagc catcctgacc cagegaggag ccaactatcc caaatatacc tggggtgaaa 
tataccaaat tctgcatctc cagaggaaaa taagaaataa agatgaattg ttgeaactet 
aaaaaaaaaa aaaaaaaaaa aaaaacaaaa aaaaaaaaaa aa 



60 
112 



160 



208 



256 



304 



352 



400 



448 



501 

561 
621 
663 



<210> 127 
<211> 633 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 88. .453 

<220> 

<221> polyA_site 
<222> 618 . . 633 

<220> 

<221> polyA_signal 
<222> 599. . 604 



<400> 127 

aaacacatcc aagcttaaga cggtgaggtc agcttcacat tctcaggaac tctccttctt 
tgggecaegg aattaacccg agcaggc atg gag gcc tct get etc acc tea tea 

Met Glu Ala Ser Ala Leu Thr Ser Ser 

1 5 
gca gtg acc agt gtg gcc aaa gtg gtc agg gtg gcc tct ggc tct gcc 
Ala Val Thr Ser Val Ala Lys Val Val Arg Val Ala Ser Gly Ser Ala 
10 15 20 25 



60 
114 



162 
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aca gtt gtg att gga gga gtt 
Thr Val Val lie Gly Gly Val 
35 40 



gta gtt ttg ccc ctg gem agg att get 
Val Val Leu Pro Leu Ala Arg lie Ala 
30 

gtg gec atg gcg get gtg ccc atg gtg etc agt gec atg ggc ttc act 
Val Ala Met Ala Ala Val Pro Met Val Leu Ser Ala Met Gly Phe Thr 

45 50 55 

gcg gcg gga ate gec teg tec tec ata gca gee aag atg atg tec gcg 
Ala Ala Gly lie Ala Ser Ser Ser lie Ala Ala Lys Met Met Ser Ala 

60 65 70 

gcg gee att gee aat sgg ggt gga gtt gee teg ggc age ctt gtg get 
Ala Ala lie Ala Asn Xaa Gly Gly Val Ala Ser Gly Ser Leu Val Ala 

75 80 85 

act ctg cag tea ctg gga gca act gga etc tec gga ttg ace aag ttc 
Thr Leu Gin Ser Leu Gly Ala Thr Gly Leu Ser Gly Leu Thr Lys Phe 
90 95 100 105 

ate ctg ggc tec att ggg tct gec att gcg get gtc att gcg agg ttc 
lie Leu Gly Ser lie Gly Ser Ala lie Ala Ala Val lie Ala Arg Phe 

110 115 120 

tac tagctccctg cccctcgccc tgcagagaag agaaccatgc caggggagaa 
Tyr 

ggcacccagc catcctgacc cagegaggag ccaactatcc caaatatacc tggggtgaaa 
tataccaaat tctgcatctc cagaggaaaa taagaaataa agatgaattg ttacaaaaaa 
aaaaaaaaaa 



210 



258 



306 



354 



402 



450 



503 

563 
623 
633 



<210> 128 
<211> 588 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 15. .236 

<220> 

<221> polyA_site 
<222> 573. .588 

<220> 

<221> polyA_signal 
<222> 551. .556 



<400> 128 
gaaaattttc 



teg 
Ser 

egg 
Arg 

ttg 

Leu 

45 

aac 

Asn 



get aat 
Ala Asn 
15 

ggc gtc 
Gly Val 
30 

ata eta 
lie Leu 

ttg agt 
Leu Ser 



tagecaagta 
atctgtgacc 
tttcttcccc 
tctggtcagc 
agttcccaag 
tttaaaaaac 



cact atg get tec age act gtc ccg gtg age get get ggc 
Met Ala Ser Ser Thr Val Pro Val Ser Ala Ala Gly 
15 10 
gaa act ccc gaa ata ccg gac aac gtg gga gat tgg ctt 
Glu Thr Pro Glu lie Pro Asp Asn Val Gly Asp Trp Leu 

20 25 
tac cgc ttt gee act gat agg aat gac ttc egg agg aac 
Tyr Arg Phe Ala Thr Asp Arg Asn Asp Phe Arg Arg Asn 

35 4 0 

aat ttg gga etc ttt get gcg gga gtt tgg ctg gee agg 
Asn Leu Gly Leu Phe Ala Ala Gly Val Trp Leu Ala Arg 

50 55 60 

gac att gac etc atg gca cct cag cca ggg gtg 
Asp lie Asp Leu Met Ala Pro Gin Pro Gly Val 
65 70 
gacaaatgga atcctgtgct gaacccgaat cttccaaaaa acagcctaca 
accacaagat gtgecctgat ggcagctgaa gtttgattca gatgggcact 
ttccctgcct agtttccttt tgttccttga gtccacgcag aattccattc 
agacaggctt aagctaaagt attgectcta ttctgtaaag ttctgtacat 
ettctgeagg gggtgatttt tgctcttgtc ctgagaaata acagtgctgt 
ataaaataaa taccgcacac aaagacaaaa aaaaaaaaaa aa 



50 



98 



146 



194 



236 



296 
356 
416 
476 
536 
588 
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<210> 129 
<211> 608 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 39. .260 



<220> 

<221> polyA_site 
<222> 589. . 608 



<220> 

<221> polyA_signal 
<222> 577. .582 



<400> 129 

acggttcctc caccgaggcc catgcgaagc tttccact atg get tec age act gtc 

Met Ala Ser Ser Thr Val 



56 



ccg gtg age get get ggc teg get aat gaa act ccc gaa ata ccg gac 
Pro Val Ser Ala Ala Gly Ser Ala Asn Glu Thr Pro Glu lie Pro Asp 

10 15 20 

aac gtg gga gat tgg ctt egg ggc gtc tac cgc ttt gee act gat agg 
Asn Val Gly Asp Trp Leu Arg Gly Val Tyr Arg Phe Ala Thr Asp Arg 

25 30 35 

aat gac ttc egg agg aac ttg ata eta aat ttg gga etc ttt get gcg 
Asn Asp Phe Arg Arg Asn Leu lie Leu Asn Leu Gly Leu Phe Ala Ala 

40 45 50 

gga gtt tgg ctg gee agg aac ttg agt gac att gac etc atg gca cct 
Gly Val Trp Leu Ala Arg Asn Leu Ser Asp lie Asp Leu Met Ala Pro 
55 " 60 " 65 70 

cag cca ggg gtg tagecaagta gacaaatgga atcctgtgct gaacccgaat 
Gin Pro Gly Val 

cttccaaaaa acagcctaca atctgtgacc accacaagat gtgecctgat ggcagctgaa 
gtttgattca gatgggcact tttcttcccc ttccctgcct agtttccttt tgttccttga 
gtccacgcag aattccattc tctggtcagc agacaggctt aagctaaagt attgectcta 
ttctgtaaag ttctgtacat agttyccaag ettctgeagg gggtgattkt tgctcttgtc 
ctgagaaata acagtgctgt tttaaaaaac atttgaaata aataccgcac acaaaaaaaa 
aaaaaaaa 



104 



152 



200 



248 



300 

360 
420 
480 
540 
600 
608 



<210> 130 

<211> 1200 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 29. . 469 



<220> 

<221> polyA_site 
<222> 1185. .1200 



<400> 130 

acttggegag tgagacgctg atgggagg atg gac gta ctg gtg tct gag tgc 52 

Met Asp Val Leu Val Ser Glu Cys 
1 5 

tec gcg egg ctg ctg cag cag gaa gaa gag att aaa tct ctg act get 100 
Ser Ala Arg Leu Leu Gin Gin Glu Glu Glu lie Lys Ser Leu Thr Ala 
10 15 20 
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gaa 


att 


gac 


egg 


ttg 


aaa 


aac 


tgt 


ggc 


101 

tgt tta 


gga 


get 


tct 


cca 


aat 


148 


Glu 


He 


Asp 


Arg 


Leu 


Lys 


Asn 


Cys 


Gly 


Cys 


Leu 


Gly 


Ala 


Ser 


Pro 


Asn 




25 










30 










35 










40 




ttg 


gag 


cag 


tta 


caa 


gaa 


gaa 


aat 


tta 


aaa 


tta 


aag 


tat 


cga 


ctg 


aat 


196 


Leu 


Glu 


Gin 


Leu 


Gin 
45 


Glu 


Glu 


Asn 


Leu 


Lys 
50 


Leu 


Lys 


Tyr 


Arg 


Leu 
55 


Asn 




att 


ctt 


cga 


aag 


agt 


ctt 


cag 


gca 


gaa 


agg 


aac 


aaa 


cca 


act 


aaa 


aat 


244 


He 


Leu 


Arg 


Lys 
60 


Ser 


Leu 


Gin 


Ala 


Glu 
65 


Arg 


Asn 


Lys 


Pro 


Thr 
70 


Lys 


Asn 




atg 


att 


aac 


att 


att 


age 


cgc 


eta 


caa 


gag 


gtc 


ttt 


ggt 


cat 


gca 


att 


292 


Met 


He 


Asn 
75 


He 


He 


Ser 


Arg 


Leu 
80 


Gin 


Glu 


Val 


Phe 


Gly 
85 


His 


Ala 


He 




aag 


get 


gca 


tat 


cca 


gat 


ttg 


gaa 


aat 


cct 


cct 


ctg 


eta 


gtg 


aca 


cca 


340 


Lys 


Ala 
90 


Ala 


Tyr 


Pro 


Asp 


Leu 
95 


Glu 


Asn 


Pro 


Pro 


Leu 
100 


Leu 


Val 


Thr 


Pro 




agt 


cag 


cag 


gec 


aag 


ttt 


ggg 


gac 


tat 


cag 


tgt 


aat 


agt 


get 


atg 


ggt 


388 


Ser 


Gin 


Gin 


Ala 


Lys 


Phe 


Gly 


Asp 


Tyr 


Gin 


Cys 


Asn 


Ser 


Ala 


Met 


Gly 




105 










110 










115 










120 




att 


tct 


cag 


gtg 


atg 


tat 


tgt 


cat 


gac 


tct 


tgg 


ctg 


ttt 


gat 


ttt 


ttt 


436 


He 


Ser 


Gin 


Val 


Met 


Tyr 


Cys 


His 


Asp 


Ser 


Trp 


Leu 


Phe 


Asp 


Phe 


Phe 





125 



130 



aag tat tat tat cat cat tgc cat tta cag aaa 
Lys Tyr Tyr Tyr His His Cys His Leu Gin Lys 

140 145 
tccttagtga aaaggacatt tgccacagtt tgaaaaactt 
atatgtttta acttttttag gcacaatttt taaggtttgg 
caatatttaa gggcaatcat tggtactctt ttgtttaggt 
aggattgetg tgtggtggtg atgagtgctg ggaggtgaaa 
cagtcagcat cccaattaaa tatttgatgt aactgtgatc 
tcattttcaa gcagaggagt tccccatttt aaatagaggc 
ttaactgeat ctggcttggg tctttctgtt ttcctttctt 
ctctgaagag tccaggtctt acagtgtggt ttatttctca 
tcatggcttg aaaattagaa atgtaaatat gctggaacca 
tgttatgtca tcattaaaaa gttcgaggtg ttgtaatccc 
caggagaatt gcttgaaccc aggggeggag gttacagtga 
ctccagcctg ggcaacagag cgagactcca tctctaaaaa 



taatactatt 



gagaaaggag 
ttaaatttta 
atttccctcc 
aattaaaata 
tttgagccag 
attgtctgat 
tgctgaattg 
agtgtgaata 
gacttcataa 
agctactagg 
gecgagatea 
aaaaaaaaaa 



135 
acaagttgta 



ttgggggggt 
tatgtattct 
tgctgtgtcc 
agecatttae 
gcttatatat 
gtgtttatgg 
gaaggggtta 
ttgcacacct 
ctgtagactt 
gaggctgagg 
caccactgca 
a 



489 



549 
609 
669 
729 
789 
849 
909 
969 
1029 
1089 
1149 
1200 



<210> 131 
<211> 646 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 50. .340 



<220> 

<221> polyA_site 
<222> 631. . 646 



<220> 

<221> polyA_signal 
<222> 614 . . 619 



<400> 131 

gaggagaaga ggtgaccccg ctaaagataa taaatcatca ttgatcaag atg tct tea 58 

Met Ser Ser 
1 

gca cct gag cct cca aca ttc aaa aag gaa cca ccc aaa gaa aaa gag 106 
Ala Pro Glu Pro Pro Thr Phe Lys Lys Glu Pro Pro Lys Glu Lys Glu 

5 10 15 

ttt caa age cca ggg etc aga ggg gtg cgc aca aca ace tta ttt cgt 154 
Phe Gin Ser Pro Gly Leu Arg Gly Val Arg Thr Thr Thr Leu Phe Arg 
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20 










25 










30 










35 




get 


gtg 


aat 


cca 


gag 


etc 


ttc 


att 


aaa 


cct 


aac 


aaa 


cct 


gta 


atg 


get 


202 


Ala 


Val 


Asn 


Pro 


Glu 
40 


Leu 


Phe 


He 


Lys 


Pro 
45 


Asn 


Lys 


Pro 


Val 


Met 
50 


Ala 




ttc 


gga 


ttg 


gta 


act 


ctt 


tea 


ctt 


tgc 


gtg 


gca 


tat 


att 


ggt 


tat 


eta 


250 


Phe 


Gly 


Leu 


Val 
55 


Thr 


Leu 


Ser 


Leu 


Cys 
60 


Val 


Ala 


Tyr 


He 


Gly 
65 


Tyr 


Leu 




cat 


gca 


ata 


caa 


gag 


aat 


aaa 


aag 


gac 


etc 


tat 


gaa 


get 


att 


gat 


agt 


298 


His 


Ala 


He 
70 


Gin 


Glu 


Asn 


Lys 


Lys 
75 


Asp 


Leu 


Tyr 


Glu 


Ala 
80 


He 


Asp 


Ser 




gag 


ggg 


cac 


agt 


tat 


atg 


agg 


egg 


aaa 


aca 


tec 


aaa 


tgg 


gat 






340 


Glu 


Gly 
85 


His 


Ser 


Tyr 


Met 


Arg 
90 


Arg 


Lys 


Thr 


Ser 


Lys 
95 


Trp 


Asp 









tagtagtget ggttagtgca gatggacctt tattaaaggt tctgaaatct tcaaatgaaa 400 

gaccttgtga gtgtacagta tcatgtttct tgttctagaa catgetaatg aagagagaag 460 

atagcagttg caaccagaca actgtcgtaa attttgtcct ttcacagctg cagecattat 520 

ctcattcttt ttccacagag tgagegtcat aatatttttc tttccttacc tcttataaaa 580 

gtgecatgag aatggaagtt gtttttgtta attattaaat tctgtataat aaaaaaaaaa 640 

aaaaat 646 



<210> 132 
<211> 575 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 177 . . 470 



<220> 

<221> polyA__site 
<222> 561. .575 



<220> 

<221> polyA_signal 
<222> 543. .548 



<400> 132 

atcgaccacg gaaatttgac acctccgggc ttggaagcag ctctctcctc cttccccgct 60 
gcttataaac ctcagccctg aggctccagc tcactctacc ccatctcctt geegggtcag 120 
ccctgacaaa ggtcagctag ccccttgagg acatcagctt tggectcagg gtccta atg 179 

Met 



gca 


gca 


gaa 


cca 


ctg 


aca 


gag 


eta 


gag 


gag 


tec 


att 


gag 


aac 


gtg 


gtc 


227 


Ala 


Ala 


Glu 


Pro 
5 


Leu 


Thr 


Glu 


Leu 


Glu 
10 


Glu 


Ser 


He 


Glu 


Asn 
15 


Val 


Val 




acc 


acc 


ttc 


ttc 


acc 


ttt 


gca 


agg 


cag 


gag 


ggc 


egg 


aak 


gat 


age 


etc 


275 


Thr 


Thr 


Phe 


Phe 


Thr 


Phe 


Ala 


Arg 


Gin 


Glu 


Gly Arg 


Xaa 


Asp 


Ser 


Leu 








20 










25 










30 










age 


gtc 


aac 


gag 


ttc 


aaa 


gag 


ctg 


gtt 


acc 


cag 


cag 


ttg 


ccc 


cat 


ctg 


323 


Ser 


Val 
35 


Asn 


Glu 


Phe 


Lys 


Glu 
40 


Leu 


Val 


Thr 


Gin 


Gin 
45 


Leu 


Pro 


His 


Leu 




etc 


aag 


gat 


gtg 


ggc 


tct 


ctt 


gat 


gag 


aag 


atg 


aag 


age 


ttg 


gat 


gtg 


371 


Leu 


Lys 


Asp 


Val 


Gly 


Ser 


Leu 


Asp 


Glu 


Lys 


Met 


Lys 


Ser 


Leu 


Asp 


Val 




50 










55 










60 










65 




aat 


cag 


gac 


teg 


gag 


etc 


aag 


ttc 


aat 


gag 


tac 


tgg 


aga 


ttg 


att 


ggg 


419 


Asn 


Gin 


Asp 


Ser 


Glu 
70 


Leu 


Lys 


Phe 


Asn 


Glu 
75 


Tyr 


Trp 


Arg 


Leu 


He 
80 


Gly 




gag 


ctg 


gee 


aag 


gaa 


ate 


agg 


aag 


aag 


aaa 


gac 


ctk 


aag 


ate 


agg 


aag 


467 


Glu 


Leu 


Ala 


Lys 
85 


Glu 


He 


Arg 


Lys 


Lys 
90 


Lys 


Asp 


Leu 


Lys 


He 
95 


Arg 


Lys 





aag taaagccgcc tggctgagat ggggtgggca gggcagagct gatcagggee 520 
Lys 
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gagcagaacc gcactcttcc caaataaagc ttcctccttg aaaaaaaaaa aaaaa 575 

<210> 133 
<211> 458 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 75 . .419 

<400> 133 

ctttcctttc tcgttccccg gccatcttag cggctgctgt tggttggggg ccgtcccgct 60 
cctaaggcag gaag atg gtg gcc gca aag aag acg aaa aag teg ctg gag 110 
Met Val Ala Ala Lys Lys Thr Lys Lys Ser Leu Glu 
1 5 10 

teg ate aaa tct agg etc caa etc gtt atg aaa agt ggg aag tac gtc 158 
Ser lie Lys Ser Arg Leu Gin Leu Val Met Lys Ser Gly Lys Tyr Val 

15 20 25 

ctg ggg tac aag cag act ctg aag atg ate aga caa ggc aaa gcg aaa 206 
Leu Gly Tyr Lys Gin Thr Leu Lys Met lie Arg Gin Gly Lys Ala Lys 

30 35 4 0 

ttg gtc att etc get aac aac tgc cca get ttg agg aaa tct gaa ata 254 
Leu Val lie Leu Ala Asn Asn Cys Pro Ala Leu Arg Lys Ser Glu lie 
4 5 50 55 60 

gag tac tat get atg ttg get aaa act ggt gtc cat cac tac agt ggc 302 
Glu Tyr Tyr Ala Met Leu Ala Lys Thr Gly Val His His Tyr Ser Gly 

65 70 75 

aat aat att gaa ctg ggc aca gca tgc gga aaa tac tac aga gtg tgc 350 
Asn Asn lie Glu Leu Gly Thr Ala Cys Gly Lys Tyr Tyr Arg Val Cys 

80 85 90 

aca ctg get ate att gat cca rgt gac tct kac ate att aga age atg 398 
Thr Leu Ala lie lie Asp Pro Xaa Asp Ser Xaa lie lie Arg Ser Met 

95 100 105 

cca gaa cag act ggt gaa aag taaacctttt cacctacaaa atttcacctg 449 
Pro Glu Gin Thr Gly Glu Lys 
110 115 
caaacctta 458 



<210> 134 
<211> 588 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 144 . .449 



<220> 

<221> polyA_site 
<222> 573. .588 



<220> 

<221> polyA_signal 
<222> 552. .557 



<400> 134 

gaggggegag ggggctagcg gcgacggctg gggegagege gcctgcgcgc tgggtgattt 60 
tttcacgtgt cgccagggcc ggactgegag tetctttgeg gcgctacact agagcagagt 120 
acgagtctga ggcggaggga gta atg gca gga caa gcg ttt aga aag ttt ctt 173 

Met Ala Gly Gin Ala Phe Arg Lys Phe Leu 
15 10 
cca etc ttt gac cga gta ttg gtt gaa agg agt cct get gaa act gta 221 
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Pro Leu Phe Asp Arg Val Leu Val Glu Arg Ser 

15 20 
acc aaa gga ggc att atg ctt cca gaa aaa tct 
Thr Lys Gly Gly lie Met Leu Pro Glu Lys Ser 

30 35 
caa gca aca gta gtc get gtt gga teg ggt tct 
Gin Ala Thr Val Val Ala Val Gly Ser Gly Ser 

45 50 
gag att caa cca gtt age gtg aaa gtt gga gat 
Glu He Gin Pro Val Ser Val Lys Val Gly Asp 

60 65 
gaa tat gga ggc acc aaa gta gtt yta gat aac 
Glu Tyr Gly Gly Thr Lys Val Val Leu Asp Asn 
75 80 85 

ttt aga gat ggt gac att ctt gga aag tac gta 
Phe Arg Asp Gly Asp He Leu Gly Lys Tyr Val 

95 ~ 100 

cactattgaa atggcatcaa catgatgetg cccattccac 
tcatgtaaat aatttccata tttctctttt ataataaact 
aaaaaaaaa 



Pro 

caa 
Gin 

aaa 
Lys 

aaa 

Lys 

70 

aag 

Lys 

gac 
Asp 



Ala 


Glu 


Thr 


Val 








25 






gga 


aaa 


gta 


ttg 


269 


Gly 


Lys 


Val 


Leu 






4 0 








gga 


aag 


ggt 


gga 


317 


Gly 


Lys 


Gly 


Gly 




55 










gtt 


ctt 


etc 


cca 


365 


Val 


Leu 


Leu 


Pro 




gat 


tat 


ttc 


eta 


413 


Asp 


Tyr 


Phe 


Leu 





90 



tgaaataagt 



tgaagttctg aaatctttcg 
aatgataact aacaaaaaaa 



459 



519 
579 
588 



<210> 135 
<211> 538 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 93. .398 



<220> 

<221> polyA_site 
<222> 523. .538 



<220> 

<221> polyA_signal 
<222> 501. .506 



<400> 135 

gggtgatttt ttcacgtgtc gccagggccg gaetgegagt etctttgegg cgctacacta 60 
gagcagagta cgagtctgag geggagggag ta atg gca gga caa gcg ttt aga 113 
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tgaaataagt cactattgaa atggcatcaa catgatgetg cccattccac tgaagttctg 458 
aaatctttcg tcatgtaaat aatttccata tttctctttt ataataaact aatgataact 518 
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105 



PCT/IB01/00914 

538 



<210> 136 
<211> 912 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 97 . .444 



<220> 

<221> polyA_site 
<222> 897 . . 912 



<400> 136 

atctttccgt cccgggcagc cagcgccagt cggagccagc 
ctgccgctgc caagtcctcc acccgctgcc cccgcc atg 

Met 
1 

acg gcc ccc cct get gec ccg get ggg gag ggt 
Thr Ala Pro Pro Ala Ala Pro Ala Gly Glu Gly 

10 15 
cct cca aac etc acc agt aac agg aga ctg cag 
Pro Pro Asn Leu Thr Ser Asn Arg Arg Leu Gin 

25 30 
gtg gat gag gtg gtg gac ate atg agg gtg aac 
Val Asp Glu Val Val Asp lie Met Arg Val Asn 

40 45 
gag cga gac cag aag ctg teg gag ctg gac gac 
Glu Arg Asp Gin Lys Leu Ser Glu Leu Asp Asp 
55 60 65 

cag gcg ggg ccc tec cag ttt gaa aca age gca 
Gin Ala Gly Pro Ser Gin Phe Glu Thr Ser Ala 

75 80 
aaa tac tgg tgg aaa aac etc aag atg atg ate 
Lys Tyr Trp Trp Lys Asn Leu Lys Met Met lie 

90 ~~ 95 

tgc gcc ate ate etc ate ate ate ata gtt tac 
Cys Ala lie He Leu He He He He Val Tyr 

105 110 
taaatccccg aggagtctgc cctgcctaga gaagggcetc 
cctccacctc teagecatat ctttcagccc cccctcccct 
ccgtgtgtgt gtccccctgt aaatagccag ctgttattta 
tatttggtct gtttgtagtt ttattactag atgatttttc 
tcctgaggtt cccttcaccc ctctctcttg ccttccttcc 
agccccaagt cccttcattt geatctgeta tgcaatagtc 
ccctcagatt tagctgatcc ttcctcccac cctggccttc 
tctccccgtc atgctccctc tgccccgccc tcaaaaaaaa 



gcgagccgcc gccgccatca 
tct get acc get gcc 
Ser Ala Thr Ala Ala 
5 

ggt ccc cct gca ccc 
Gly Pro Pro Ala Pro 
20 

cag acc cag gcc cag 
Gin Thr Gin Ala Gin 
35 

gtg gac aag gtc ctg 
Val Asp Lys Val Leu 
50 

cgt gca gat gca etc 
Arg Ala Asp Ala Leu 
70 

gcc aag etc aag cgc 
Ala Lys Leu Lys Arg 
85 

ate ttg gga gtg att 
He Leu Gly Val lie 
100 

ttc age act 
Phe Ser Thr 
115 

tcccccaacc ctcagccgtt 
ggatccgtgt gtgtgtgtgt 
tacatatata atattatata 
cggttgtcct taacacccct 
ctttcccttt cttcctgact 
cctctccttt ccttcttctt 
ctttcctctt tcctcctcac 
aaaaaaaa 
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564 
624 
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744 
804 
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912 



<210> 137 
<211> 562 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 45 . . 335 



<220> 

<221> polyA_site 
<222> 545. .562 
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<220> 

<221> polyA_signal 
<222> 526. .531 



<400> 137 

tgttattttg tagctttaat gatttttgtt ttctaatagg 



ttg gtg ttt 
Leu Val Phe 
5 

tct agt tea 
Ser Ser Ser 

caa act ctt 
Gin Thr Leu 

cca aaa agt 
Pro Lys Ser 

55 

cat ttg ttt 
His Leu Phe 
70 

ttt aac tea 
Phe Asn Ser 
85 

ttatagaatg 
ttgectcaga 
gctccttttt 
aataaacctt 



aga get get tea tat ttt aaa eta 
Arg Ala Ala Ser Tyr Phe Lys Leu 

10 15 
aac cag ttt tta cag cct cct ggg 
Asn Gin Phe Leu Gin Pro Pro Gly 

25 30 
gtg ttg tta cat ttt gag agg ttt 
Val Leu Leu His Phe Glu Arg Phe 
40 45 
gca caa ggt aaa ggt aat tta cag 
Ala Gin Gly Lys Gly Asn Leu Gin 
60 

cat ttc ttg act ttc cct aag cag 
His Phe Leu Thr Phe Pro Lys Gin 
75 

tta ctt tgt ttg atg tgt ctt aca 
Leu Leu Cys Leu Met Cys Leu Thr 
90 95 
ttattcctct gggggaatct tttacaggtg 
ttcaaactgg catcaccata aaccctgtag 
atagttgaaa tacaattttt tattcaccat 
tttttcccta caaaaaaaaa aaaaaaa 



gcaa atg 
Met 
1 

gtt cca 
Val Pro 

tgg gtc 
Trp Val 

tea tac 
Ser Tyr 

cca gaa 
Pro Glu 
65 

ata age 
lie Ser 
80 

tat ttt 
Tyr Phe 



tct eta age 
Ser Leu Ser 



ttc 
Phe 

gtc 
Val 

cag 
Gin 
50 
aca 
Thr 



cac agt 
His Ser 

20 
ttg ace 
Leu Thr 
35 

aat gta 
Asn Val 

aat ata 
Asn lie 



aga aac ttg 
Arg Asn Leu 

tgactaaaag 



gaggaatggg gatagcagta 
gccarggtgg aatgaagtca 
tgtctgeaca aatccttgaa 
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465 
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562 



<210> 138 
<211> 566 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 32. .361 



<220> 

<221> polyA_site 
<222> 551. . 566 



<220> 

<221> polyA_signal 
<222> 532. . 537 



<400> 138 

aacttgetet gcccgctcaa acagttgcag g atg teg atg aca gac ttg ctg 52 

Met Ser Met Thr Asp Leu Leu 
1 5 
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60 65 70 

tec cca gat gec aga gac ctg tct get aaa gaa ace aag atg ctg atg 292 
Ser Pro Asp Ala Arg Asp Leu Ser Ala Lys Glu Thr Lys Met Leu Met 

75 80 85 

get get gga gac aaa gat ggg gac ggc aaa att ggg gtt gac gaa ttc 340 
Ala Ala Gly Asp Lys Asp Gly Asp Gly Lys lie Gly Val Asp Glu Phe 

90 " " 95 ~ 100 

tec act ctg gtg get gaa age taagaagcac tgactgcccc tggtcttcca 391 
Ser Thr Leu Val Ala Glu Ser 
105 110 
cctctctgcc ctcaacaccc aatctcggcc cctcttgcca ccctcctgca tttctgttca 451 
gttcgtttat gttatttttt actcccccat cccctgtggc cctctaatga caccattctt 511 
ctggaaaatg ctggagaagc aataaaggtt gtaccagtca aaaaaaaaaa aaaaa 566 



<210> 139 
<211> 567 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 31. .360 



<220> 

<221> polyA_site 
<222> 552. .567 



<220> 

<221> polyA__signal 
<222> 531. .536 



<400> 139 

acttgetctg cccgctcaaa cagttgeagg atg teg atg aca gac ttg ctg aac 54 

Met Ser Met Thr Asp Leu Leu Asn 
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Val 
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90 95 100 

act ctg gtg get gaa age taagaagcac tgactgcccc tggtcttcca 390 
Thr Leu Val Ala Glu Ser 
105 110 

cctctctgcc ctgaacaccc aatctcggcc cctctcgcca ccctcctgca tttctgttca 450 
gttcgtttat gttatttttt actcccccat cccctgtggc cctctaatga caccattctt 510 
ctggaaaatg ctggagaagc aataaaggtt gtaccagtca caaaaaaaaa aaaaaaa 567 



<210> 140 
<211> 566 
<212> DNA 
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<220> 
<221> CDS 
<222> 32 . .361 



<220> 

<221> polyA_site 
<222> 551. . 566 



<220> 

<221> polyA_signal 
<222> 532. .537 



<400> 140 

aacttgctct gcccgctcaa acagttgcag g atg teg atg aca gac ttg ctg 52 

Met Ser Met Thr Asp Leu Leu 
1 5 

aac get gag gac ate aag aag gcg gtg gga gee ttt age get ace gac 100 

Asn Ala Glu Asp lie Lys Lys Ala Val Gly Ala Phe Ser Ala Thr Asp 

10 15 20 

tec ttc gac cac aaa aag ttc ttc caa atg gtc ggc ctg aag aaa aag 148 

Ser Phe Asp His Lys Lys Phe Phe Gin Met Val Gly Leu Lys Lys Lys 

25 30 35 

agt gcg gat gat gtg aag aag gtg ttt cac atg ctg gac aag gac aaa 196 

Ser Ala Asp Asp Val Lys Lys Val Phe His Met Leu Asp Lys Asp Lys 
40 45 50 55 

agt ggc ttc ate gag gag gat gag ctg gga ttc ate eta aaa ggc ttc 244 

Ser Gly Phe lie Glu Glu Asp Glu Leu Gly Phe lie Leu Lys Gly Phe 

60 65 70 

aaa gaa ace aag atg ctg atg 292 

Lys Glu Thr Lys Met Leu Met 
85 

aaa att ggg gtt gac gaa ttc 340 

Lys lie Gly Val Asp Glu Phe 
100 

:ac tqactqcccc tqqtcttcca 391 



cctctctgcc ctcaacaccc aatctcggcc cctcttgcca ccctcctgca tttctgttca 451 
gttcgtttat gttatttttt actcccccat cccctgtggc cctctaatga caccattctt 511 
ctggaaaatg ctggagaagc aataaaggtt gtaccagtca aaaaaaaaaa aaaaa 566 
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<210> 141 
<211> 625 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 85. .420 



<220> 

<221> polyA_site 
<222> 610. . 625 



<400> 141 

cccttctgcg cggtcacgcc gagccagcgc ctgggcctgg aaccgggccg tagccccccc 60 
agtttcgccc accacctccc tacc atg gac ccc cgc aaa gtg aac gak ctt 111 

Met Asp Pro Arg Lys Val Asn Xaa Leu 

1 5 

egg gee ttt gtg aaa atg tgt aag cag gat ccg age gtt ctg cac acc 159 
Arg Ala Phe Val Lys Met Cys Lys Gin Asp Pro Ser Val Leu His Thr 
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105 
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tagecacagg < 


cttccacgga cttcacgtca 
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lie 


Thr 


Ala 


Gin 


His 


Phe 


Leu 























110 



ttattggctc aactttcctc actatctget tcatccgcca actaatattt cactttacat 510 
ccaaacatca ctttggcttc gaagccgccg cctgatactg gcattttgta gatgtggttt 570 
gactatttct gtatgtctcc atctattgat gagggtctta aaaaaaaaaa aaaat 625 

<210> 142 
<211> 685 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 22. .375 

<220> 

<221> polyA_site 
<222> 670. . 685 

<220> 

<221> polyA_signal 
<222> 650. . 655 

<400> 142 
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Ser 


Gly 


Ser 


Val 
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Tyr 


He 


Thr 


Leu 


Lys 
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Tyr 


Asp 


Gly 


Arg 


Thr 
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Pro 
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Pro 


Lys 
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tgt 


ctg 


tta 


195 


Lys 


Gly 


Thr 


Val 


Glu 


Gly 


Phe 


Glu 


Pro 


Ala 


Asp 


Asn 


Lys 


Cys 


Leu 


Leu 
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get 
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age 
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243 


Arg 


Ala 


Thr 


Asp 
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Lys 


He 


Ser 


Thr 


Val 


Val 


Ser 


Ser 


Lys 
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Arg 
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ace 
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Met 
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Arg 


Asp 


Lys 


Lys 
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Glu 


Thr 
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Lys 


Thr 
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acatttcctg 
ctaggtttct 
gaatagttgc 
taattggtct 
catctctgtg 



110 
ctttcaccaa 
ggttccccca 
agtgtttata 
gtgctgcatg 
aagagcaagc 



ttaaccactg 
cagtaggtgt 
ggatagttgt 
gttatatact 
tatcattaaa 



110 

Thr Ala Ala 

115 
aattgctatt 
tttcacataa 
ggtaagaatc 
cctggattat 
catgtctgtt 



Gin 

ttttcctttt 
gattagggtc 
tagtttattt 
agattaaaag 
tatcaaaaaa 



ggccagatag 
cttttggaaa 
tgcatttggc 
tctctgtaga 
aaaaaaaaaa 



445 
505 
565 
625 
685 



<210> 143 
<211> 553 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 9. .335 



<220> 

<221> polyA_site 
<222> 538 . . 553 



<220> 

<221> polyA_signal 
<222> 520. . 525 



<400> 143 

tcaagaat atg gat agt tta cct tct ggt aaa ata cat cga aaa gtg aaa 50 
Met Asp Ser Leu Pro Ser Gly Lys lie His Arg Lys Val Lys 

cca aat get gaa 98 
Pro Asn Ala Glu 
30 

aac aga att tgt 146 
Asn Arg lie Cys 
45 

act agt gaa gag 194 
Thr Ser Glu Glu 
60 

aag agt ggt ata 242 
Lys Ser Gly lie 
75 

gat aat ctg tta 290 
Asp Asn Leu Leu 

act ggc ttt 335 
Thr Gly Phe 

tagaatttaa aaaatgeata cttttcagaa gtgataagga tcatattctt gaaattttta 395 

taaatatgta tggaaattct taggattttt ttaccagctt tgtttacaga cccaaatgta 455 

aatattaaaa ataaatattt gcaattttct acagaattga atacctgtta aagaaaaatt 515 

acagaataaa cttgtgactg gtaaaaaaaa aaaaaaat 553 







1 








5 










10 


ata 


ata 


tta 


gga 


cga 


aat 


aga 


aaa 


gaa 


aat 


ctg 


gaa 


He 


He 


Leu 


Gly 


Arg 


Asn 


Arg 


Lys 


Glu 


Asn 


Leu 


Glu 


15 










20 










25 




ttt 


gat 


aaa 


aga 


act 


gaa 


ttt 


att 


aca 


caa 


gaa 


gaa 


Phe 


Asp 


Lys 


Arg 


Thr 


Glu 


Phe 


He 


Thr 


Gin 


Glu 


Glu 










35 










40 






agt 


tea 


ccg 


gta 


cag 


tct 


tta 


eta 


gac 


ttg 


ttt 


cag 


Ser 


Ser 


Pro 


Val 


Gin 


Ser 


Leu 


Leu 


Asp 


Leu 


Phe 


Gin 








50 










55 








aaa 


tea 


gaa 


ttt 


ttg 


ggt 


ttc 


aca 


age 


tac 


aca 


gaa 


Lys 


Ser 


Glu 


Phe 


Leu 


Gly 


Phe 


Thr 


Ser 


Tyr 


Thr 


Glu 






65 










70 










tgc 


aat 


gtt 


tta 


gat 


att 


tgg 


gaa 


gag 


gaa 


aat 


tea 


Cys 


Asn 


Val 


Leu 


Asp 


He 


Trp 


Glu 


Glu 


Glu 


Asn 


Ser 




80 










85 










90 


aca 


gcg 


ttt 


ttc 


teg 


tec 


cct 


tea 


act 


tct 


aca 


ttt 


Thr 


Ala 


Phe 


Phe 


Ser 


Ser 


Pro 


Ser 


Thr 


Ser 


Thr 


Phe 


95 










100 










105 





<210> 144 
<211> 631 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 156. .464 



<220> 

<221> polyA_site 
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<220> 

<221> polyA_signal 
<222> 559. .564 



<400> 144 

aattcatatc tgagtgggcg gtggcgattg gtgttggcgg tctggctcag ctgggcaggg 60 
ggtaacttta ctgatttggg ggtggttttt agtttaattt ttcttttcta gcttcccatc 120 
gacggtcagt gcgcacgttg taatcagctg aggcc atg tea gga gac gga gec 173 

Met Ser Gly Asp Gly Ala 
1 5 

acg gag cag gca get gag tat gtc cca gag aag gtg aag aaa gcg gaa 221 
Thr Glu Gin Ala Ala Glu Tyr Val Pro Glu Lys Val Lys Lys Ala Glu 

10 15 20 

aag aaa tta gaa gag aat cca tat gac ctt gat get tgg age att etc 269 
Lys Lys Leu Glu Glu Asn Pro Tyr Asp Leu Asp Ala Trp Ser lie Leu 

25 30 35 

att cga gag gca cag aat caa cct ata gac aaa gca egg aag act tat 317 
lie Arg Glu Ala Gin Asn Gin Pro lie 

40 45 
gaa cgc ctt gtt gee cag ttc ccc agt tct ggc aga ttc tgg aaa ctg 365 
Glu Arg Leu Val Ala Gin Phe Pro Ser 
55 60 

tac att gaa gca gag gtt act att tta ttt tat ttt ttc tta tat cag 413 
Tyr lie Glu Ala Glu Val Thr lie Leu 
75 

tat tgc age att cac tgt agt gat aga aaa caa gtt asg aac ata gee 461 
Tyr Cys Ser lie His Cys Ser Asp A 

90 95 100 

mat taggacaagg aggatttaaa tgtgtcttac ctttattttg taaaataggt 514 
Xaa 

ataaaggagt aattaaaatg aatttttgaa atttgggtct tttacaagct gatgattgtt 574 
gcattttgga gttgcaacaa cattaaaaca gtttcaatga taaaaaaaaa aaaaaaa 631 



gac 


aaa 


gca 


egg 


aag 


act 


tat 


Asp 


Lys 


Ala 


Arg 


Lys 


Thr 


Tyr 






50 










tct 


ggc 


aga 


ttc 


tgg 


aaa 


ctg 


Ser 


Gly 


Arg 


Phe 


Trp 


Lys 


Leu 




65 










70 


ttt 


tat 


ttt 


ttc 


tta 


tat 


cag 


Phe 


Tyr 


Phe 


Phe 


Leu 


Tyr 


Gin 


80 










85 




aaa 


caa 


gtt 


asg 


aac 


ata 


gee 


Lys 


Gin 


Val 


Xaa 


Asn 


He 


Ala 



<210> 145 
<211> 635 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 160. .375 



<220> 

<221> polyA_site 
<222> 620. . 635 



<220> 

<221> polyA_signal 
<222> 569. . 574 



<400> 145 

ttttaattca tatctgagtg ggcggtggcg attggtgttg gcggtctggc tcagctgggc 60 
agggggtaac tttactgatt tgggggtggt ttttagttta atttttcttt tctagcttcc 120 
catcgaeggt cagtgcgcac gttgtaatca getgaggee atg tea gga gac gga 174 

Met Ser Gly Asp Gly 
1 5 

gee acg gag cag gca get gag tat gtc tea gag aag gtg aag aaa gcg 222 
Ala Thr Glu Gin Ala Ala Glu Tyr Val Ser Glu Lys Val Lys Lys Ala 

10 15 20 

gaa aag aaa tta gaa gag aat cca tat gac ctt gat get tgg age att 270 
Glu Lys Lys Leu Glu Glu Asn Pro Tyr Asp Leu Asp Ala Trp Ser He 
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25 30 
etc att cga gag gca cag aat caa cct ata gac 
Leu lie Arg Glu Ala Gin Asn Gin Pro lie Asp 

40 45 
tat gaa cgc ctt gtt gec cag ttc ccc agt tct 
Tyr Glu Arg Leu Val Ala Gin Phe Pro Ser Ser 

55 60 
ctg tac att taagcagagg ttactatttt attttatttt 
Leu Tyr lie 
70 

agtattgcag cattcactgt agtgatagaa aamaagttas 
aggaggattt aaatttgtct tacctttatt ttgtaaaata 
atgaattttt gaaatttggg tcttttacaa gctgatgatt 
caacattaaa acagtttcaa tggcaaaaaa aaaaaaaaaa 



35 



aaa gca egg aag act 
Lys Ala Arg Lys Thr 
50 

ggc aga ttc tgg aaa 
Gly Arg Phe Trp Lys 
65 

ttcttatatc 



gaacatagee aattaggaca 
ggtataaagg agtaattaaa 
gttgcatttt ggagttgcaa 



318 



366 



415 



475 
535 
595 
635 



<210> 
<211> 
<212> 
<213> 



146 
733 
DNA 
Homo 



sapiens 



<220> 
<221> CDS 
<222> 32. .349 

<220> 

<221> polyA_site 
<222> 718. .733 



<220> 

<221> polyA_signal 
<222> 711. .716 



<400> 146 

actaactagc aaaeggggae tagaaatagg g atg ctg aaa age aac ggg gag 52 

Met Leu Lys Ser Asn Gly Glu 
1 5 



aga 


cgc 


agt 


cgt 


aac 


gca 


ctt 


ccg 


gcg 


gtc 


tac 


gcg 


agg 


aag 


atg 


get 


100 


Arg 


Arg 


Ser 


Arg 


Asn 


Ala 


Leu 


Pro 


Ala 


Val 


Tyr 


Ala 


Arg 


Lys 


Met 


Ala 








10 










15 










20 










gca 


tec 


cag 


cag 


caa 


get 


tea 


gcg 


get 


tec 


tea 


get 


get 


ggt 


gta 


teg 


148 


Ala 


Ser 


Gin 


Gin 


Gin 


Ala 


Ser 


Ala 


Ala 


Ser 


Ser 


Ala 


Ala 


Gly 


Val 


Ser 






25 










30 










35 












ggt 


cct 


agt 


teg 


get 


ggc 


ggc 


ccg 


ggt 


ccc 


cag 


cag 


cag 


ccg 


caa 


ccg 


196 


Gly 


Pro 


Ser 


Ser 


Ala 


Gly 


Gly 


Pro 


Gly 


Pro 


Gin 


Gin 


Gin 


Pro 


Gin 


Pro 




40 










45 










50 










55 




cca 


gca 


caa 


ctg 


gtg 


ggc 


cct 


gee 


cag 


age 


ggc 


etc 


ctg 


cag 


caa 


cag 


244 


Pro 


Ala 


Gin 


Leu 


Val 


Gly 


Pro 


Ala 


Gin 


Ser 


Gly 


Leu 


Leu 


Gin 


Gin 


Gin 












60 










65 










70 






caa 


cag 


gac 


ttc 


gat 


cct 


gtg 


cag 


cgt 


tat 


aag 


atg 


etc 


ate 


ccg 


cag 


292 


Gin 


Gin 


Asp 


Phe 


Asp 


Pro 


Val 


Gin 


Arg 


Tyr 


Lys 


Met 


Leu 


He 


Pro 


Gin 










75 










80 










85 








ctg 


aag 


gag 


agt 


eta 


cag 


gtg 


att 


ggc 


ctt 


aag 


cag 


cga 


gaa 


gca 


aac 


340 


Leu 


Lys 


Glu 


Ser 


Leu 


Gin 


Val 


He 


Gly 


Leu 


Lys 


Gin 


Arg 


Glu 


Ala 


Asn 





90 95 100 



tgg att tgg tagtctagag gggcaggccg rgggccaggt tagteggaga 389 
Trp He Trp 
105 

gagegctaag cagaaagagg aatagaacct gctgttttgg cctggctggt gcacgcctgt 44 9 

gttcccagca ctttgggagg ccgaggcggg eggatcaett gaggtcagga gttcaagacc 509 

agcctggtga aacctcgtct ctactaaaaa tacaaaaatt ageegggcat ggtggcgcgc 569 

gectgtaatt acagctactc gggaagctga ggcaggagaa tegcttgaac ecaggaggeg 629 

gaggttgcag tgagecgaca tegtgecatt gcactccagc ctgggcaaca gagegagetc 689 

cgtcaawaaa ataaataaat aaataaataa aaaaaaaaaa aaaa 733 
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<210> 147 
<211> 521 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 111. .368 

<220> 

<221> polyA_site 
<222> 506. . 521 

<220> 

<221> polyA_signal 
<222> 483. . 488 

<400> 147 

tctcagcttc cggctggtag tagttccgct tcctgtccga ctgtggtgtc tttgctgagg 60 
gtcacattga gctgcaggtt gaatccgggg tgcctttagg attcagcacc atg gcg 116 

Met Ala 
1 



gaa 


gac 


atg 


gag 


ace 


aaa 


ate 


aag 


aac 


tac 


aag 


acc 


gee 


cct 


ttt 


gac 


164 


Glu 


Asp 


Met 
5 


Glu 


Thr 


Lys 


He 


Lys 
10 


Asn 


Tyr 


Lys 


Thr 


Ala 
15 


Pro 


Phe 


Asp 




age 


cgc 


ttc 


ccc 


aac 


cag 


aac 


cag 


act 


aga 


aac 


tgc 


tgg 


cag 


aac 


tac 


212 


Ser 


Arg 
20 


Phe 


Pro 


Asn 


Gin 


Asn 
25 


Gin 


Thr 


Arg 


Asn 


Cys 
30 


Trp 


Gin 


Asn 


Tyr 




ctg 


gac 


ttc 


cac 


cgc 


tgt 


cag 


aag 


gca 


atg 


ace 


get 


aaa 


gga 


ggc 


gat 


260 


Leu 


Asp 


Phe 


His 


Arg 


Cys 


Gin 


Lys 


Ala 


Met 


Thr 


Ala 


Lys 


Gly 


Gly 


Asp 




35 










40 










45 










50 




ate 


tct 


gtg 


tgc 


gaa 


tgg 


tac 


cag 


cgt 


gtg 


tac 


cag 


tec 


etc 


tgc 


ccc 


308 


lie 


Ser 


Val 


Cys 


Glu 
55 


Trp 


Tyr 


Gin 


Arg 


Val 
60 


Tyr 


Gin 


Ser 


Leu 


Cys 
65 


Pro 




aca 


tec 


tgg 


gtc 


aca 


gac 


tgg 


gat 


gag 


caa 


egg 


get 


gaa 


ggc 


acg 


ttt 


356 


Thr 


Ser 


Trp 


Val 
70 


Thr 


Asp 


Trp 


Asp 


Glu 
75 


Gin 


Arg 


Ala 


Glu 


Gly 
80 


Thr 


Phe 





ccc ggg aag aty tgaactggct gcatctccct ttcctctgtc ctccatcctt 408 
Pro Gly Lys He 
85 

ctcccaggat ggtgaagggg gacctggtac ccagtgatcc ccaccccagg atcctaaatc 468 
atgacttacc tgctaataaa aactcattgg aaaagtcaaa aaaaaaaaaa aaa 521 



<210> 148 
<211> 547 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 95 . .409 

<220> 

<221> polyA_site 
<222> 533. .547 

<220> 

<221> polyA_signal 
<222> 509. . 514 

<400> 148 

ttcgaagtgg ttgtctctgc actatttact gaaaagttca tcctttcccc agtgatttgt 60 
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aacactgcct ctttcataaa ttaaattttt gtgt atg tgt gtg ggt ctt ttg atg 115 

Met Cys Val Gly Leu Leu Met 



gtt 


tct 


att 


ctg 


act 


gac 


ate 


aat 


ttg 


1 

tct 


aat 


ctt 


gta 


5 

gca 


gta 


cag 


163 


Val 


Ser 


lie 


Leu 


Thr 


Asp 


He 


Asn 


Leu 


Ser 


Asn 


Leu 


Val 


Ala 


Val 


Gin 








10 










15 










20 










tac 


agt 


cct 


gat 


tat 


tgc 


aac 


ttt 


agg 


aaa 


agg 


tct 


gat 


aaa 


aac 


caa 


211 


Tyr 


Ser 


Pro 


Asp 


Tyr 


Cys 


Asn 


Phe 


Arg 


Lys 


Arg 


Ser 


Asp 


Lys 


Asn 


Gin 






25 










30 










35 












gat 


gec 


tec 


aca 


ttt 


tgt 


cat 


aat 


tgt 


aac 


caa 


ttt 


cac 


etc 


tgt 


etc 


259 


Asp 


Ala 


Ser 


Thr 


Phe 


Cys 


His 


Asn 


Cys 


Asn 


Gin 


Phe 


His 


Leu 


Cys 


Leu 




40 










45 










50 










55 




cag 


tat 


cac 


cac 


aaa 


att 


gtt 


ctt 


cct 


tgg 


agt 


gtc 


ttg 


get 


att 


ctt 


307 


Gin 


Tyr 


His 


His 


Lys 


He 


Val 


Leu 


Pro 


Trp 


Ser 


Val 


Leu 


Ala 


He 


Leu 












60 










65 










70 






age 


caa 


ctg 


ttc 


etc 


cat 


att 


act 


tct 


aga 


att 


aga 


cca 


aca 


att 


tat 


355 


Ser 


Gin 


Leu 


Phe 


Leu 


His 


He 


Thr 


Ser 


Arg 


He 


Arg 


Pro 


Thr 


He 


Tyr 










75 










80 










85 








aaa 


tea 


aac 


aaa 


ccc 


aag 


age 


att 


gaa 


att 


ttg 


att 


gga 


ttt 


gta 


ttg 


403 


Lys 


Ser 


Asn 


Lys 


Pro 


Lys 


Ser 


He 


Glu 


He 


Leu 


He 


Gly 


Phe 


Val 


Leu 





90 95 100 



aat tta tagattaatc tggtaaacca tgtcatcttt acaatgttgt cttccaatac 459 
Asn Leu 
105 

atgaatatgg tacagctctt catttactta ggcctttgaa atatctttca ataaagttta 519 
taattttctc cataaaaaaa aaaaaaaa 547 

<210> 149 

<211> 3159 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 103. .405 

<400> 149 

acttceggtg cgaaccgcct cggccgttcc ctcgcggact tactgagege ggccgccgag 60 
cccagctccg ccgccgagcg cctgtgccgg cacggctaca cc atg gag cgc ccg 114 

Met Glu Arg Pro 

1 



gat 


aag 


gcg 


gcg 


ctg 


aac 


gca 


ctg 


cag 


cct 


cct 


gag 


ttc 


aga 


aat 


gaa 


162 


Asp 


Lys 


Ala 


Ala 


Leu 


Asn 


Ala 


Leu 


Gin 


Pro 


Pro 


Glu 


Phe 


Arg 


Asn 


Glu 




5 










10 










15 










20 




age 


tea 


tta 


gca 


tct 


aca 


ctg 


aag 


acg 


etc 


ctg 


ttc 


ttc 


aca 


get 


tta 


210 


Ser 


Ser 


Leu 


Ala 


Ser 


Thr 


Leu 


Lys 


Thr 


Leu 


Leu 


Phe 


Phe 


Thr 


Ala 


Leu 












25 










30 










35 






atg 


ate 


act 


gtt 


cct 


att 


ggg 


tta 


tat 


ttc 


aca 


act 


aaa 


tct 


tac 


ata 


258 


Met 


He 


Thr 


Val 


Pro 


He 


Gly 


Leu 


Tyr 


Phe 


Thr 


Thr 


Lys 


Ser 


Tyr 


He 










40 










45 










50 








ttt 


gaa 


ggc 


gee 


ctt 


ggg 


atg 


tec 


aat 


agg 


gac 


age 


tat 


ttt 


tac 


get 


306 


Phe 


Glu 


Gly 


Ala 


Leu 


Gly 


Met 


Ser 


Asn 


Arg 


Asp 


Ser 


Tyr 


Phe 


Tyr 


Ala 








55 










60 










65 










get 


att 


gtt 


gca 


gtg 


gtc 


gee 


gtc 


cat 


gtg 


gtg 


ctg 


gee 


etc 


ttt 


gtg 


354 


Ala 


He 


Val 


Ala 


Val 


Val 


Ala 


Val 


His 


Val 


Val 


Leu 


Ala 


Leu 


Phe 


Val 






70 










75 










80 












tat 


gtg 


gee 


tgg 


aat 


gaa 


ggc 


tea 


cga 


cag 


tgg 


cgt 


gaa 


ggc 


aaa 


cag 


402 


Tyr 


Val 


Ala 


Trp 


Asn 


Glu 


Gly 


Ser 


Arg 


Gin 


Trp 


Arg 


Glu 


Gly 


Lys 


Gin 




85 










90 










95 










100 





gat taaagtgaac atcacctttt tatagcatta aattcatttt ttaaaatgat 455 
Asp 

aaatgctgga gggggecate tgatttgaat aaagttgaaa gaacatgtta aagtcagtct 515 

taaggagtca cgtttgagta tgtaaatttt gatctttcta atatgttggt ttgtatattc 575 
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agttttaact 
atatgttaat 
gttttatagt 
tgcacacagt 
ataccaggtc 
actggaatta 
agaattatcc 
aaggacctta 
tcagcctgct 
taatgagata 
ggaagctcca 
aatttgcatt 
attttttttt 
tttattttcg 
ctaaaaacac 
tagaaaactg 
gccatctaaa 
cttaatgcta 
ttcctgacag 
tttctttatt 
agtgcagtgg 
cacctcagcc 
tttttgtatt 
gtgacctcag 
taccacgccc 
aaattcaaat 
accattaccg 
agagggatga 
aagatcagta 
tataatgaca 
aagtattgtt 
cttgtgcttt 
ttttggtttc 
taaaccaggg 
tgaaggttaa 
tcaaagcagt 
ctatgttaaa 
tttcctttta 
cttatacatg 
aacctcagtt 
acggtaacac 
catttgtgtt 
aatggatacc 
ttta 



gtatgaatct 
ttaattagac 
attctgactt 
gaagcaaatt 
cttggatttg 
atctgaatgt 
attacattat 
gcttactgaa 
tccttcaagt 
ctttgcttcc 
tttaaaattg 
taaatatgtt 
catccaaaaa 
tatctttagt 
tcccacaagt 
gaaaaataca 
aataacctct 
gtttgataca 
ctatttgcat 
ttatttattt 
tgcaatctcg 
tcccaagtag 
tttttgtaga 
gtgatccacc 
ggccttattg 
tttctgttac 
atttgttgtt 
ttagtcataa 
tgattattat 
tttgagtcta 
gtatttagaa 
tgggatctta 
aggcatttgc 
gcatatgcaa 
aatcaaatgc 
tttttcctag 
taacatatgg 
cacaatgtat 
ttttaaggta 
aggagatata 
attaccttcc 
gagaaatgaa 
attgtacata 



gatttgcaaa 
aatactctgg 
acggttgctt 
aaactccact 
ggattttaat 
cactgaggaa 
tttatgggaa 
ggatctaaag 
ccccttgcag 
tcattcaaca 
tttttttact 
catttttgca 
gtattctcct 
ctgtgttctt 
atttgttttt 
tattcaaaca 
attttagttg 
cactaaaagt 
ttttttcaga 
atttttttga 
gctcactgca 
ctgggactac 
gacggggttt 
cgcctcggcc 
accattttct 
tgataatcct 
cacatttatt 
aagtcaaata 
ggaactgttt 
gggtagtatg 
ttgtccacct 
tgctgtttgt 
tgaataggtg 
aggcagttgt 
tagggttgat 
agttttaact 
tatttggcaa 
atacttcaag 
gaaatatgtt 
acttgaagtg 
tcacctcctg 
aagtctgctg 
taacagtaga 



115 

tgagaatttg 
aaggaatttt 
ttgagtttta 
ttacgctgga 
ttcctatgga 
tttcacatga 
aactaggcta 
agcaaagcaa 
gccagctttg 
ttgaagctag 
attttttaaa 
tatgttagga 
tgggcatatc 
tctagttatt 
taattataaa 
gaaaaaaaaa 
atatcccgta 
ttagcttaca 
tgagtgatta 
gacagagttt 
acctctgcct 
ggatgcctgc 
caccatgttg 
tccgaaagtg 
aaataagcac 
aatactagga 
ttttatgttg 
gcatttgtgt 
ttaattcttg 
tggtagattt 
aatttctttt 
aaaatgttac 
atgatacatg 
aatttcctct 
atttaggctt 
gttaactcac 
atagatttat 
atgtatagaa 
ccctattgtt 
tcagtccaaa 
tttggccctg 
tagaatgtat 
atttggtttg 



gaaaagttag 
atcttctttc 
ctcatktgga 
atgctttctt 
aagttgctta 
agtgtaatcc 
aattacatcc 
agatctcact 
tgctttgcag 
gcttcaatta 
attgtagtgt 
gtggaaacaa 
tgatggaaaa 
tggtactaat 
atcatagtat 
tcaaaattcc 
ttcatttttg 
agttcaaatt 
ttggccattt 
tgctctgttg 
cctgggttca 
caccacgcct 
tccaggctaa 
ctgggattac 
attctatctt 
ttcttgctta 
tgaaactgga 
ttaactgttg 
cttaaagact 
ctagatggtc 
tatataatgc 
tgtccaatgt 
ggtatttttc 
tggaaaaagc 
ataacaaaat 
tagtttgctg 
ttttcaaaat 
aggaaagcta 
tgaaaactga 
ctactgattt 
gagaatgtag 
ctgatgtcat 
gggttgttag 



ttacaaagaa 
aacaaaacat 
tatattaaga 
tagcatgaaa 
aattgtggac 
ctagtcaata 
attcaggtaa 
actcaaacac 
accaactttt 
aaaggttcga 
atatgatagg 
tctggaaaac 
aaaccttgat 
tatgtgcaat 
atgttctttg 
ccataatgtt 
aaagccattc 
ctgccagctt 
tctttttctt 
cccaggctgg 
agtgattctc 
ggctaatttt 
tcttgaactt 
aggcgtgagc 
tattctctta 
agtatgtgaa 
ctaaaggaat 
agaaaagtga 
acaattttag 
cctaattaag 
caaggtattt 
tggattattg 
tgcaagtatt 
gccaaatgtt 
aggcttgttt 
ctgtttttaa 
gtctcactag 
cagttgagcc 
ttgtaagaat 
aaccctattt 
tcctttttct 
tagttcttca 
tgaaaaaaaa 



635 
695 
755 
815 
875 
935 
995 
1055 
1115 
1175 
1235 
1295 
1355 
1415 
1475 
1535 
1595 
1655 
1715 
1775 
1835 
1895 
1955 
2015 
2075 
2135 
2195 
2255 
2315 
2375 
2435 
2495 
2555 
2615 
2675 
2735 
2795 
2855 
2915 
2975 
3035 
3095 
3155 
3159 



<210> 150 

<211> 1033 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 30 . . 305 



<400> 150 

agaatgtcgt actctcagga ggctccacc atg ttc agg gat ttc gga cgc cga 53 

Met Phe Arg Asp Phe Gly Arg Arg 

1 5 

ctg cag agg gat ttg aag aga gtg gtg gat get agg ctg agg etc age 101 

Leu Gin Arg Asp Leu Lys Arg Val Val Asp Ala Arg Leu Arg Leu Ser 

10 15 20 

gag gag etc ars sgc ggg agg ate aag ccg aag cct gtg gag gtc cag 14 9 

Glu Glu Leu Xaa Xaa Gly Arg lie Lys Pro Lys Pro Val Glu Val Gin 
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25 30 35 40 

gtg gtc acg cat cac atg cag cgc tac gcc gtg tgg ttc gga ggc tec 
Val Val Thr His His Met Gin Arg Tyr Ala Val Trp Phe Gly Gly Ser 

45 50 55 

atg ctg gcc teg act ccc gag ttc ttt cag gtc tgc cac acc aag aag 
Met Leu Ala Ser Thr Pro Glu Phe Phe Gin Val Cys His Thr Lys Lys 

60 65 70 

gac tat gaa gag tac ggg ccc age ate tgc cgc cac aac ccc gtc ttt 
Asp Tyr Glu Glu Tyr Gly Pro Ser lie Cys Arg His Asn Pro Val Phe 

7 5 8 0 8 5 

gga gtc atg tec tagtgtctgc ctgaacgcgt cgttcgatgg tgtcacgttg 
Gly Val Met Ser 
90 

gggaacaagt gtccttcaga acccagagaa ggccgccgtt ctgtaaatag egaegteggt 
gttgctgccc agcagcgtgc ttgeattgee ggtgcatgag gcgcggcgcg ggcccttcag 
taaaagecat ttatccgtgt gccgaccgct gtctgccagc ctcctccttc tcccgccctc 
ctcaccctcg ctctccctcc tcctcctcct ccgagctgct agctgacaaa tacaattctg 
aaggaatcca aatgtgactt tgaaaattgt tagagaaaac aacattagaa aatggcgcaa 
aatcgttagg tcccaggaga gaatgtgggg gcgcaaaccc ttttcctccc agectatttt 
tgtaaataaa atgtttaaac ttgaaataca aatcgatgtt tatatttcct atcattttgt 
attttatggt atttggtaca actggctgat actaagcacg aatagatatt gatgttatgg 
agtgctgtaa tccaaagttt ttaattgtga ggcatgttct gatatgttta taggcaaaca 
aataaaacag caaacttttt tgccacatgt ttgctagaaa atgattatac tttattggag 
tgacatgaag tttgaacact aaacagtaat gtatgagaat tactacagat acatgtatct 
tttagttttt tttgtttgaa ctttctgg 



197 



245 



293 



345 



405 
465 
525 
585 
645 
705 
765 
825 
885 
945 
1005 
1033 



<210> 151 

<211> 1033 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 30. .305 



<400> 151 

agaatgtcgt actctcagga ggctccacc atg ttc agg gat ttc gga cgc cga 53 

Met Phe Arg Asp Phe Gly Arg Arg 
15 

ctg cag agg gat ttg aag aga gtg gtg gat get agg ctg agg etc age 101 
Leu Gin Arg Asp Leu Lys Arg Val Val Asp Ala Arg Leu Arg Leu Ser 

10 L 15 * 20 

gag gag etc age ggc ggg agg ate aag ccg aag cct gtg gag gtc cag 14 9 

Glu Glu Leu Ser Gly Gly Arg lie Lys Pro Lys Pro Val Glu Val Gin 
25 30 35 40 

gtg gtc acg cat cac atg cag cgc tac gcc gtg tgg ttc gga ggc tec 197 
Val Val Thr His His Met Gin Arg Tyr Ala Val Trp Phe Gly Gly Ser 

45 50 55 

atg ctg gcc teg act ccc gag ttc ttt cag gtc tgc cac acc aag aag 245 
Met Leu Ala Ser Thr Pro Glu Phe Phe Gin Val Cys His Thr Lys Lys 

60 65 70 

gac tat gaa gag tac ggg ccc age ate tgc cgc cac aac ccc gtc ttt 293 
Asp Tyr Glu Glu Tyr Gly Pro Ser lie Cys Arg His Asn Pro Val Phe 

7 5 80 " 85 

gga gtc atg tec tagtgtctgc ctgaacgcgt cgttcgatgg tgtcacgttg 345 
Gly Val Met Ser 

90 

gggaacaagt gtccttcaga acccagagaa ggccgccgtt ctgtaaatag egaegteggt 405 
gttgctgccc agcagcgtgc ttgeattgee ggtgcatgag gcgcggcgcg ggcccttcag 4 65 
taaaagecat ttatccgtgt gccgaccgct gtctgccagc ctcctccttc tcccgccctc 525 
ctcaccctcg ctctccctcc tcctcctcct ccgagctgct agctgacaaa tacaattctg 585 
aaggaatcca aatgtgactt tgaaaattgt tagagaaaac aacattagaa aatggcgcaa 645 
aatcgttagg tcccaggaga gaatgtgggg gcgcaaaccc ttttcctccc agectatttt 705 
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tgtaaataaa atgtttaaac ttgaaataca aatcgatgtt tatatttcct atcattttgt 765 

attttatggt atttggtaca actggctgat actaagcacg aatagatatt gatgttatgg 825 

agtgctgtaa tccaaagttt ttaattgtga ggcatgttct gatatgttta taggcaaaca 885 

aataaaacag caaacttttt tgccacatgt ttgctagaaa atgattatac tttattggag 945 

tgacatgaag tttgaacact aaacagtaat gtatgagaat tactacagat acatgtatct 1005 

tttagttttt tttgtttgaa ctttctgg 1033 



<210> 152 
<211> 688 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 159. . 452 



<220> 

<221> polyA__site 
<222> 678. . 688 



<220> 

<221> polyA_signal 
<222> 670. . 675 



<400> 152 

atacgtacac caatgcggac aagctcctag aagcagcaga gcagttggct cagacggggg 60 
aatgtgaccc cgaggagatc tacaaggcag ctcgacacct ggaggtgcgc atccaagact 120 
tcgtgcgcag ggtggagcag cggaagcttc tcctggac atg tct gtt tec ttc cac 176 

Met Ser Val Ser Phe His 
1 5 

aca cac acc aaa gag ttg tgg aca tgg atg gaa gac ctt cag aag gag 224 
Thr His Thr Lys Glu Leu Trp Thr Trp Met Glu Asp Leu Gin Lys Glu 

10 15 20 

atg ttg gag gat gtc tgt gca gat tct gtg gat gca gtc cag gaa ctg 272 
Met Leu Glu Asp Val Cys Ala Asp Ser Val Asp Ala Val Gin Glu Leu 

25 30 35 

ate aag cag ttc cag cag cag cag acc gec act eta gat gec aca etc 320 
lie Lys Gin Phe Gin Gin Gin Gin Thr Ala Thr Leu Asp Ala Thr Leu 

40 45 50 

aat gtc ate aag gaa ggc gaa gac ctt ate cag cag etc agg tea gcg 368 
Asn Val lie Lys Glu Gly Glu Asp Leu lie Gin Gin Leu Arg Ser Ala 
55 60 65 70 

cct ccc tec etc ggg gag ccc age gag gec agg tea gca tgg gca gag 416 
Pro Pro Ser Leu Gly Glu Pro Ser Glu Ala Arg Ser Ala Trp Ala Glu 

75 80 85 

ctt tec agt ggg aaa tgc etc ggg eta gac gtg aga taagtcctgt 462 
Leu Ser Ser Gly Lys Cys Leu Gly Leu Asp Val Arg 

90 ^ 95 
cccagtcttg actcagcaaa atettgettt gtggtcttgt gtgagcttcg gtttctttat 522 
ctgtaagatg taaaaatgee tgcaccatac tcaacttgaa tctagaataa aatagtaaat 582 
ggaaaagctt taatgtttca aagaattatt tatcattcag gatgttattc agecatcagt 642 
aagtaacata aatacaacaa aaattaacat aaaacaaaaa aaaaaa 688 



<210> 153 
<2U> 420 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 63. .329 



<220> 



WO 02/083898 PCT/IB01/00914 

118 

<221> polyA_site 
<222> 406. .420 



<400> 153 

atgctcgcca ggataccccc tcttcccatg gcgacctcag accgcgggcg tcagcgttgg 60 
eg atg gtc tec gga gac ggc ttc etc gtt tec agg cct gaa gcg att 107 
Met Val Ser Gly Asp Gly Phe Leu Val Ser Arg Pro Glu Ala lie 
15 10 15 



cat 


eta 


gga 


cct 


egg 


cag 


gcg 


gtg 


cga 


cca 


age 


gtt 


egg 


gee 


gag 


age 


155 


His 


Leu 


Gly 


Pro 


Arg 


Gin 


Ala 


Val 


Arg 


Pro 


Ser 


Val 


Arg 


Ala 


Glu 


Ser 












20 










25 










30 






cgt 


cga 


gtg 


gat 


ggt 


ggc 


ggc 


egg 


age 


cca 


agg 


gaa 


cca 


gat 


ggc 


egg 


203 


Arg 


Arg 


Val 


Asp 


Gly 


Gly 


Gly 


Arg 


Ser 


Pro 


Arg 


Glu 


Pro 


Asp 


Gly 


Arg 










35 










40 










45 








ggc 


egg 


age 


cgc 


caa 


gcg 


aga 


ttc 


tea 


cct 


tac 


cca 


ate 


cct 


gec 


gtt 


251 


Gly 


Arg 


Ser 


Arg 


Gin 


Ala 


Arg 


Phe 


Ser 


Pro 


Tyr 


Pro 


He 


Pro 


Ala 


Val 








50 










55 










60 










gaa 


ccc 


gat 


etc 


eta 


aga 


agt 


gtg 


ctg 


caa 


cag 


cgt 


ttg 


att 


gca 


tta 


299 


Glu 


Pro 


Asp 


Leu 


Leu 


Arg 


Ser 


Val 


Leu 


Gin 


Gin 


Arg 


Leu 


He 


Ala 


Leu 






65 










70 










75 












gga 


ggt 


gtt 


ate 


gca 


get 


cga 


att 


tea 


gtt 


taaacgaaca cctttcctct 


349 


Gly 


Gly 


Val 


He 


Ala 


Ala 


Arg 


He 


Ser 


Val 

















80 85 

ggccctcact tagcttgtga acaggecttt ttaaaatcct ttcttggtgt agcagcaaaa 409 
aaaaaaaaaa a 420 



<210> 154 
<211> 640 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 181. .549 



<220> 

<221> polyA_site 
<222> 626. . 640 



<400> 154 

atgctcgcca gaatacccct cgttaaaggc aaggeggett ctggctcttc cgcaggctca 60 

gttatgggtt cctgtgtgct tagtcategg gtccgacaca ggctggactg atctggggag 120 

ccgcgaaggg cctgccttca caaagggacg taacgcaagt attgegggea gtgtttgaat 180 



atg 


gee 


ctg 


aac 


aat 


gtg 


tec 


ctg 


tec 


tec 


ggt 


gat 


cag 


agg 


age 


agg 


228 


Met 


Ala 


Leu 


Asn 


Asn 


Val 


Ser 


Leu 


Ser 


Ser 


Gly 


Asp 


Gin 


Arg 


Ser 


Arg 




1 








5 










10 










15 






gtg 


gee 


tac 


cgc 


tct 


tec 


cat 


ggc 


gac 


etc 


aga 


ccg 


egg 


gcg 


tea 


gcg 


276 


Val 


Ala 


Tyr 


Arg 


Ser 


Ser 


His 


Gly 


Asp 


Leu 


Arg 


Pro 


Arg 


Ala 


Ser 


Ala 










20 










25 










30 








ttg 


gcg 


atg 


gtc 


tec 


gga 


gac 


ggc 


ttc 


etc 


gtt 


tec 


agg 


cct 


gaa 


gcg 


324 


Leu 


Ala 


Met 


Val 


Ser 


Gly 


Asp 


Gly 


Phe 


Leu 


Val 


Ser 


Arg 


Pro 


Glu 


Ala 








35 










40 










45 










att 


cat 


eta 


gga 


cct 


egg 


cag 


gcg 


gtg 


cga 


cca 


age 


gtt 


egg 


gec 


gag 


372 


He 


His 


Leu 


Gly 


Pro 


Arg 


Gin 


Ala 


Val 


Arg 


Pro 


Ser 


Val 


Arg 


Ala 


Glu 






50 










55 










60 












age 


cgt 


cga 


gtg 


gat 


ggt 


ggc 


ggc 


egg 


age 


cca 


agg 


gaa 


cca 


gat 


ggc 


420 


Ser 


Arg 


Arg 


Val 


Asp 


Gly 


Gly 


Gly 


Arg 


Ser 


Pro 


Arg 


Glu 


Pro 


Asp 


Gly 




65 










70 










75 










80 




egg 


ggc 


egg 


age 


cgc 


caa 


gee 


aga 


ttc 


tea 


cct 


tac 


cca 


ate 


cct 


gec 


468 


Arg 


Gly 


Arg 


Ser 


Arg 


Gin 


Ala 


Arg 


Phe 


Ser 


Pro 


Tyr 


Pro 


He 


Pro 


Ala 












85 










90 










95 






gtt 


gaa 


ccc 


gat 


etc 


eta 


aga 


agt 


gtg 


ctg 


caa 


cag 


cgt 


ttg 


att 


gca 


516 


Val 


Glu 


Pro 


Asp 


Leu 


Leu 


Arg 


Ser 


Val 


Leu 


Gin 


Gin 


Arg 


Leu 


He 


Ala 
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100 105 110 

tta gga ggt gtt ate gca get cga att tea gtt taaacgaaca cctttcctct 
Leu Gly Gly Val lie Ala Ala Arg lie Ser Val 

115 120 
ggccctcact tagcttgtga acaggecttt ttaaaatcct ttcttggtgt agcagcaaaa 
aaaaaaaaaa a 



569 



629 
640 



<210> 155 

<211> 1582 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 8. .523 

<220> 

<221> polyA_site 
<222> 1567 . . 1582 



<400> 155 

agtcaag atg gcg gga gca get ace cag get tec 
Met Ala Gly Ala Ala Thr Gin Ala Ser 
1 5 
egg ate atg egg ctg gtg gee gaa tgc age cgc 
Arg lie Met Arg Leu Val Ala Glu Cys Ser Arg 
15 20 25 

ggc gag ctg tgg ctg ccg cat ggg aca gtg gee 
Gly Glu Leu Trp Leu Pro His Gly Thr Val Ala 

35 40 
cca gtg ggc acg cag gec acc atg aag ggc ate 
Pro Val Gly Thr Gin Ala Thr Met Lys Gly lie 

50 55 
gac get ctg ggt tgc cgc ate tgc ctg ggc aat 
Asp Ala Leu Gly Cys Arg lie Cys Leu Gly Asn 

65 70 
eta agg ccg gga ccc gag ctg ate cag aaa gee 
Leu Arg Pro Gly Pro Glu Leu lie Gin Lys Ala 

80 85 
ttc atg aat tgg cct cat aat ctg eta acg gac 
Phe Met Asn Trp Pro His Asn Leu Leu Thr Asp 
95 100 105 

atg gtg teg ctg gtg tct ctg tec gag gtg acg 
Met Val Ser Leu Val Ser Leu Ser Glu Val Thr 

115 120 
ttc cgc tec ccc tac gac ggc aat gag acc ctg 
Phe Arg Ser Pro Tyr Asp Gly Asn Glu Thr Leu 



130 



135 



tec gtg cag ate cag aat gcg ctg ggc teg gac 
Ser Val Gin lie Gin Asn Ala Leu Gly Ser Asp 

145 150 
gac gac gtg gtt age agt act gtg act ggg cca 
Asp Asp Val Val Ser Ser Thr Val Thr Gly Pro 

160 165 
taggecatgt acaggtcaat ccgctggctg gaeeggtgea 
gacaagcaga acctcttcgc cattatccag ggtgggctgg 
tgccttgaag agatgaccaa gcgagacgtg cctggcttcg 
ggtgagagca agtcgcagtt ctggcggatg gtggcgctga 
gacaagcccc gatatctgat gggggttggg tatgttgtgg 
cctgtgggaa gtggattcct ggggaccccc taccctgctt 
aaaeggacac aggtctgatc tgaggagact aggaagacat 
tctgagggaa tatggcccag tctggggcag tgtgagtgtt 
cctgaggttc tctgccccct cccgtcatgg ctgcaacccc 



ctg gag teg gec cca 
Leu Glu Ser Ala Pro 
10 

tec agg gec egg gca 
Ser Arg Ala Arg Ala 
30 

act cct gtg ttc atg 
Thr Pro Val Phe Met 
45 

acg acc gaa cag ctg 
Thr Thr Glu Gin Leu 
60 

acc tac cat ctg ggt 
Thr Tyr His Leu Gly 
75 

aac ggt etc cac ggc 
Asn Gly Leu His Gly 
90 

age ggc ggt ttc cag 
Ser Gly Gly Phe Gin 
110 

gag gag ggc gtc cgc 
Glu Glu Gly Val Arg 
125 

ctg age ccg gag aaa 
Leu Ser Pro Glu Lys 
14 0 

ate ate atg cag ctg 
lie lie Met Gin Leu 

155 
cgt gtg gag 
Arg Val Glu 
170 

ttgcagccca tcagcggccg 
aegcagatet ccgggccacc 
ecateggggg ectgageggg 
gcacctctcg getgecgaag 
atagggaagc cagagcccta 
ggggaggtgg catttggggg 
ggctgtccct tgggggecat 
gggaggggee ctgggaagcc 
agetatgeca ctgatctggt 



49 



97 



145 



193 



241 



289 



337 



385 



433 



481 



523 



583 
643 
703 
763 
823 
883 
943 
1003 
1063 



WO 02/083898 PCT/IB01/00914 

120 

agtctgcgtg gctcttggat gtgacatgtt cgactgcgtc ttccccacac ggacagcgcg 1123 

ctttggctct gccctggtgc ccactgggaa cctgcagttg aggaagaagg tgtttgagaa 1183 

ggacttcggc cccatagacc cggagtgcac ctgccccacg tgccaaaagc acagccgcgc 1243 

cttcctgcac gcactgctgc acagtgacaa cacggccgcg ctgcaccacc tcacggtcca 1303 

caacatcgcc taccagctgc agctcatgag cgccgtccgc accagcatcg tggagaagcg 1363 

cttcccggac ttcgtgcggg acttcatggg cgccatgtac ggggatccca ccctctgtcc 1423 

cacctgggcc actgacgctc tggcctctgt gggaatcaca ctgggctgac ctggcattgg 1483 

gagagggagg gaggaaggaa gggagggagg ggctggaaga tactgaagga ttcctttttg 1543 

aaaggttttt tttattgtaa cttaaaaaaa aaaaaaaaa 1582 



<210> 156 
<211> 104 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 45. . 104 



<400> 156 

agagctgagc tgaagcggga cccggagccc gagcagccgc cgcc atg gca ate aaa 56 

Met Ala lie Lys 
1 

ttt ctg gaa gtc ate aag ccc ttc tgt gtc ate ctg caa eta aca tct 104 
Phe Leu Glu Val lie Lys Pro Phe Cys Val lie Leu Gin Leu Thr Ser 
5 10 " 15 20 



<210> 157 
<211> 150 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 2. .70 



<220> 

<221> polyA__site 
<222> 135. .150 



<220> 

<221> polyA_signal 
<222> 115. . 120 



<400> 157 

t ttt etc aac aat tec tea ccg cag gag cct ctg aag etc cca cca ggc 4 9 

Phe Leu Asn Asn Ser Ser Pro Gin Glu Pro Leu Lys Leu Pro Pro Gly 
15 10 15 

cag etc tec tec ace cac cgc taactctcag ccccagtcac cctcttggag 100 

Gin Leu Ser Ser Thr His Arg 
20 

cttccctgct ttgaattaaa gaccactcat gcgcaaaaaa aaaaaaaaaa , 150 



<210> 158 
<211> 510 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 150. .284 



<220> 
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<221> polyA_site 
<222> 495. . 510 



<220> 

<221> polyA_signal 
<222> 475. .480 



<400> 158 

aaccaaagca tgggctcttt gtttcctctt tgccaggcct gggtccttcc cagtgacccc 
tcagccttca cttcctcctc actcccgggg agctgtgaca ggaagaggag atcccagaac 
tggagattga cgtggatgag ctcctggac atg gag agt gac gat gcc egg get 

Met Glu Ser Asp Asp Ala Arg Ala 
1 5 
gcc agg gtc aag gag ctg ctg gtt gac tgt tac aaa ccc aca gag gcc 
Ala Arg Val Lys Glu Leu Leu Val Asp Cys Tyr Lys Pro Thr Glu Ala 

10 15 20 

ttc att tyt ggc ctg ctg gac aag ate egg ggc atg cag aag ctg age 
Phe lie Xaa Gly Leu Leu Asp Lys lie Arg Gly Met Gin Lys Leu Ser 
25 30 35 40 

aca ccc cag aag aag tgagggtccc cgacccagga gaacggtggc tcccacagga 
Thr Pro Gin Lys Lys 
45 

caatcgctgc cccccaacct cgtagcaaca geaatacegg gggaccctgc ggccaggcct 
ggtgccatga gcagggctcc tcgtgcccct ggcccagggg tctcttcccc tgccccctca 
gttttccact tttggggttt ttttattgtt attaaactga tgggacttct aaaaaaaaaa 
aaaaaa 



60 
120 
173 



221 



269 



324 



384 
444 
504 
510 



<210> 159 
<211> 390 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 10. .336 



<220> 

<221> polyA__site 
<222> 373. . 390 



<220> 

<221> polyA_signal 
<222> 354 . .359 



<400> 159 

agtctgaag atg gcg gcc tea gca gcg cga ggt get gcg gcg ctg cgt aga 51 
Met Ala Ala Ser Ala Ala Arg Gly Ala Ala Ala Leu Arg Arg 







1 








5 










10 










agt 


ate 


aat 


cag 


ccg 


gtt 


get 


ttt 


gtg 


aga 


aga 


att 


cct 


tgg 


act 


gcg 


99 


Ser 


He 


Asn 


Gin 


Pro 


Val 


Ala 


Phe 


Val 


Arg 


Arg 


He 


Pro 


Trp 


Thr 


Ala 




15 










20 










25 










30 




gcg 


teg 


agt 


cag 


ctg 


aaa 


gaa 


cac 


ttt 


gca 


cag 


ttc 


ggc 


cat 


gtc 


aga 


147 


Ala 


Ser 


Ser 


Gin 


Leu 


Lys 


Glu 


His 


Phe 


Ala 


Gin 


Phe 


Gly 


His 


Val 


Arg 












35 










40 










45 






agg 


tgc 


att 


tta 


cct 


ttt 


gac 


aag 


gag 


act 


ggc 


ttt 


cac 


aga 


ggt 


ttg 


195 


Arg 


Cys 


He 


Leu 


Pro 


Phe 


Asp 


Lys 


Glu 


Thr 


Gly 


Phe 


His 


Arg 


Gly 


Leu 










50 










55 










60 








ggt 


tgg 


gtt 


cag 


ttt 


tct 


tea 


gaa 


gaa 


gga 


ctt 


egg 


aat 


gca 


eta 


caa 


243 


Gly 


Trp 


Val 


Gin 


Phe 


Ser 


Ser 


Glu 


Glu 


Gly 


Leu 


Arg 


Asn 


Ala 


Leu 


Gin 








65 










70 










75 










cag 


gaa 


aat 


cat 


att 


ata 


gat 


gga 


gta 


aag 


gtc 


cag 


gtt 


cac 


act 


aga 


291 


Gin 


Glu 


Asn 


His 


He 


He 


Asp 


Gly 


Val 


Lys 


Val 


Gin 


Val 


His 


Thr 


Arg 





80 85 90 
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agg cca aaa ctt ccg caa aca tct gat gat gaa aag aaa gat ttt 336 
Arg Pro Lys Leu Pro Gin Thr Ser Asp Asp Glu Lys Lys Asp Phe 
95 ^ 100 105 

tgagactgca gcctattaat aaagttaaca taactgacaa aaaaaaaaaa aaaa 390 



<210> 160 
<211> 398 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 23. . 175 



<220> 

<221> polyA_site 
<222> 383. .398 



<220> 

<221> polyA_signal 
<222> 365. .370 



<400> 160 

gagaaggtgc tttagtctga ag atg gcg gcc tea gca gcg cga ggt get gcg 52 

Met Ala Ala Ser Ala Ala Arg Gly Ala Ala 
15 10 
gcg ctg cat aag tat caa tea gcc ggt tgc ttt tgt gag aag aat tec 100 
Ala Leu His Lys Tyr Gin Ser Ala Gly Cys Phe Cys Glu Lys Asn Ser 

15 20 25 

ttg gac tgc ggc gtc gag tea get gaa aga aca ctt tgc aca gtt egg 148 
Leu Asp Cys Gly Val Glu Ser Ala Glu Arg Thr Leu Cys Thr Val Arg 

30 35 40 

cca tgt cag aag gtg cat ttt acc ttt tgacaaggag actggctttc 195 
Pro Cys Gin Lys Val His Phe Thr Phe 

4 5 50 
acagaggttt gggttgggtt cagttttctt cagaagaagg actteggaat gcactacaac 255 
aggaaaatca tattatagat ggagtaaagg tccaggttca cactagaagg ccaaaacttc 315 
cgcaaacatc tgatgatgaa aagaaagatt tttgagactg cagectatta ataaagttaa 375 
cataaccaaa aaaaaaaaaa aaa 398 



<210> 161 
<211> 425 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 127. . 423 



<220> 

<221> polyA_site 
<222> 394 . . 425 



<400> 161 

c cca gtc aga ggg ctg aac atg tec gag ttt gtc tgt aac ctg tea gca 49 
Pro Val Arg Gly Leu Asn Met Ser Glu Phe Val Cys Asn Leu Ser Ala 
15 10 15 

agg cct tat gtg tac gac etc att gcc gtg tec aat cat tat gga gcc 97 
Arg Pro Tyr Val Tyr Asp Leu lie Ala Val Ser Asn His Tyr Gly Ala 

20 25 30 

atg ggg gtt ggc cac tac taaagcagct tatgtgctat tttaccaacg 145 
Met Gly Val Gly His Tyr 
35 
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tcgagatgat gaattttata agacaccttc acttagcagt tctggttcct ctgatggagg 205 

gacacgacca agcagctctc agcagggctt tggggatgat gaggcttgca gcatggacac 265 

caactaatgc tgactccacg atcctgccac cctgtagcgc cagtgtaatc ccccaggaga 325 

acatctttga cactctgcag actgctagtg ttctgtctaa aaaccagaca aggaaatacc 385 

cttcttttat gagcagaagg aawcaaaaaa aaaaaaaaaa 425 



<210> 162 
<211> 438 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 3. . 41 



<220> 

<221> polyA_site 
<222> 419. .438 



<400> 162 

ag atg tta ggc ttt gcc aaa aac tgg ate ttt agt ate act tgagcactca 
Met Leu Gly Phe Ala Lys Asn Trp lie Phe Ser lie Thr 
15 10 
ttggacacaa gaggtcaact gttttgtcat ttgtgtagta agaacaaaat aaagtgtgtg 
gcattaaaga aataacactg tctggccggg atcggtggct cacgcctgta atcccagcac 
tttgggaggc egaggeggge agatcacgag gtcaggagtt cgagaccagc ctgaccaaca 
tgatgaaacc ctgtctctac taaaaataca aaaattagee aggcatggtg gtgcacacct 
gtagtcccag ctgctcagga ggctgaggca ggagaatege tcgaacccgg gaggtggagg 
ttgcagtgag ctgagatggc gccactgcat tccaggcctg gggggtgaca gagcaagact 
cagtctcaga aaaaaaaaaa aaaaaag 



51 



111 
171 
231 
291 
351 
411 
438 



<210> 163 
<211> 436 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 3. .41 



<220> 

<221> polyA_site 
<222> 419. .436 



<400> 163 

ag atg tta ggc ttt gcc aaa aac tgg ate ttt agt ate act 
Met Leu Gly Phe Ala Lys Asn Trp lie Phe Ser lie Thr 
15 10 
ttggacacaa gaggtcaact gttttgtcat ttgtgtagta agaacaaaat 
gcattaaaga aataacactg tctggccggg atcggtggct cacgcctgta 
tttgggaggc egaggeggge agatcacgag gtcaggagtt cgagaccagc 
tgatgaaacc ctgtctctac taaaaataca aaaattagee aggcatggtg 
gtagtcccag ctgctcagga ggctgaggca ggagaatege tcgaacccgg 
ttgcagtgag ctgagatggc gccactgcat tccaggcctg gggggtgaca 
cagtctcaga aaaaaaaaaa aaaaa 



tgagcactca 



aaagtgtgtg 
atcccagcac 
ctgaccaaca 
gtgcacacct 
gaggtggagg 
gagcaagact 



51 



111 
171 
231 
291 
351 
411 
436 



<210> 166 
<211> 436 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
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<222> 3. . 41 
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<220> 

<221> polyA_site 
<222> 421 . .436 



<400> 166 

ag atg tta ggc ttt gcc aaa aac tgg ate ttt agt ate act 
Met Leu Gly Phe Ala Lys Asn Trp lie Phe Ser lie Thr 
15 10 
ttggacacaa gaggtcaact gttttgtcat ttgtgtagta agaacaaaat 
gcattaaaga aataacactg tctggccggg atcggtggct cacgcctgta 
tttgggaggc egaggeggge agatcacgag gtcaggagtt cgagaccagc 
tgatgaaacc ctgtctctac taaaaataca aaaattagee aggcatggtg 
gtagtcccag ctgctcagga ggctgaggca ggagaatege tcgaacccgg 
ttgcagtgag ctgagatggc gccactgcat tccaggcctg gggggtgaca 
cagtctcaga aaaaaaaaaa aaaaa 



tgagcactca 



aaagtgtgtg 
atcccagcac 
ctgaccaaca 
gtgcacacct 
gaggtggagg 
gagcaagact 



51 



111 
171 
231 
291 
351 
411 
436 



<210> 167 
<211> 436 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 3. . 41 

<220> 

<221> polyA_site 
<222> 419. .436 



<400> 167 

ag atg tta ggc ttt gcc aaa aac tgg ate ttt agt ate act 
Met Leu Gly Phe Ala Lys Asn Trp lie Phe Ser lie Thr 
15 10 
ttggacacaa gaggtcaact gttttgtcat ttgtgtagta agaacaaaat 
gcattaaaga aataacactg tctggccggg atcggtggct cacgcctgta 
tttgggaggc egaggeggge agatcacgag gtcaggagtt cgagaccagc 
tgatgaaacc ctgtctctac taaaaataca aaaattagee aggcatggtg 
gtagtcccag ctgctcagga ggctgaggca ggagaatege tcgaacccgg 
ttgcagtgag ctgagatggc gccactgcat tccaggcctg gggggtgaca 
cagtctcaga aaaaaaaaaa aaaaa 



tgagcactca 



taagtgtgtg 
atcccagcac 
ctgaccaaca 
gtgcacacct 
gaggtggagg 
gagcaagact 



51 



111 
171 
231 
291 
351 
411 
436 



<210> 168 
<211> 394 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 10. . 336 



<220> 

<221> polyA_site 
<222> 373. . 394 



<220> 

<221> polyA_signal 
<222> 354 . . 359 



<400> 168 

agtctgaag atg gcg gcc tea gca gcg cga ggt get gcg gcg ctg cgt aga 
Met Ala Ala Ser Ala Ala Arg Gly Ala Ala Ala Leu Arg Arg 



51 
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1 








5 


agt 


ate 


aat 


cag 


ccg 


gtt 


get 


Ser 


lie 


Asn 


Gin 


Pro 


Val 


Ala 


15 










20 




gcg 


teg 


agt 


cag 


ctg 


aaa 


gaa 


Ala 


Ser 


Ser 


Gin 


Leu 


Lys 


Glu 










35 






agg 


tgc 


att 


tta 


cct 


ttt 


gac 


Arg 


Cys 


He 


Leu 


Pro 


Phe 


Asp 








50 








ggt 


tgg 


gtt 


cag 


ttt 


tct 


tea 


Gly 


Trp 


Val 


Gin 


Phe 


Ser 


Ser 



ttt gtg aga 
Phe Val Arg 



65 



cag gaa 
Gin Glu 
80 

agg cca 
Arg Pro 
95 

tgagactg 



aat cat 
Asn His 



att 
He 



ata gat 
He Asp 

85 
caa aca 
Gin Thr 
100 

ca gectattaat aaagtt 



cac 
His 

aag 
Lys 

gaa 

Glu 

70 

gga 

Gly 



ttt gca 
Phe Ala 
40 

gag act 
Glu Thr 
55 

gaa gga 
Glu Gly 

gta aag 
Val Lys 



aga 

Arg 

25 

cag 

Gin 

ggc 
Gly 

ctt 
Leu 

gtc 
Val 



10 

atw cct tgg 
He Pro Trp 

ttc ggc cat 
Phe Gly His 



aaa ctt ccg 
Lys Leu Pro 



tct gat gat gaa 
Ser Asp Asp Glu 
105 

aaca taactgaaaa 



ttt 
Phe 

egg 
Arg 

cag 

Gin 

90 

aag 

Lys 



cac aga 
His Arg 
60 

aat gca 
Asn Ala 
75 

gtt cac 
Val His 

aaa gat 
Lys Asp 



act gcg 
Thr Ala 

30 
gtc aga 
Val Arg 
45 

ggt ttg 
Gly Leu 

eta caa 
Leu Gin 

act aga 
Thr Arg 

ttt 
Phe 



aaaaaaaaaa aaggtcaa 



99 



147 



195 



243 



291 



336 



394 



<210> 169 
<211> 772 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 595. 



.771 



<400> 169 

atataggegt 

ttacgcgcac 

tatggacege 

acagccctga 

ctccggtggc 

gagtcggccc 

gaaaggtaaa 

ctgcagcacc 

ctgagtcagc 

cagaeggaag 



ggctgaatac 
aegtaegtte 
tagcatctgt 
cacgagccgg 
ctcgctgggt 
tgaacgccca 
gaaaccaagt 
agctgtcccc 
cgaatcctag 
cagcagacca 



gggcctcctg 
ctcatgaaag 
cactggccac 
gtgctctctc 
cgegaggagg 
cgact ctcag 
ggaagagtgt 
gccctactcc 
ccccttccct 
tcaaggctac 



cgtacgtgcg 
ggacgacggg 
eggtttcegg 
ttctcaccgc 
eggaggactg 
ggtccagagg 
ttcctcctct 
ggaccgcccc 
tgggectget 
gggaggcccg 



tggtgtacgt 
agetgeatga 
gagtaagegg 
ggcccacgtc 
tactctgagg 
ccgtgagacc 
ggccgtaaag 
aaagactcca 
gtggtgctcg 
gggcgcttgc 



aag ttt ggc tgc etc tec ttc egg cag cct tat 
Lys Phe Gly Cys Leu Ser Phe Arg Gin Pro Tyr 

5 10 
aat gga ate aag act gtg gag acg cgc tgg cgt 
Asn Gly He Lys Thr Val Glu Thr Arg Trp Arg 

20 25 
cag egg aac tgt ace ate gee gtc cac att get 
Gin Arg Asn Cys Thr He Ala Val His He Ala 

35 " 40 

ggc gat gee tgt egg gag ctg ctg gtg gag a 
Gly Asp Ala Cys Arg Glu Leu Leu Val Glu 
50 55 



get ggc ttt 
Ala Gly Phe 
15 

cct ctg ytg 
Pro Leu Leu 
30 

cac agg gac 
His Arg Asp 
45 



acgtgcgtga 
aagccgaagt 
cagctacctt 
tcctcgctgg 
ccaaaagcca 
ggccgcggct 
caggtactct 
tgggatggac 
acatcagtga 
gaag atg 

Met 

1 

gtc tta 
Val Leu 

age age 
Ser Ser 

tgg gaa 
Trp Glu 



60 
120 
180 
240 
300 
360 
420 
480 
540 
597 



645 



693 



741 



772 



<210> 170 
<211> 128 
<212> PRT 
<213> Homo sapiens 



<220> 

<221> SIGNAL 



WO 02/083898 

<222> -34 . .-1 
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<220> 

<221> UNSURE 
<222> 7 

<223> Xaa = Ile,Thr 



<400> 170 



Met 


Ser 


Leu 


Arg 


Leu 


Asp 


Thr 


Thr 


Pro 


Ser 


Cys 


Asn 


Ser 


Ala 


Arg 


Pro 










-30 










-25 










-20 




Leu 


His 


Ala 


Leu 


Gin 


Val 


Leu 


Leu 


Leu 


Leu 


Ser 


Leu 


Leu 


Leu 


Thr 


Ala 








-15 










-10 










-5 






Leu 


Ala 


Ser 


Ser 


Thr 


Lys 


Gly 


Gin 


Xaa 


Lys 


Arg 


Asn 


Leu 


Ala 


Lys 


Gly 






1 








5 










10 










Lys 


Asp 


Glu 


Ser 


Leu 


Asp 


Ser 


Asp 


Leu 


Tyr 


Ala 


Glu 


Leu 


Arg 


Cys 


Met 


15 










20 










25 










30 


Cys 


He 


Lys 


Thr 


Thr 


Ser 


Gly 


He 


His 


Pro 


Lys 


Asn 


He 


Gin 


Ser 


Leu 










35 










40 










45 




Glu 


Val 


He 


Gly 


Lys 


Gly 


Thr 


His 


Cys 


Asn 


Gin 


Val 


Glu 


Val 


He 


Ala 








50 










55 










60 






Thr 


Leu 


Lys 


Asp 


Gly 


Arg 


Lys 


He 


Cys 


Leu 


Asp 


Pro 


Asp 


Ala 


Pro 


Arg 






65 










70 










75 








He 


Lys 


Lys 


He 


Val 


Gin 


Lys 


Lys 


Leu 


Ala 


Gly 


Asp 


Glu 


Ser 


Ala 


Asp 
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<210> 171 
<211> 160 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -18. .-1 

<400> 171 



Met 


Lys 


Ala 


Ala 


Val 


Leu 


Thr 


Leu 


Ala 


Val 


Leu 


Phe 


Leu 


Thr 


Gly 


Ser 








-15 










-10 










-5 






Gin 


Ala 


Arg 


His 


Phe 


Trp 


Gin 


Gin 


Asp 


Glu 


Pro 


Pro 


Gin 


Ser 


Pro 


Trp 






1 








5 










10 










Asp 


Arg 


Val 


Lys 


Asp 


Leu 


Ala 


Thr 


Val 


Tyr 


Val 


Asp 


Val 


Leu 


Lys 


Asp 


15 










20 










25 










30 


Ser 


Gly 


Arg 


Asp 


Tyr 


Val 


Ser 


Gin 


Phe 


Glu 


Gly 


Ser 


Ala 


Leu 


Gly 


Lys 










35 










40 










45 




Gin 


Leu 


Asn 


Leu 


Lys 


Leu 


Leu 


Asp 


Asn 


Trp 


Asp 


Ser 


Val 


Thr 


Ser 


Ala 








50 










55 










60 






Phe 


Ser 


Asn 


Leu 


Arg 


Glu 


Gin 


Leu 


Gly 


Pro 


Val 


Thr 


Gin 


Glu 


Phe 


Trp 






65 










70 










75 








Asp 


Asn 


Leu 


Glu 


Lys 


Glu 


Thr 


Glu 


Gly 


Leu 


Arg 


Gin 


Glu 


Met 


Ser 


Lys 




80 










85 










90 










Asp 


Leu 


Glu 


Glu 


Val 


Lys 


Ala 


Lys 


Val 


Gin 


Pro 


Tyr 


Leu 


Asp 


Asp 


Phe 


95 










100 










105 










110 


Gin 


Lys 


Lys 


Trp 


Gin 


Glu 


Glu 


Met 


Glu 


Leu 


Tyr 


Arg 


Gin 


Lys 


Ala 


Ala 










115 










120 










125 




Phe 


Leu 


Thr 


He 


Leu 


Thr 


Ser 


Leu 


Gly 


Pro 


Asn 


Gly Asn 


Lys 


Ala 


Phe 








130 










135 










140 







<210> 172 
<211> 80 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
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<222> -26. .-1 
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<220> 

<221> UNSURE 
<222> 52 

<223> Xaa = Leu, Pro 



<400> 172 
Met Gly Arg Ala 
-25 

Ala Leu Leu Leu 
-10 

Thr Ser Ser Asn 
10 

Asn Pro Thr Asn 
25 

Thr Ala Ser Leu 
40 



Met Val Ala Arg 
-20 

Pro Thr Gin lie 
-5 

Ser Ser Gin Ser 

Ala Thr Thr Lys 
30 

Phe Val Val Ser 
45 



Leu Gly Leu Gly 
-15 

Tyr Ser Ser Glu 
1 

Thr Ser Asn Ser 
15 

Val Ala Gly Gly 

Leu Ser Leu Leu 

50 



Leu Leu Leu Leu 

Thr Thr Thr Gly 
5 

Gly Leu Ala Pro 
20 

Ala Leu Gin Ser 
35 

His Xaa Tyr Ser 



<210> 173 
<211> 101 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -27 . . -1 

<223> Von Heijne matrix 

score 11.8486806626293 
seq LFLGLLLLPLVVA/FA 



<400> 173 
Met Ser Ser Ala 
-25 

Leu Gly Leu Leu 
-10 

Ala Glu Glu Asp 

Gin Val Arg Pro 
25 

Pro His Cys Pro 
40 

Lys lie Cys Leu 
55 

Lys Leu Leu Glu 
70 



Ala Gly Phe Cys 
-20 

Leu Leu Pro Leu 
-5 

Gly Asp Leu Gin 
10 

Arg His lie Thr 

Thr Ala Gin Leu 
45 

Asp Leu Gin Ala 
60 

Ser 



Ala Ser Arg Pro 

Val Val Ala Phe 
1 

Cys Leu Cys Val 
15 

Ser Leu Glu Val 
30 

lie Ala Thr Leu 

Pro Leu Tyr Lys 
65 



Gly Leu Leu Phe 
-15 

Ala Ser Ala Glu 
5 

Lys Thr Thr Ser 
20 

lie Lys Ala Gly 
35 

Lys Asn Gly Arg 
50 

Lys lie lie Lys 



<210> 174 
<211> 101 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -27 . .-1 

<223> Von Heijne matrix 

score 11.8486806626293 
seq LFLGLLLLPLVVA/FA 



<400> 174 

Met Ser Ser Ala Ala Gly Phe Cys Ala Ser Arg Pro Gly Leu Leu Phe 

-25 -20 -15 

Leu Gly Leu Leu Leu Leu Pro Leu Val Val Ala Phe Ala Ser Ala Glu 
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-10 -5 15 

Ala Glu Glu Asp Gly Asp Leu Gin Cys Leu Cys Val Lys Thr Thr Ser 

10 ~ 15 20 

Gin Val Arg Pro Arg His lie Thr Ser Leu Glu Val lie Lys Ala Gly 

25 30 35 

Pro His Cys Pro Thr Ala Gin Leu lie Ala Thr Leu Lys Asn Gly Arg 

40 45 50 

Lys lie Cys Leu Asp Leu Gin Ala Pro Leu Tyr Lys Lys lie lie Lys 

55 60 65 

Lys Leu Leu Glu Ser 
70 

<210> 175 
<211> 86 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -18. .-1 

<400> 175 



Met lie Pro 


Ala 


Val 


Val 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Val 


Glu 


Gin 


Ala 




-15 










-10 










-5 






Ala Ala Leu 


Gly 


Glu 


Pro 


Gin 


Leu 


Cys 


Tyr 


lie 


Leu 


Asp 


Ala 


He 


Leu 


1 








5 










10 










Phe Leu Tyr 


Gly 


He 


Val 


Leu 


Thr 


Leu 


Leu 


Tyr 


Cys 


Arg 


Leu 


Lys 


He 


15 






20 










25 










30 


Gin Val Arg 


Lys 


Ala 


Ala 


lie 


Thr 


Ser 


Tyr 


Glu 


Lys 


Ser 


Asp 


Gly 


Val 






35 










40 










45 




Tyr Thr Gly 


Leu 


Ser 


Thr 


Arg 


Asn 


Gin 


Glu 


Thr 


Tyr 


Glu 


Thr 


Leu 


Lys 




50 










55 










60 






His Glu Lys 


Pro 


Pro 


Gin 






















65 




























<210> 176 




























<211> 92 




























<212> PRT 




























<213> Homo : 


sapiens 
























<220> 




























<221> SIGNAL 


























<222> -24 . .■ 


-1 


























<220> 




























<221> UNSURE 


























<222> 52 




























<223> Xaa = 


Arg, Thr 
























<400> 176 




























Met Val Met 


Gly 


Leu 


Gly 


Val 


Leu 


Leu 


Leu 


Val 


Phe 


Val 


Leu 


Gly 


Leu 






-20 










-15 










-10 




Gly Leu Thr 


Pro 


Pro 


Thr 


Leu 


Ala 


Gin 


Asp 


Asn 


Ser 


Arg 


Tyr 


Thr 


His 




-5 










1 








5 








Phe Leu Thr 


Gin 


His 


Tyr 


Asp 


Ala 


Lys 


Pro 


Gin 


Gly 


Arg 


Asp 


Asp 


Arg 


10 








15 










20 










Tyr Cys Glu 


Ser 


He 


Met 


Arg 


Arg 


Arg 


Gly 


Leu 


Thr 


Ser 


Pro 


Cys 


Lys 


25 






30 










35 










40 


Asp He Asn 


Thr 


Phe 


He 


His 


Gly 


Asn 


Lys 


Arg 


Xaa 


Ser 


Arg 


Pro 


Ser 






45 










50 










55 




Val Lys Thr 


Arg 


Met 


Glu 


Thr 


Leu 


Thr 


Glu 


Lys 


Thr 











60 65 
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<210> 177 
<211> 111 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -16. .-1 



<400> 177 
Met Leu Leu lie 
-15 

Gin Asp Leu Asp 
1 

Ser Asp Gly Gly 
20 

Pro Pro Leu Gly 
35 

Gin Asn Asp Gly 
50 

Gin Gin Gly Pro 
65 

Gin Gly Gly Arg 



Leu Leu Ser Val 
-10 

Glu Asp Val Ser 
5 

Asp Ser Glu Gin 

Gly Gin Gin Ser 
40 

Pro Gin Gin Gly 
55 

Pro Pro Pro Gin 
70 

Pro Gin Gly Pro 
8 5 



Ala Leu Leu Ala 
-5 

Gin Glu Asp Val 
10 

Phe lie Asp Glu 
25 

Gin Pro Ser Ala 

Pro Pro Gin Gin 
60 

Gly Lys Pro Gin 
75 

Pro Gin Gly Gin 
90 



Phe Ser Ser Ala 

Pro Leu Val lie 
15 

Glu Arg Gin Gly 
30 

Gly Asp Gly Asn 
45 

Gly Gly Gin Gin 

Gly Pro Pro Pro 
80 

Ser Pro Gin 
95 



<210> 178 
<211> 106 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -16. .-1 



<400> 178 
Met Leu Leu lie 
-15 

Gin Asp Leu Asp 
1 

Ser Asp Gly Gly 
20 

Pro Pro Leu Gly 
35 

Gin Asp Asp Gly 
50 

Gin Gin Gly Pro 
65 

Gin Gly Gly Gin 



Leu Leu Ser Val 
-10 

Glu Asp Val Ser 
5 

Asp Ser Glu Gin 

Gly Gin Gin Ser 
40 

Pro Gin Gin Gly 
55 

Pro Pro Pro Gin 
70 

Ser Cys Cys Cys 
85 



Ala Leu Leu Ala 
-5 

Gin Glu Asp Val 
10 

Phe lie Asp Glu 

25 

Gin Pro Ser Ala 

Pro Pro Gin Gin 
60 

Gly Lys Pro Gin 
7 5 

Asp Lys 
90 



Phe Ser Ser Ala 

Pro Leu Val lie 
15 

Glu Arg Gin Gly 
30 

Gly Asp Gly Asn 
45 

Gly Gly Gin Gin 

Gly Pro Pro Gin 
80 



<210> 179 
<211> 141 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -24 . .-1 



<400> 179 

Met Pro Ser Ser Val Ser Trp Gly lie Leu Leu Leu Ala Gly Leu Cys 

-20 ' -15 -10 

Cys Leu Val Pro Val Ser Leu Ala Glu Asp Pro Gin Gly Asp Ala Ala 
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-5 










1 








5 








Gin Lys Thr 


Asp 


Thr 


Ser 


His 


His 


Asp 


Gin 


Asp 


His 


Pro 


Thr 


Phe 


Asn 


10 








15 










20 










Lys lie Thr 


Pro 


Asn 


Leu 


Ala 


Glu 


Phe 


Ala 


Phe 


Ser 


Leu 


Tyr 


Arg 


Gin 


25 






30 










35 










40 


Leu Ala His 


Gin 


Ser 


Asn 


Ser 


Thr 


Asn 


He 


Phe 


Phe 


Ser 


Pro 


Val 


Ser 






45 










50 










55 




lie Ala Thr 


Ala 


Phe 


Ala 


Met 


Leu 


Ser 


Leu 


Gly 


Thr 


Lys 


Ala 


Asp 


Thr 




60 










65 










70 






nib riSp blU 


Tip 
J. -L 


Leu 


ri I ,-, 
ulU 


biy 


Leu 








Leu 


1 il-L 


O-L LI 


Tip 


IT A- \J 


75 










80 










85 








Glu Ala Gin 


He 


His 


Glu 


Gly 


Phe 


Gin 


Glu 


Leu 


Leu 


Arg 


Thr 


Leu 


Asn 


90 








95 










100 










Gin Pro Asp 


Ser 


Gin 


Leu 


Gin 


Leu 


Thr 


Thr 


Gly 


Lys 


Asn 








105 






1 1 n 

± ± u 










lis 

J. ± D 












<210> 180 




























<211> 124 




























<212> PRT 




























<213> Homo sapiens 
























<220> 




























<221> SIGNAL 


























<222> -30. .- 


-1 


























<400> 180 




























Met Pro Ala 


Cys 


Arg 


Leu 


Gly 


Pro 


Leu 


Ala 


Ala 


Ala 


Leu 


Leu 


Leu 


Ser 


-30 






-25 










-20 










-15 


Leu Leu Leu 


Phe 


Gly 


Phe 


Thr 


Leu 


Val 


Ser 


Gly 


Thr 


Gly 


Ala 


Glu 


Lys 






-10 










-5 










1 




Thr Gly Val 


Cys 


Pro 


Glu 


Leu 


Gin 


Ala 


Asp 


Gin 


Asn 


Cys 


Thr 


Gin 


Glu 


5 










10 










15 








Cys Val Ser 


Asp 


Ser 


Glu 


Cys 


Ala 


Asp 


Asn 


Leu 


Lys 


Cys 


Cys 


Ser 


Ala 


20 








25 










30 










Gly Cys Ala 


Thr 


Phe 


Cys 


Ser 


Leu 


Pro 


Asn 


Asp 


Lys 


Glu 


Gly 


Ser 


Cys 


35 






40 










45 










50 


Pro Gin Val 


Asn 


He 


As n 


IT 1 1 fc? 


U L KJ 


CI "1 n 


Leu 




Leu 




A T~y-T 

rAJ. y 


7\ o f"! 


\j _L 1 1 






55 










60 










65 




Cys Gin Val 


Asp 


Ser 


Gin 


Cys 


Pro 


Gly 


Gin 


Met 


Lys 


Cys 


Cys 


Arg 


Asn 




70 










75 










80 






Gly Cys Gly 


Lys 


Val 


Ser 


Cys 


Val 


Thr 


Pro 


Asn 


Phe 










85 




























<210> 181 




























<211> 124 




























<212> PRT 




























<213> Homo . 


sapiens 
























<220> 




























<221> SIGNAL 


























<222> -30 . . 


-1 


























<400> 181 




























Met Pro Ala 


Cys 


Arg 


Leu 


Gly 


Pro 


Leu 


Ala 


Ala 


Ala 


Leu 


Leu 


Leu 


Ser 


-30 






-25 










-20 










-15 


Leu Leu Leu 


Phe 


Gly 


Phe 


Thr 


Leu 


Val 


Ser 


Gly 


Thr 


Gly 


Ala 


Glu 


Lys 






-10 










-5 










1 




Thr Gly Val 


Cys 


Pro 


Glu 


Leu 


Gin 


Ala 


Asp 


Gin 


Asn 


Cys 


Thr 


Gin 


Glu 


5 










10 










15 








Cys Val Ser 


Asp 


Ser 


Glu 


Cys 


Ala 


Asp 


Asn 


He 


Lys 


Cys 


Cys 


Ser 


Ala 


20 








25 










30 










Gly Cys Ala 


Thr 


Phe 


Cys 


Ser 


Leu 


Pro 


Asn 


Asp 


Lys 


Glu 


Gly 


Ser 


Cys 


35 






40 










45 










50 



WO 02/083898 



Pro Gin Val Asn lie 
55 

Cys Gin Val Asp Ser 
70 

Gly Cys Gly Lys Val 
85 



Asn Phe Pro Gin 

Gin Cys Pro Gly 
75 

Ser Cys Val Thr 
90 



131 

Leu Gly Leu Cys Arg 
60 

Gin Met Lys Cys Cys 
80 

Pro Asn Phe 
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Asp Gin 
65 

Arg Asn 



<210> 182 
<211> 69 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -17. .-1 

<223> Von Heijne matrix 

score 9.5148977432302 
seq FVFLLFVISLAAA/AH 



<400> 182 
Met Phe Pro Gly 
-15 

Ala Ala His Leu 
1 

Lys Val Cys Ser 

Glu Gly Ala Asn 
35 

His His Lys Gly 
50 



Phe Val Phe Leu 
-10 

Trp Val Leu Ala 
5 

Phe Thr Pro Glu 
20 

Asn Ser Arg Arg 
Leu 



Leu Phe Val lie 

Ala Phe Met Gly 
10 

Ala Ser Lys Thr 
25 

Thr Ala Phe Lys 
40 



Ser Leu Ala Ala 
-5 

Arg lie Thr Val 
15 

Val Ser Pro Pro 
30 

Ser Cys Asn Thr 
45 



<210> 183 
<211> 77 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -33. .-1 



<220> 

<221> UNSURE 
<222> 6 

<223> Xaa = Lys, Met 



<400> 183 












Met 


Gly 


Pro 


Val 
-30 


Lys 


Gin 


Leu 


Lys 


He 


Ala 


Thr 
-15 


He 


Met 


Val 


Leu 


Leu 
-10 


Ala 


Phe 
1 


Trp 


Trp 


His 


Asn 
5 


Xaa 


Gly 


Gin 


Ser 


Leu 


Ala 


Leu 
20 


Thr 


Trp 


Tyr 


Arg 


Asp 


Ala 


Val 
35 


Lys 


Lys 


Cys 


Phe 



Arg Met Phe Glu Pro Thr Arg Leu 
-25 -20 
Ser Phe Ala Leu Thr Leu Cys Ser 
-5 

Leu Ala Leu He Phe Cys He Leu 

10 15 
Ser Leu Ser Phe He Pro Phe Ala 

25 30 
Ala Val Cys Leu Ala 
40 



<210> 184 
<211> 146 
<212> PRT 

<213> Homo sapiens 



<220> 
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<221> SIGNAL 
<222> -26. .-1 



<400> 184 
Met Thr Met Arg 
-25 

Trp Ala Phe Cys 
-10 

Phe Ser Gin Pro 
10 

Gin Glu His He 
25 

Ala Glu Met Ser 
40 

Asp Tyr Asp Gly 
55 

He Thr His Val 

Ser Glu Asp Glu 
90 

Asp Lys Asn Asn 
105 

Leu Gin 
120 



Ser Leu Leu Arg 
-20 

Ala Pro Gly Ala 
-5 

Gly Ser Met Gly 

Met Glu His Leu 
30 

Pro Gin Glu Leu 
45 

Asn Asn Leu Leu 
60 

His Lys Glu Glu 
75 

Leu lie Asn He 

Asp Gly Tyr He 
110 



Thr Pro Phe Leu 
-15 

Arg Ala Glu Glu 
1 

Leu Asp Lys Asn 
15 

Glu Gly Val He 

Gin Leu His Tyr 
50 

Asp Gly Leu Glu 
65 

Gly Ser Glu Gin 
80 

He Asp Gly Val 
95 

Asp Tyr Ala Glu 



Cys Gly Leu Leu 

Pro Ala Ala Ser 
5 

Thr Val His Asp 
20 

Asn Lys Pro Glu 
35 

Phe Lys Met His 

Leu Ser Thr Ala 
70 

Ala Pro Leu Met 
85 

Leu Arg Asp Asp 
100 

Phe Ala Lys Ser 
115 



<210> 185 
<211> 62 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -46. .-1 

<223> Von Heijne matrix 

score 8.64329745298384 
seq AVLLLILFAIVFG/LL 



<400> 185 

Met Asp Gin Leu Val Phe Lys Glu Thr He Trp Asn Asp Ala Phe Trp 

-45 -40 -35 

Gin Asn Pro Trp Asp Gin Gly Gly Leu Ala Val He He Leu Phe He 
-30 ~ -25 " -20 -15 

Thr Ala Val Leu Leu Leu He Leu Phe Ala He Val Phe Gly Leu Leu 

-10 -5 1 

Thr Ser Thr Glu Asn Thr Gin Cys Glu Ala Gly Glu Glu Glu 
5 10 15 



<210> 186 
<211> 62 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -46. . -1 

<223> Von Heijne matrix 

score 8.64329745298384 
seq AVLLLILFAIVFG/LL 



<400> 186 

Met Asp Gin Leu Val Phe Lys Glu Thr He Trp Asn Asp Ala Phe Trp 

-45 -40 -35 

Gin Asn Pro Trp Asp Gin Gly Gly Leu Ala Val He He Leu Phe He 
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-30 -25 
Thr Ala Val Leu Leu Leu lie Leu Phe 
-10 

Thr Ser Thr Glu Asn Thr Gin Cys Glu 
5 10 
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-20 -15 
Ala lie Val Phe Gly Leu Leu 
-5 1 
Ala Gly Glu Glu Glu 
15 



<210> 187 
<211> 162 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -47. .-1 



<400> 187 
Met Arg lie Ala 
-45 

Gly Ala Gly Trp 
-30 

Thr Ser Leu Ala 
-15 

Met Gin Met Tyr 
5 

Gin Gly Gly Leu 
20 

Phe Asn Asn Leu 
35 

lie Phe Pro Glu 
50 

Gly Leu lie His 

Val Gly Leu Tyr 
85 

Ala Ala Pro Val 
100 

Arg Asn 
115 



Asn Arg Thr Arg 
-40 

Thr His Gly Arg 
-25 

Leu Ser Ser Leu 
-10 

Ser Arg Gin Leu 

Leu Gly Ser Gly 
25 

Glu Asn Leu Val 
40 

lie Leu Leu Cys 
55 

Arg Val Cys Val 
70 

Tyr lie Asn Lys 

Leu Thr Pro Ala 
105 



Phe Ser Ser Pro 

Gly Met Met Val 
-20 

Arg Ser Leu Leu 
-5 

Ala Ser Thr Glu 
10 

Leu Phe Val Phe 

Phe Gly Lys Gly 
45 

Leu Leu Leu Ala 
60 

Thr Thr Cys Phe 
7 5 

lie Ser Ser Thr 
90 

Lys Val Thr Gly 



Phe Leu Ala Arg 
-35 

Val Gly Thr Gly 

Leu Phe Ala Gly 
1 

Trp Leu Thr lie 
15 

Ser Leu Thr Ala 
30 

Phe Gin Ala Lys 

Leu Phe Ala Ser 
65 

lie Phe Ser Met 
80 

Leu Tyr Gin Ala 
95 

Lys Ser Lys Lys 
110 



<210> 188 
<211> 90 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -35. .-1 



<400> 188 

Met Glu Thr Gly 

-35 

Met Arg Arg lie 

Ser Trp Ala Ala 
1 

lie Val Ser Asp 
15 

Phe Gin Leu Leu 
30 

Ser Met Cys Phe 



Ala Ser Ala Ser 
-30 

Trp Ser Leu Gly 
-15 

Leu Phe His Asp 

5 

Leu Ser Thr Val 
20 

Asp Val Trp Thr 
35 

Val Phe Leu Leu 
50 



lie Pro Glu Leu 
-25 

Leu Gly Leu Val 
-10 

Gly Phe Ala Val 

Arg Phe Val Ala 
25 

Arg Asn Phe Arg 
40 

Leu Pro 
55 



lie Cys Glu Ala 
-20 

Thr Leu Thr Ala 
-5 

Leu Gly Gly Asn 
10 

Gin Gin Gin His 

Lys Pro Leu Gly 
4 5 



<210> 189 
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<211> 146 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -25. .-1 

<220> 

<221> UNSURE 
<222> 60 

<223> Xaa = Lys,Arg 



<400> 189 



L v Je l Miry Leu 


Leu 


\j J. 11 


Leu 


Leu 


Phe 


Arg 


Ala 




Prn 

LT -L. Kj 


r\-L a 


Th t 
i in 




Leu 


-25 






-20 










-15 










-10 


Leu Val Leu 


Cys 


Leu 


Gin 


Leu 


Gly Ala Asn 


Lys 


Ala 


Gin 


Asp 


Asn 


Thr 






-5 










1 








5 






Arg Lys lie 


He 


He 


Lys 


Asn 


Phe 


Asp 


He 


Pro 


Lys 


Ser 


Val 


Arg 


Pro 


10 










15 


















Asn Asp Glu 


Val 


Thr 


Ala 


Val 


Leu 


Ala 


Val 


Gin 


Thr 


Glu 


Leu 


Lys 


Glu 


25 








30 










35 










Cys Met Val 


Val 


Lys 


Thr 


Tyr 


Leu 


He 


Ser 


Ser 


He 


Pro 


Leu 


Gin 


Gly 


40 






45 










50 










55 


Ala Phe Asn 


Tyr 


Xaa 


Tyr 


Thr 


Ala 


Cys 


Leu 


Cys 


Asp 


Asp 


Asn 


Pro 


Lys 






60 










65 










70 




Thr Phe Tyr 


Trp 


Asp 


Phe 


Tyr 


Thr 


Asn 


Arg 


±111. 


Val 


Gin 


He 


Ala 


Ala 




75 










80 










85 






Val Val Asp 


Val 


He 


Arg 


Glu 


Leu 


Gly 


He 


Cys 


Pro 


Asp 


Asp 


Ala 


Ala 


90 










95 










100 








Val He Pro 


He 


Lys 


Asn 


Asn 


Arg 


Phe 


Tyr 


Thr 


He 


Glu 


He 


Leu 


Lys 


105 








110 










115 










Val Glu 




























120 




























<210> 190 




























<211> 120 




























<212> PRT 




























<213> Homo sapiens 
























<220> 




























<221> SIGNAL 


























<222> -32. 


-1 


























<223> Von Heijne matrix 






















score 


7 . 64815044221151 


















seq VTLGIGFFALASA/LW 




















<400> 190 




























Met Ala Ser 


Pro 


Lys 


Gly 


Phe 


Phe 


Asn 


Tyr 


Leu 


Thr 


Tyr 


Phe 


Leu 


Ala 


-30 










-25 










-20 








Ala Gly Ala 


Val 


Thr 


Leu 


Gly 


He 


Gly 


Phe 


Phe 


Ala 


Leu 


Ala 


Ser 


Ala 


-15 








-10 










-5 










Leu Trp Phe 


Leu 


He 


Cys 


Lys 


Arg 


Arg 


Glu 


He 


Phe 


Gin 


Asn 


Ser 


Lys 


1 




5 










10 










15 




Phe Lys Ala 


He 


Asp 


Glu 


Arg 


Cys 


Arg 


Gin 


Arg 


Pro 


Ser 


Met 


Ala 


Lys 




20 










25 










30 






He Lys Ser 


His 


Ser 


Gin 


Cys 


Val 


Phe 


He 


Ser 


Arg 


Asn 


Phe 


His 


Thr 


35 










40 










45 








Gly Arg Phe 


Gin 


Leu 


Gin 


Gin 


Leu 


Lys 


He 


He 


Leu 


Lys 


Met 


Asn 


Pro 


50 








55 










60 










Asn Leu Gin 


Gin 


Lys 


He 


Ser 


Phe 


Val 


He 


Pro 


Gin 


Arg 


Pro 


Ala 


Pro 


65 






70 










75 










80 


Gin Gin He 


Ala 


Ala 


Val 


Leu 


His 
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<210> 191 
<211> 78 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -19. .-1 



<400> 191 

Met Lys Phe Phe 

lie Ser Ala Asp 
1 

Phe His Glu Lys 
15 

Glu Glu Ser Ser 
30 

Tyr Thr Leu Cys 



Val Phe Ala Leu 
-15 

Ser His Glu Lys 
5 

His His Ser Tyr 
20 

Lys Ser Asn Ala 
35 

Phe Arg lie Leu 
50 



Val Leu Ala Leu 
-10 

Arg His His Gly 

His lie Thr Leu 

25 

Asn Glu Lys His 
40 

Ala Phe Ser lie 
55 



Met lie Ser Met 
-5 

Tyr Arg Arg Lys 
10 

Leu Pro Leu Phe 

Tyr Asn Leu Leu 
45 

Val Thr 



<210> 192 
<211> 87 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -20. .-1 

<223> Von Heijne matrix 

score 7.42708462116258 

seq LLVFLAGFPVLDA/ND 

<400> 192 

Met Gin Lys Val Thr Leu Gly Leu Leu Val Phe Leu Ala Gly Phe Pro 
-20 "* -15 -10 -5 

Val Leu Asp Ala Asn Asp Leu Glu Asp Lys Asn Ser Pro Phe Tyr Tyr 

1 5 10 

Asp Trp His Ser Leu Gin Val Gly Gly Leu lie Cys Ala Gly Val Leu 

15 20 25 

Cys Ala Met Gly lie lie lie Val Met Ser Ala Lys Cys Lys Cys Lys 

30 35 40 

Phe Gly Gin Lys Ser Gly His His Pro Gly Glu Thr Pro Pro Leu lie 
45 50 55 60 

Thr Pro Gly Ser Ala Gin Ser 
65 



<210> 193 
<211> 105 
<212> PRT 
<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -19. .-1 

<223> Von Heijne matrix 

score 7.20796835452081 
seq VSIMLLLVTVSDC/AV 



<220> 

<221> unsure 
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<222> 37 

<223> Xaa = * , Glu 
<400> 193 



Met Arg Gly Ala 


Thr 


Arg 


Val 


Ser 


He 


Met 


Leu 


Leu 


Leu 


Val 


Thr 


Val 






-15 










-10 










-5 




Ser Asp Cys 


Ala 


Val 


He 


Thr 


Gly 


Ala 


Cys 


Glu 


Arg 


Asp 


Val 


Gin 


Cys 




1 








5 










10 








Gly Ala Gly 


Thr 


_y o 


Cys 


Ala 


-L1C 


Q y 




T rn 
lip 


Leu 


Arg 


o _l _y 


Leu 


Arg 


15 








20 










25 










Met Cys Thr 


Pro 


Leu 


Gly 


Arg 


Xaa 


Gly 


Glu 


Glu 


Cys 


His 


Pro 


Gly 


Ser 


30 






35 










40 










45 


His Lys lie 


Pro 


Phe 


Phe 


Arg 


Lys 


Arg 


Lys 


His 


His 


Thr 


Cys 


Pro 


Cys 






50 










55 










60 




Leu Pro Asn 


Leu 


Leu 


Cys 


Ser 


Arg 


Phe 


Pro 


Asp 


Gly 


Arg 


Tyr 


Arg 


Cys 




65 










70 










75 






Ser Met Asp 


Leu 


Lys 


Asn 


He 


Asn 


Phe 
















80 










85 


















<210> 194 




























<211> 105 




























<212> PRT 




























<213> Homo sapiens 
























<220> 




























<221> SIGNAL 


























<222> -19. 


-1 


























<223> Von Heijne matrix 






















score 


7.20796835452081 


















seq VSIMLLLVTVSDC/AV 




















<400> 194 




























Met Arg Gly 


Ala 


Thr 


Arg 


Val 


Ser 


He 


Met 


Leu 


Leu 


Leu 


Val 


Thr 


Val 






-15 










-10 










-5 




Ser Asp Cys 


Ala 


Val 


He 


Thr 


Gly 


Ala 


Cys 


Glu 


Arg 


Asp 


Val 


Gin 


Cys 




1 








5 










10 








Gly Ala Gly 


Thr 


Cys 


Cys 


Ala 


He 


Ser 


Leu 


Trp 


Leu 


Arg 


Gly 


Leu 


Arg 


15 








20 










25 










Met Cys Thr 


Pro 


Leu 


Gly Arg 


Glu 


Gly 


Glu 


Glu 


Cys 


His 


Pro 


Gly 


Ser 


30 






35 










40 










45 


His Lys lie 


Pro 


Phe 


Phe 


Arg 


Lys 


Arg 


Lys 


His 


His 


Thr 


Cys 


Pro 


Cys 






50 










55 










60 




Leu Pro Asn 


Leu 


Leu 


Cys 


Ser 


Arg 


Phe 


Pro 


Asp 


Gly Arg 


Tyr 


Arg 


Cys 




65 










70 










75 






Ser Met Asp 


Leu 


Lys 


Asn 


He 


Asn 


Phe 
















80 










85 



















<210> 


195 


<211> 


102 


<212> 


PRT 


<213> 


Homo sapiens 


<220> 




<221> 


SIGNAL 


<222> 


-36. .-1 


<220> 




<221> 


UNSURE 


<222> 


21 


<223> 


Xaa = He, Met 



<220> 

<221> UNSURE 
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<222> 52 

<223> Xaa = He, Ser 



<400> 195 



Met Ser Asn 


Thr 


Thr 


Val 


Pro 


Asn 


Ala 


Pro 


Gin 


Ala 


Asn 


Ser 


Asp 


Ser 


-35 








-30 










-25 










Met Val Gly 


Tyr 


Val 


Leu 


Gly 


Pro 


Phe 


Phe 


Leu 


He 


Thr 


Leu 


Val 


Gly 


-20 






-15 










-10 










-5 


v aJL Vd-L v a. _L 


a l * 

n±a 


val 


V a _L 


L 1 ti L- 


i yr 


V Cl _L 


\3 JL 1 1 


-i-i _y o 


-i—t _y o 


J-i y o 


Arg 


v a _L 








1 








5 










10 






Arg Leu Arg 


His 


His 


Leu 


Leu 


Pro 


Xaa 


Tyr 


Ser 


Tyr 


Asp 


Pro 


Ala 


Glu 


15 










20 










25 








Glu Leu His 


Glu 


Ala 


Glu 


Gin 


Gin 


Leu 


Leu 


Ser 


Asp 


Met 


Gly 


Asp 


Pro 










35 










40 










Lys Val Val 


His 


Gly 


Trp 


Gin 


Xaa 


Gly 


Tyr 


Gin 


His 


Lys 


Arg 


Met 


Pro 


45 






50 










55 










60 


Leu Leu Asp 


Val 


Lys 


Thr 


























65 
























<210> 196 




























<211> 137 




























<212> PRT 




























<213> Homo ; 


sapiens 
























<220> 




























<221> SIGNAL 


























<222> -18 . . • 


-1 


























<400> 196 




























Met Lys Ala 


Leu 


He 


Val 


Leu 


Gly 


Leu 


Val 


Leu 


Leu 


Ser 


Val 


Thr 


Val 




-15 










-10 










-5 






Gin Gly Lys 


Val 


Phe 


Glu 


Arg 


Cys 


Glu 


Leu 


Ala 


Arg 


Thr 


Leu 


Lys 


Arg 


1 








5 










10 










Leu Gly Met 


Asp 


Gly 


Tyr 


Arg 


Gly 


He 


Ser 


Leu 


Ala 


Asn 


Trp 


Met 


Cys 


15 






20 










25 










30 


Leu Ala Lys 


Trp 


Glu 


Ser 


Gly 


Tyr 


Asn 


Thr 


Arg 


Ala 


Thr 


Asn 


Tyr 


Asn 






35 










40 










45 




Ala Gly Asp 


Arg 


Ser 


J. 1 1 .L 




i yr 


uiy 


Tip 


±r 1 1 fc: 




11c 


rlD 1 1 


o t; i_ 






50 










55 










60 






Tyr Trp Cys 


Asn 


Asp 


Gly 


Lys 


Thr 


Pro 


Gly 


Ala 


Val 


Asn 


Ala 


Cys 


His 


65 










70 










75 








Leu Ser Cys 


Ser 


Gly 


Trp 


His 


Gly 


Glu 


He 


Val 


Val 


Lys 


Thr 


Glu 


Met 


80 








85 










90 










Ser Val Ser 


Met 


Phe 


Lys 


Val 


Val 


Glu 


Cys 


Asn 


Ser 


Arg 


He 


Phe 


Leu 


95 






100 










105 










110 


Leu Gin Leu 


He 


Leu 


Ser 


Leu 


Ser 


His 




















115 
























<210> 197 




























<211> 101 




























<212> PRT 




























<213> Homo , 


sapiens 
























<220> 




























<221> SIGNAL 


























<222> -29. . 


-1 


























<400> 197 




























Met Arg Glu 


Glu 


Lys 


Lys 


Pro 


Phe 


Glu 


Arg 


Glu 


Arg 


Glu 


Ser 


Val 


Cys 






-25 










-20 










-15 




Val Cys Met 


Cys 


Val 


Phe 


Ser 


Thr 


Gin 


Gly 


Ala 


Leu 


Gly 


Glu 


Met 


Ala 




-10 










-5 










1 






Ala His Phe 


He 


Asp 


Glu 


Lys 


Leu 


Arg 


Pro 


Ser 


Glu 


Gly 


Asn 


Gly 


His 
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5 10 15 

Arg Gly Thr Leu Asp Ser Leu Ser Ser Asp Gin Glu Ser Tyr lie Pro 
20 25 30 35 

Ser Thr Ala Asp Pro Thr Gin Ala Gly Pro Glu Leu Leu His Lys Asn 

40 ~ 45 50 

Leu Pro Val Thr Ser Arg Ser Gin Pro Leu Pro Ser Asp Leu Ala lie 

55 60 65 

Pro Ala Ala Ala Leu 
70 



<210> 198 
<211> 128 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -26. .-1 



<220> 

<221> UNSURE 
<222> 49 

<223> Xaa = Glu, Lys 



<400> 198 



Met 


Cys 


Trp 


Leu 




-25 






Ser 


Leu 


Phe 


Leu 


-10 








He 


His 


Ser 


Pro 








10 


Glu 


Cys 


Ala 


Val 






25 




Cys 


Val 


Trp 


Cys 




40 






Tyr 


Phe 


Gly 


Cys 


55 








Asp 


Met 


Phe 


Gly 


Gly 


Phe 


Val 


Trp 








90 



Arg Ala Trp Gly 
-20 

He Gin Leu Leu 
-5 

Arg Asn Asn Gin 

Lys Lys Ser Cys 
30 

Ser Glu Glu Lys 
45 

Arg Phe Ser Ser 
60 

He Met Met Leu 
75 

Tyr Cys Cys Ala 



Gin He Leu Leu 
-15 

He Ser Phe Ser 
1 

Lys Pro Arg Asp 
15 

Gin Leu Cys Thr 

Ala Cys Xaa Lys 
50 

He Tyr Trp Leu 
65 

Leu Leu He Ala 
80 

Tyr His Phe Tyr 
95 



Pro Val Phe Leu 

Glu Asn Gly Phe 
5 

Gly Asn Glu Glu 
20 

Glu Asp Lys Lys 
35 

Tyr Cys Phe Pro 

Asn Cys Lys Val 
7 0 

Val Leu He Thr 
85 

Leu Gin Asp He 
100 



<210> 199 
<211> 142 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -24 . . -1 



<220> 

<221> UNSURE 
<222> 90 

<223> Xaa = Asp, Glu 
<220> 

<221> UNSURE 
<222> 80 

<223> Xaa = Leu, Arg, Trp 



<220> 

<221> UNSURE 
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<222> 79 

<223> Xaa = Lys, Asn 
<400> 199 

Met Arg Leu Ser Trp Phe Arg Val Leu Thr Val Leu Ser lie Cys Leu 

-20 -15 -10 

Ser Ala Val Ala Thr Ala Thr Gly Ala Glu Gly Lys Arg Lys Leu Gin 

-5 1 5 

lie Gly Val Lys Lys Arg Val Asp His Cys Pro lie Lys Ser Arg Lys 

10 1 15 2 0 

Gly Asp Val Leu His Met His Tyr Thr Gly Lys Leu Glu Asp Gly Thr 
25 30 35 40 

Glu Phe Asp Ser Ser Leu Pro Gin Asn Gin Pro Phe Val Phe Ser Leu 

45 50 55 

Gly Thr Gly Gin Val lie Lys Gly Trp Asp Gin Gly Leu Leu Gly Met 

60 65 70 

Cys Glu Gly Glu Lys Arg Xaa Xaa Val lie Pro Ser Glu Leu Gly Tyr 

7 5 8 0 8 5 

Gly Xaa Arg Gly Ala Pro Pro Lys lie Pro Gly Gly Ala Thr Leu Val 

90 95 100 

Phe Glu Val Glu Leu Leu Lys lie Glu Arg Arg Thr Glu Leu 
105 110 115 



<210> 200 
<211> 70 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -24 . .-1 

<223> Von Heijne matrix 

score 6.54728084640774 
seq ILLTRPACLGSWA/EI 



<400> 200 




























Met 


Ala 


Ser 


Ser 


Thr 

-20 


Ser 


Val 


Ser 


Thr 


Gly 
-15 


Gin 


He 


Leu 


Leu 


Thr 

-10 


Arg 


Pro 


Ala 


Cys 


Leu 
-5 


Gly 


Ser 


Trp 


Ala 


Glu 
1 


He 


Arg 


Ser 


Pro 
5 


Val 


Arg 


Thr 


He 


Ser 
10 


He 


Ala 


Ser 


Asp 


Phe 
15 


Pro 


Thr 


Ala 


Arg 


Val 
20 


Ser 


Leu 


Trp 


Val 


Pro 


Pro 


Ala 


Pro 


Gly 


Met 


Val 


Pro 


He 


Lys 


He 


Ser 


Gly 


Cys 


Ala 


Asn 


25 










30 










35 










40 


Trp 


Ala 


Phe 


Ser 


Pro 
45 


Ala 























<210> 201 
<211> 113 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -23. . -1 



<220> 

<221> UNSURE 
<222> 8 

<223> Xaa = His , Lys, Asn, Gin 



<400> 201 

Met Ala Gly Pro Ala Ala Ala Phe Arg Arg Leu Gly Ala Leu Ser Gly 
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-20 










-15 










-10 






Ala Ala Ala 


Leu 


Gly 


Phe 


Ala 


Ser 


Tyr 


Gly Ala 


His 


Gly Ala 


Xaa 


Phe 


-5 










1 








5 










Pro Asp Ala 


Tyr 


Gly 


Lys 


Glu 


Leu 


Phe 


Asp 


Lys 


Ala 


Asn 


Lys 


His 


His 


10 






15 










20 










25 


Phe Leu His 


Ser 


Leu 


Ala 


Leu 


Leu 


Gly 


Val 


Pro 


His 


Cys 


Arg 


Lys 


Pro 






30 










35 










40 




Leu Trp Ala 


Gly 


Leu 


Leu 


Leu 


Ala 


Ser 


Glv 


Thr 


Thr 


Leu 


Phe 


Cys 


Thr 




45 










50 










55 








fur 


Gin 


Ala 


Leu 


Ser 


Gly 


Asp 


Pro 


Ser 


He 


Gin 


Thr 


Leu 


60 










65 










70 








Ala Pro Ala 


Gly 


Gly 


Thr 


Leu 


Leu 


Leu 


Leu 


Gly 


Trp 


Leu 


Ala 


Leu 


Ala 


75 








80 










85 










Leu 




























90 




























<210> 202 




























<211> 113 




























<212> PRT 




























<213> Homo sapiens 
























<220> 




























<221> SIGNAL 


























<222> -23. .- 


•1 


























<220> 




























<221> UNSURE 


























<222> 8 




























<223> Xaa = 


Lys , Gin 
























<400> 202 




























Met Ala Gly 


Pro 


Ala 


Ala 


Ala 


Phe 


Arg 


Arg 


Leu 


Gly 


Ala 


Leu 


Ser 


Gly 




-20 










-15 










-10 






Ala Ala Ala 


Leu 


Gly 


Phe 


Ala 


Ser 


Tyr 


Gly Ala 


His 


Gly Ala 


Xaa 


Phe 


-5 










1 








5 










Pro Asp Ala 


Tyr 


Gly 


Lys 


Glu 


Leu 


Phe 


Asp 


Lys 


Ala 


Asn 


Lys 


His 


His 


10 






15 










20 










25 


Phe Leu His 


Ser 


Leu 


Ala 


Leu 


Leu 


Gly 


Val 


Pro 


His 


Cys 


Arg 


Lys 


Pro 






30 










35 










40 




Leu Trp Ala 


Gly 


Leu 


Leu 


Leu 


Ala 


Ser 


Gly 


Thr 


Thr 


Leu 


Phe 


Cys 


Thr 




45 










50 










55 






Ser Phe Tyr 


Tyr 


Gin 


Ala 


Leu 


Ser 


Gly 


Asp 


Pro 


Ser 


He 


Gin 


Thr 


Leu 


60 










65 










70 








Ala Pro Ala 


Gly 


Gly 


Thr 


Leu 


Leu 


Leu 


Leu 


Gly 


Trp 


Leu 


Ala 


Leu 


Ala 



75 80 85 

Leu 
90 



<210> 203 
<211> 54 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 

<222> -17 . . -1 

<223> Von Heijne matrix 

score 6.48133858229514 
seq LTFAQLLFATVLG/IA 



<400> 203 

Met Phe Arg Arg Leu Thr Phe Ala Gin Leu Leu Phe Ala Thr Val Leu 
-15 -10 -5 
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Gly lie Ala Gly Gly Val Tyr lie Phe Gin Pro Val Phe Glu Gin Tyr 

15 10 15 

Ala Lys Asp Gin Lys Glu Leu Lys Glu Lys Met Gin Leu Val Gin Glu 

20 25 30 

Ser Glu Glu Lys Lys Ser 
35 



<210> 204 
<211> 83 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -15. .-1 



<400> 204 



Met 


Arg 


Leu 


Phe 


Leu 


Ser 


Leu 


Pro 


Val Leu 


Val 


Val 


Val 


Leu 


Ser 


He 


-15 










-10 








-5 










1 


Val 


Leu 


Glu 


Gly 


Pro 


Ala 


Pro 


Ala 


Gin Gly 


Thr 


Pro 


Asp 


Val 


Ser 


Ser 








5 










10 








15 






Ala 


Leu 


Asp 


Lys 


Leu 


Lys 


Glu 


Phe 


Gly Asn 


Thr 


Leu 


Glu 


Asp 


Lys 


Ala 






20 










25 








30 








Arg 


Glu 


Leu 


He 


Ser 


Arg 


He 


Lys 


Gin Ser 


Glu 


Leu 


Ser 


Ala 


Lys 


Met 




35 










40 








45 










Arg 


Glu 


Trp 


Phe 


Ser 


Glu 


Thr 


Phe 


Gin Lys 


Val 


Lys 


Asp 


Lys 


Leu 


Lys 


50 










55 








60 










65 



He Asp Ser 



<210> 205 
<211> 98 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -43. .-1 



<400> 205 
Met Phe Arg He 
-40 

Lys Arg Lys Met 
-25 

He Tyr Val Ala 
-10 

Cys Asp Thr Ser 

Gly His Arg Gly 
25 

Gly Ser He Tyr 
4 0 

His He 
55 



Glu Gly Leu Ala 

Arg Glu Asp Val 
-20 

Leu Leu Arg Val 
-5 

Gly Glu Leu Thr 
10 

Leu Arg Thr Glu 

Leu Lys Glu He 
4 5 



Pro Lys Leu Asp 
-35 

He Ser Ser He 

Ser Glu Cys Leu 
1 

Asp Gly His Pro 
15 

Leu Asn Gly Ser 
30 

Gly Gin His Met 



Pro 


Glu 


Glu 


Met 




-30 






Arg 


Asn 


Phe 


Leu 


-15 








Pro 


Gly 


Cys 


Asp 








5 


Leu 


Thr 


Leu 


Arg 






20 




Gly 


Glu 


Gin 


Gly 




35 






Lys 


Thr 


Gly 


His 


50 









<210> 206 
<211> 82 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -43. . -1 

<223> Von Heijne matrix 
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score 5.65876793443964 
seq VVSFALIATLVYA/LF 



<400> 206 
Met Ala Lys Tyr 
-40 

Ser Val He Glu 
-25 

Ser Phe Ala Leu 
-10 

His Gin Asn He 

Glu Gin Leu Gin 
25 

Leu His 



Gin Gly Glu Val 

Gly Val Ser Asp 
-20 

He Ala Thr Leu 
-5 

His Pro Glu Asn 
10 

Thr Glu Gin Asp 



Gin Ser Leu Lys 
-35 

Gin Val Leu Val 

Val Tyr Ala Leu 
1 

Gin Glu Leu Val 
15 

Ala Pro Ala Asp 
30 



Leu Asp Asp Asp 
-30 

Ala Val Val Val 
-15 

Phe Arg Asn Val 
5 

Arg Val Leu Arg 
20 

Ser Thr Ala Val 
35 



<210> 207 
<211> 82 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -43. .-1 



<400> 207 
Met Ala Lys Tyr 
-4 0 

Ser Val He Glu 
-25 

Ser Phe Ala Leu 
-10 

His Gin Asn He 

Glu Gin Leu Gin 
25 

Leu His 



Gin Gly Glu Val 

Gly Val Ser Asp 
-20 

He Ala Thr Leu 
-5 

His Pro Glu Asn 
10 

Thr Glu Gin Asp 



Gin Ser Leu Lys 
-35 

Gin Val Leu Val 

Val Tyr Ala Leu 
1 

Gin Glu Leu Val 
15 

Ala Pro Ala Asp 
30 



Leu Asp Asp Asp 
-30 

Ala Val Val Val 
-15 

Phe Arg Asn Val 
5 

Arg Val Leu Arg 
20 

Ser Thr Ala Val 
35 



<210> 208 
<211> 82 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -43. .-1 



<220> 

<221> UNSURE 
<222> 32 

<223> Xaa - Ala, Gly 



<400> 208 
Met Ala Lys Tyr 
-40 

Ser Val He Glu 
-25 

Ser Phe Ala Leu 
-10 

His Gin Asn lie 

Glu Gin Leu Gin 
25 



Gin Gly Glu Val 

Gly Val Ser Asp 
-20 

He Ala Thr Leu 
-5 

His Pro Glu Asn 
10 

Thr Glu Gin Asp 



Gin Ser Leu Lys 
-35 

Gin Val Leu Val 

Val Tyr Ala Leu 

1 

Gin Glu Leu Val 
15 

Ala Pro Xaa Asp 
30 



Leu Asp Asp Asp 
-30 

Ala Val Val Val 
-15 

Phe Arg Asn Val 
5 

Arg Val Leu Arg 
20 

Ser Thr Ala Val 
35 
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<210> 209 
<211> 82 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -43. .-1 



<400> 209 
Met Ala Lys Tyr 
-40 

Ser Val lie Glu 
-25 

Ser Phe Ala Leu 
-10 

His Gin Asn lie 

Glu Gin Leu Gin 
25 

Leu His 



Gin Gly Glu Val 

Gly Val Ser Asp 
-20 

He Ala Thr Leu 
-5 

His Pro Glu Asn 
10 

Thr Glu Gin Asp 



Gin Ser Leu Lys 
-35 

Gin Val Leu Val 

Val Tyr Ala Leu 
1 

Gin Glu Leu Val 
15 

Ala Pro Ala Asp 
30 



Leu Asp Asp Asp 
-30 

Ala Val Val Val 
-15 

Phe Arg Asn Val 

5 

Arg Val Leu Arg 
20 

Ser Thr Ala Val 

35 



<210> 210 
<211> 99 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -28. .-1 



<220> 

<221> UNSURE 

<222> 2,16 

<223> Xaa = Phe, Ser 



<400> 210 
Met Thr Ser Ala 
-25 

Gly Tyr Leu Val 
-10 

Leu Glu Asn Asp 
5 

Thr Ala Cys Gly 

Arg Leu Ser Val 
4 0 

Gly Gin Arg Val 
55 

He Asp Val 
70 



Leu Thr Gin Gly 

Leu Ser Glu Gly 
-5 

Glu Gin Ala Ala 
10 

Phe Arg Leu His 
25 

Val Phe Gly Glu 

Phe Val Val Lys 
60 



Leu Glu Arg lie 
-20 

Ala Val Leu Ala 

Ser Ala He Xaa 
15 

Arg Gly Met Asn 
30 

His Thr Leu Leu 
45 

Arg Gin Asn Arg 



Pro Asp Gin Leu 
-15 

Ser Xaa Gly Asp 
1 

Glu Leu Val Ser 
20 

Val Pro Phe Lys 
35 

Val Thr Val Ser 
50 

Gly Arg Glu Pro 
65 



<210> 211 
<211> 99 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -28 . . -1 
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<220> 

<221> UNSURE 
<222> 16 

<223> Xaa = Phe,Ser 
<400> 211 



Met 


Thr 


Ser 


Ala 


Leu 


Thr 


Gin 


Gly 


Leu 


Glu 


Arg 


He 


Pro 


Asp 


Gin 


Leu 








-25 










-20 










-15 






Gly 


Tyr 


Leu 


Val 


Leu 


Ser 


Glu 


Gly Ala 


Val 


Leu 


Ala 


Ser 


Ser 


Gly 


Asp 






-10 










-5 










1 








Leu 


Glu 


Asn 


Asp 


Glu 


Gin 


Ala 


Ala 


Ser 


Ala 


He 


Xaa 


Glu 


Leu 


Val 


Ser 


5 










10 










15 










20 


Thr 


Ala 


Cys 


Gly 


Phe 


Arg 


Leu 


His 


Arg 


Gly 


Met 


Asn 


Val 


Pro 


Phe 


Lys 










25 










30 










35 




Arg 


Leu 


Ser 


Val 


Val 


Phe 


Gly 


Glu 


His 


Thr 


Leu 


Leu 


Val 


Thr 


Val 


Ser 








40 










45 










50 






Gly Gin 


Arg 


Val 


Phe 


Val 


Val 


Lys 


Arg 


Gin 


Asn 


Arg 


Gly 


Arg 


Glu 


Pro 



55 60 65 



He Asp Val 
70 

<210> 212 
<211> 186 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -15. .-1 

<220> 

<221> UNSURE 
<222> 37 

<223> Xaa = Cys, Gly 
<220> 

<221> UNSURE 
<222> 11 

<223> Xaa = Cys,Trp 
<220> 

<221> UNSURE 
<222> 149 

<223> Xaa = Gly, Arg 



<400> 212 



Met 


His 


Val 


Met 


Ala 


Ala 


Ser 


Met 


Ala 


Arg 


Gly 


Gly 


Val 


Ser 


Ala 


Arg 


-15 










-10 










-5 










1 


Val 


Leu 


Leu 


Gin 


Ala 


Ala 


Arg 


Gly 


Thr 


Xaa 


Trp 


Asn 


Arg 


Pro 


Gly 


Gly 








5 










10 










15 






Thr 


Ser 


Gly 


Ser 


Gly 


Glu 


Gly 


Val 


Ala 


Leu 


Gly 


Thr 


Thr 


Arg 


Lys 


Phe 






20 










25 










30 








Gin 


Ala 


Thr 


Xaa 


Ser 


Arg 


Pro 


Ala 


Gly 


Glu 


Glu 


Asp 


Ala 


Gly 


Gly 


Pro 




35 










40 










45 










Glu 


Arg 


Pro 


Gly 


Asp 


Val 


Val 


Asn 


Val 


Val 


Phe 


Val 


Asp 


Arg 


Ser 


Gly 


50 










55 










60 










65 


Gin 


Arg 


He 


Pro 


Val 


Ser 


Gly 


Arg 


Val 


Gly 


Asp 


Asn 


Val 


Leu 


His 


Leu 










70 










75 










80 




Ala 


Gin 


Arg 


His 


Gly 


Val 


Asp 


Leu 


Glu 


Gly Ala 


Cys 


Glu 


Ala 


Ser 


Leu 








85 










90 










95 






Ala 


Cys 


Ser 


Thr 


Cys 


His 


Val 


Tyr 


Val 


Ser 


Glu 


Asp 


His 


Leu 


Asp 


Leu 






100 










105 










110 








Leu 


Pro 


Pro 


Pro 


Glu 


Glu 


Arg 


Glu 


Asp 


Asp 


Met 


Leu 


Asp 


Met 


Ala 


Pro 
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144 



WO 02/083898 



115 120 
Leu Leu Gin Glu Asn Ser Arg Leu Gly 
130 135 
Glu Leu Glu Xaa Ala Glu Phe Thr Leu 
150 

Tyr Val Asp Gly His Val Pro Lys Pro 
165 170 
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145 

125 

Cys Gin lie Val Leu Thr Pro 
140 145 
Pro Lys lie Thr Arg Asn Phe 
155 160 
His 



<210> 213 
<211> 123 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -48. .-1 



<220> 

<221> UNSURE 
<222> 75 

<223> Xaa = Ala, Asp 



<400> 213 




























Met 


Glu 


Ala 


Leu 


Gly 


Lys 


Leu 


Lys 


Gin 


Phe 


Asp 


Ala 


Tyr 


Pro 


Lys 


Thr 








-45 










-40 










-35 






Leu 


Glu 


Asp 


Phe 


Arg 


Val 


Lys 


Thr 


Cys 


Gly 


Gly 


Ala 


Thr 


Val 


Thr 


He 






-30 










-25 










-20 








Val 


Ser 


Gly 


Leu 


Leu 


Met 


Leu 


Leu 


Leu 


Phe 


Leu 


Ser 


Glu 


Leu 


Gin 


Tyr 




-15 










-10 










-5 










Tyr 


Leu 


Thr 


Thr 


Glu 


Val 


His 


Pro 


Glu 


Leu 


Tyr 


Val 


Asp 


Lys 


Ser 


Arg 


1 








5 










10 










15 




Gly 


Asp 


Lys 


Leu 


Lys 


He 


Asn 


He 


Asp 


Val 


Leu 


Phe 


Pro 


His 


Met 


Pro 








20 










25 










30 






Cys 


Ala 


Tyr 


Leu 


Ser 


He 


Asp 


Ala 


Met 


Asp 


Val 


Ala 


Gly 


Glu 


Gin 


Gin 






35 










40 










45 








Leu 


Asp 


Val 


Glu 


His 


Asn 


Leu 


Phe 


Lys 


Gin 


Arg 


Leu 


Asp 


Lys 


Asp 


Gly 




50 










55 










60 










lie 


Pro 


Val 


Ser 


Ser 


Glu 


Ala 


Glu 


Arg 


His 


Xaa 












65 










70 










75 













<210> 214 
<211> 77 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -29. .-1 



<400> 214 

Met Ala Ala Thr 

Leu Leu Leu Lys 
-10 

Val Ala Phe Ser 
5 

Leu Leu He Ala 
20 

Arg Arg Gly Arg 



Asp Phe Val Gin 
-25 

Leu Gin Arg Leu 

Val He He Leu 
10 

Cys Ser Cys Cys 
25 

Lys Val Gin Val 
40 



Glu Met Arg Ala 
-20 

Pro Gin Ala Glu 
-5 

Phe Thr Ala Thr 
15 

Cys Thr His Cys 
30 

Gin Pro Thr Pro 
45 



Val Gly Glu Arg 
-15 

Pro Val Glu He 
1 

Val Leu Leu Leu 

Cys Cys Pro Glu 
35 

Pro 



<210> 215 
<211> 107 
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<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -27 . .-1 



<400> 215 
Met Leu Phe Glu 
-25 

Ser Thr Leu Ser 
-10 

Leu Leu Ser Pro 

Leu Gly Leu Ala 

25 

Pro Asp Pro Pro 
4 0 

Leu Arg lie Leu 
55 

Gin Pro Val Trp 
70 



Glu Ala Leu Pro 
-20 

Leu Val Gin Phe 
-5 

Gly Val Glu Asn 
10 

Cys Cys Leu Leu 

Glu Gly Pro Gly 
45 

Gin Leu Pro Gly 
60 

Ser Arg Thr Pro 
75 



Leu Ser Cys Ser 

Ser Pro Ser Gly 
1 

Leu Ser Val Leu 
15 

Thr Leu Thr Met 
30 

Thr Gin Arg Val 

Ala Pro Asp Gly 
65 

Asp Leu Lys 
80 



Asp Pro Val Leu 
-15 

Arg Thr Gin Asp 
5 

Asp Val Ser Pro 
20 

Ser Cys Pro Gly 
35 

Trp Gin Gly Ala 
50 

Val Ser Pro Tyr 



<210> 216 
<211> 112 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -39. .-1 



<400> 216 

Met Leu Glu His 

Gin Lys Leu Ala 
-20 

lie Val Gly Phe 
-5 

Val Tyr lie Val 
10 

Pro Pro Trp Pro 

Gin Ala Gin Thr 
45 

Leu Lys lie lie 
60 



Leu Ser Ser Leu 
-35 

Glu Gin Met Phe 

lie Tyr Gly Tyr 

1 

Met Ala Gly Phe 
15 

lie Tyr Arg Arg 
30 

Thr Arg Asn Gin 

Glu Val Phe Met 
65 



Pro Thr Gin Met 
-30 

Gin Gly lie lie 
-15 

Val Ala Glu Gin 

5 

Ala Phe Ser Cys 
20 

His Pro Leu Lys 

35 

Gly Lys Glu Lys 
50 

lie Gin His Leu 



Asp Tyr Lys Gly 
-25 

Leu Phe Ser Ala 
-10 

Phe Gly Trp Thr 

Leu Leu Thr Leu 
25 

Trp Leu Pro Val 
40 

Leu Arg Gly Met 
55 

Leu Leu Phe Leu 
70 



<210> 217 
<211> 71 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -23. .-1 

<223> Von Heijne matrix 

score 5.918755252201 
seq SALMLLPCRPVLT/SV 



<400> 217 

Met Met Asn Asn Gly Leu Leu Gin Gin Pro Ser Ala Leu Met Leu Leu 
-20 -15 -10 
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Pro Cys Arg Pro Val Leu Thr Ser Val Ala Leu Asn Ala Asn Phe Val 

-5 15 
Ser Trp Lys Ser Arg Thr Lys Tyr Thr lie Thr Pro Val Lys Met Arg 
10 15 20 25 

Lys Ser Gly Gly Arg Asp His Thr Gly Ala Gly Asn Val Arg Ser Asn 

30 35 40 

Ser Arg Pro Ser lie Gin Arg 
45 



<210> 218 

<211> 71 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -23. .-1 

<223> Von Heijne matrix 

score 5.918755252201 
seq SALMLLPCRPVLT/SV 



<400> 218 



Met 


Met 


Asn 


Asn 
-20 


Gly 


Leu 


Leu 


Gin 


Gin Pro 
-15 


Ser 


Ala 


Leu 


Met 
-10 


Leu 


Leu 


Pro 


Cys 


Arg 
-5 


Pro 


Val 


Leu 


Thr 


Ser 
1 


Val Ala 


Leu 


Asn 
5 


Ala 


Asn 


Phe 


Val 


Ser 


Trp 


Lys 


Ser 


Arg 


Thr 


Lys 


Tyr 


Thr He 


Thr 


Pro 


Val 


Lys 


Met 


Arg 


10 










15 








20 










25 


Lys 


Ser 


Gly 


Gly 


Arg 
30 


Asp 


His 


Thr 


Gly Ala 
35 


Gly 


Asn 


Val 


Arg 


Ser 
40 


Asn 


Ser 


Arg 


Pro 


Ser 
45 


He 


Gin 


Arg 



















<210> 219 
<211> 92 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 

<222> -15. .-1 

<223> Von Heijne matrix 

score 4.68058603039206 
seq VLAGSLLGPTSRS/AA 



<400> 219 

Met Ala Val Leu 

-15 

Ala Leu Leu Gly 
5 

Pro Asp Ala Trp 
20 

Arg Gly Asn Glu 
35 

Gly Trp Val Arg 
50 

Arg Arg Met Leu 



Ala Gly Ser Leu 
-10 

Gly Arg Trp Leu 

Gly Leu Pro Thr 
25 

Tyr Gin Pro Ser 
40 

Arg Leu Ser Thr 
55 

Lys Gly Arg Lys 
70 



Leu Gly Pro Thr 
-5 

Gin Pro Arg Ala 
10 

Pro Gin Gin Ala 

Asn He Lys Arg 
4 5 

Pro Ala Gly Xaa 
60 

Ser Leu Ser His 
75 



Ser Arg Ser Ala 
1 

Trp Leu Gly Phe 
15 

Arg Gly Lys Ala 
30 

Lys Asn Lys His 

Gin Val He Leu 
65 



<210> 220 
<211> 92 
<212> PRT 

<213> Homo sapiens 



WO 02/083898 



148 



PCT/IB01/00914 



<220> 

<221> SIGNAL 
<222> -15. .-1 



<220> 

<221> UNSURE 
<222> 56 

<223> Xaa = Asn,Ser 



<400> 220 

Met Ala Val Leu 

-15 

Ala Leu Leu Gly 
5 

Pro Asp Ala Trp 
20 

Arg Gly Asn Glu 
35 

Gly Trp Val Arg 
50 

Arg Arg Met Leu 



Ala Gly Ser Leu 
-10 

Gly Arg Trp Leu 

Gly Leu Pro Thr 
25 

Tyr Gin Pro Ser 
40 

Arg Leu Xaa Thr 
55 

Lys Gly Arg Lys 
70 



Leu Gly Pro Thr 
-5 

Gin Pro Arg Ala 
10 

Pro Gin Gin Ala 

Asn lie Lys Arg 
45 

Pro Ala Gly Val 
60 

Ser Leu Ser His 
75 



Ser Arg Ser Ala 
1 

Trp Leu Gly Phe 
15 

Arg Gly Lys Ala 
30 

Lys Asn Lys His 

Gin Val lie Leu 
65 



<210> 221 
<211> 92 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -15. .-1 



<220> 

<221> UNSURE 
<222> 56 

<223> Xaa = Asn, Ser 
<220> 

<221> UNSURE 
<222> 62 

<223> Xaa = Lys, Gin, Arg 



<400> 221 

Met Ala Val Leu 

-15 

Ala Leu Leu Gly 
5 

Pro Asp Ala Trp 
20 

Arg Gly Asn Glu 
35 

Gly Trp Val Arg 
50 

Arg Arg Met Leu 



Ala Gly Ser Leu 
-10 

Gly Arg Trp Leu 

Gly Leu Pro Thr 
25 

Tyr Gin Pro Ser 
40 

Arg Leu Xaa Thr 
55 

Lys Gly Arg Lys 
70 



Leu Gly Pro Thr 
-5 

Gin Pro Arg Ala 
10 

Pro Gin Gin Ala 

Asn lie Lys Arg 
45 

Pro Ala Gly Val 
60 

Ser Leu Ser His 
75 



Ser Arg Ser Ala 
1 

Trp Leu Gly Phe 
15 

Arg Gly Lys Ala 
30 

Lys Asn Lys His 

Xaa Val lie Leu 
65 



<210> 222 
<211> 92 
<212> PRT 

<213> Homo sapiens 



<220> 
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<221> SIGNAL 
<222> -15. .-1 



<220> 

<22l> UNSURE 
<222> 61 

<223> Xaa = * ,Glu,Leu,Val 
<220> 

<221> UNSURE 
<222> 62 

<223> Xaa = Ala, Glu, Pro, Gin 



<220> 

<221> UNSURE 
<222> 56 

<223> Xaa = Asn,Ser 
<220> 

<221> UNSURE 
<222> -6 

<223> Xaa = Asp, Gly 
<220> 

<221> UNSURE 
<222> 6 

<223> Xaa = Gly, Ser 
<220> 

<221> UNSURE 
<222> 63 

<223> Xaa = Phe, Val 



<400> 222 

Met Ala Val Leu 

-15 

Ala Leu Leu Gly 
5 

Pro Asp Ala Trp 
20 

Arg Gly Asn Glu 
35 

Gly Trp Val Arg 
50 

Arg Arg Met Leu 



Ala Gly Ser Leu 
-10 

Xaa Arg Trp Leu 

Gly Leu Pro Thr 
25 

Tyr Gin Pro Ser 
40 

Arg Leu Xaa Thr 
55 

Lys Gly Arg Lys 
7 0 



Leu Xaa Pro Thr 
-5 

Gin Pro Arg Ala 
10 

Pro Gin Gin Ala 

Asn lie Lys Arg 
45 

Pro Ala Gly Xaa 
60 

Ser Leu Ser His 
75 



Ser Arg Ser Ala 
1 

Trp Leu Gly Phe 
15 

Arg Gly Lys Ala 
30 

Lys Asn Lys His 

Xaa Xaa lie Leu 
65 



<210> 223 
<211> 87 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -39. . -1 



<400> 223 

Met Lys Val Leu Leu Leu Thr Gly 
-35 

Tyr Trp Asp Asp Asn Phe Asp Pro 
-20 

Leu Leu Thr Gly Ala Asn Ala Gly 
-5 1 



Leu Gly Ala Leu Phe Phe Ala Tyr 

-30 -25 
Ala Ser Leu Gin Gly Ala Arg Val 
-15 -10 
Val Gly Glu Glu Leu Ala Tyr His 
5 
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Tyr Ala Arg Leu Gly Ser His Leu Val Leu Thr Ala His Thr Glu Ala 
10 15 20 25 

Leu Leu Gin Lys Ala Arg Trp Leu Thr Leu Val Val Ser Thr Leu Gly 

30 35 40 

Gly Arg Gly Lys Trp lie Thr 
45 

<210> 224 
<211> 125 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -15. .-1 

<220> 

<221> UNSURE 
<222> 100 

<223> Xaa = Asn,Thr 
<220> 

<221> UNSURE 
<222> 30 

<223> Xaa = lie, Met 
<400> 224 



Met 


Lys 


Leu 


Leu 


Thr 


His 


Asn 


Leu 


Leu 


Ser 


Ser 


His 


Val 


Arg 


Gly 


Val 


-15 










-10 










-5 










1 


Gly 


Ser 


Arg 


Gly 
5 


Phe 


Pro 


Leu 


Arg 


Leu 
10 


Gin 


Ala 


Thr 


Glu 


Val 
15 


Arg 


He 


Cys 


Pro 


Val 

20 


Glu 


Phe 


Asn 


Pro 


Asn 
25 


Phe 


Val 


Ala 


Arg 


Xaa 
30 


He 


Pro 


Lys 


Val 


Glu 
35 


Trp 


Ser 


Ala 


Phe 


Leu 
40 


Glu 


Ala 


Ala 


Asp 


Asn 
45 


Leu 


Arg 


Leu 


lie 


Gin 


Val 


Pro 


Lys 


Gly 


Pro 


Val 


Glu 


Gly 


Tyr 


Glu 


Glu 


Asn 


Glu 


Glu 


Phe 


50 










55 










60 










65 


Leu 


Arg 


Thr 


Met 


His 
70 


His 


Leu 


Leu 


Leu 


Glu 
75 


Val 


Glu 


Val 


He 


Glu 
80 


Gly 


Thr 


Leu 


Gin 


Cys 
85 


Pro 


Glu 


Ser 


Gly 


Arg 
90 


Met 


Phe 


Pro 


He 


Ser 
95 


Arg 


Gly 


He 


Pro 


Xaa 
100 


Met 


Leu 


Leu 


Ser 


Glu 
105 


Glu 


Glu 


Thr 


Glu 


Ser 
110 









<210> 225 
<211> 93 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -22. .-1 



<400> 225 



Met 


Lys 


He 


Ser 


Val 


Ala 


Ala 


He 


Pro 


Phe 


Phe 


Leu 


Leu 


He 


Thr 


He 






-20 










-15 










-10 








Ala 


Leu 


Gly 


Thr 


Lys 


Thr 


Glu 


Ser 


Ser 


Ser 


Arg 


Gly 


Pro 


Tyr 


His 


Pro 




-5 










1 








5 










10 


Ser 


Glu 


Cys 


Cys 


Phe 


Thr 


Tyr 


Thr 


Thr 


Tyr 


Lys 


He 


Pro 


Arg 


Gin 


Arg 










15 










20 










25 




He 


Met 


Asp 


Tyr 


Tyr 


Glu 


Thr 


Asn 


Ser 


Gin 


Cys 


Ser 


Lys 


Pro 


Gly 


He 








30 










35 










40 






Val 


Phe 


He 


Thr 


Lys 


Arg 


Gly 


His 


Ser 


Val 


Cys 


Thr 


Asn 


Pro 


Ser 


Asp 
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45 50 55 

Lys Trp Val Gin Asp Tyr lie Lys Asp Met Lys Glu Asn 
60 65 70 



<210> 226 
<211> 72 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -42. .-1 



<400> 226 

Met Asn Arg Leu Ala Gly Val Gly 
-40 -35 
Ser Ser Leu Leu Gin Ser Val Glu 

-25 -20 
Glu Val Ala Ala Ala Pro Gly Thr 
-10 -5 
Leu Ser Ala Asp Lys Phe Gin Val 
10 

Gin Thr Leu Met Ser Ser Leu Gly 
25 30 



Trp Arg Val Asp Tyr Thr Leu Ser 
-30 

Glu Pro Met Val His Leu Arg Leu 
-15 

Pro Ala Gin Pro Val Ala Met Ser 

1 5 
Leu Leu Ala Glu Leu Lys Gin Ala 
15 20 



<210> 227 
<211> 84 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -20. .-1 

<223> Von Heijne matrix 

score 4.37180298395146 
seq SLLLSLPPHQGLT/FS 



<400> 227 

Met Ala Ala Ala 

-20 

Gin Gly Leu Thr 

Val Leu His Pro 
15 

Ser Arg Gin Leu 
30 

Leu Cys Gin Thr 
45 

Thr Val Arg Thr 



Ala Val Pro Ser 
-15 

Phe Ser Asn Lys 
1 

Glu Pro Gly Leu 
20 

Arg Val Ala Leu 

35 

Gin Leu Pro Ala 
50 



Leu Leu Leu Ser 
-10 

lie Gin Pro Phe 

5 

Arg Asp Trp Leu 

Pro Glu Lys Gly 
40 

Thr Pro Cys Phe 
55 



Leu Pro Pro His 
-5 

Gly Ala Gin Gly 
10 

Leu Pro Thr Cys 
25 

Ser Glu Gly Ser 

Leu Pro Ser Asn 
60 



<210> 228 

<211> 54 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -19. .-1 



<400> 228 

Met Lys Phe Asn Phe Val Leu Leu lie Leu Ser Lys Val Leu Asp Asp 

-15 -10 -5 

Thr Phe Gin Ser Val Lys Lys Trp Leu Asn Tyr Phe Gin Phe Thr Leu 
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15 10 
Arg Asn Gly Leu Met Trp Pro Gly Ala Val Ala His Ala Cys Asn Pro 

15 " 20 * 25 

Ser Thr Gly Ser Arg Leu 
30 35 



<210> 229 

<211> 54 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -19. .-1 

<223> Von Heijne matrix 

score 4.24406450728052 
seq LLILSKVLDDTFQ/SV 



<400> 229 

Met Lys Phe Asn 

Thr Phe Gin Ser 
1 

Arg Asn Gly Leu 
15 

Ser Thr Gly Ser 
30 



Phe Val Leu Leu 
-15 

Val Lys Lys Trp 
5 

Met Trp Pro Gly 
20 

Arg Leu 
35 



lie Leu Ser Lys 
-10 

Leu Asn Tyr Phe 

Ala Val Ala His 
25 



Val Leu Asp Asp 
-5 

Gin Phe Thr Leu 
10 

Ala Cys Asn Pro 



<210> 230 

<211> 54 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -19. .-1 

<223> Von Heijne matrix 

score 4.24406450728052 
seq LLILSKVLDDTFQ/SV 



<400> 230 

Met Lys Phe Asn 

Thr Phe Gin Ser 
1 

Arg Asn Gly Leu 
15 

Ser Thr Gly Ser 
30 



Phe Val Leu Leu 
-15 

Val Lys Lys Trp 
5 

Met Trp Pro Gly 
20 

Arg Leu 
35 



lie Leu Ser Lys 
-10 

Leu Asn Tyr Phe 

Ala Val Ala His 
25 



Val Leu Asp Asp 
-5 

Gin Phe Thr Leu 
10 

Ala Cys Asn Pro 



<210> 231 

<211> 91 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -19. .-1 



<400> 231 

Met Tyr Gly Lys lie lie Phe Val Leu Leu Leu Ser Glu lie Val Ser 

-15 -10 -5 

lie Ser Ala Leu Ser Thr Thr Glu Val Ala Met His Thr Ser Thr Ser 



WO 02/083898 



1 5 
Ser Ser Val Thr Lys Ser Tyr lie Ser 

15 20 
Gly Gin Leu Val His Arg Phe Thr Val 
30 35 
Leu lie lie Leu Cys Val Met Ala Gly 
50 

lie Ser Tyr Ser lie Arg Arg Leu lie 
65 70 
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10 

Ser Gin Thr Asn Gly Glu Met 

25 

Pro Ala Pro Val Val lie lie 

40 45 
lie lie Gly Thr lie Leu Leu 
55 60 
Lys Ala 



<210> 232 
<211> 114 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -32. .-1 



<400> 232 




























Met 


Gin 


Met 


Tyr 


Ser 


Arg 


Gin 


Leu 


Ala 


Ser 


Thr 


Glu 


Trp 


Leu 


Thr 


He 






-30 










-25 










-20 








Gin 


Gly 


Gly 


Leu 


Leu 


Gly 


Ser 


Gly 


Leu 


Phe 


Val 


Phe 


Ser 


Leu 


Thr 


Ala 




-15 










-10 










-5 










Phe 


Asn 


Asn 


Leu 


Glu 


Asn 


Leu 


Val 


Phe 


Gly 


Lys 


Gly 


Phe 


Gin 


Ala 


Lys 


1 








5 










10 










15 




He 


Phe 


Pro 


Glu 


He 


Leu 


Leu 


Cys 


Leu 


Leu 


Leu 


Ala 


Leu 


Phe 


Ala 


Ser 








20 










25 










30 






Gly 


Leu 


He 


His 


Arg 


Val 


Cys 


Val 


Thr 


Thr 


Cys 


Phe 


He 


Phe 


Ser 


Met 






35 










40 










45 








Val 


Gly 


Leu 


Tyr 


Tyr 


He 


Asn 


Lys 


He 


Ser 


Ser 


Thr 


Leu 


Tyr 


Gin 


Ala 




50 










55 










60 










Ala 


Ala 


Pro 


Val 


Leu 


Thr 


Pro 


Ala 


Lys 


Val 


Thr 


Gly 


Lys 


Ser 


Lys 


Lys 


65 










70 










75 










80 


Arg 


Asn 































<210> 233 
<211> 75 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -35. .-1 



<400> 233 

Met Ala Pro Glu 

-35 

Arg Arg Leu Arg 

Val Ala Ala Leu 
1 

Tyr Ala Asp Phe 
15 

Met Arg Lys Ala 
30 



Val Leu Pro Lys 
-30 

Asn His Met Ala 
-15 

Tyr Lys Phe Arg 
5 

Tyr Arg Asn Tyr 
20 

Gly He Phe Gin 
35 



Pro Arg Met Arg 
-25 

Val Ala Phe Val 
-10 

Val Ala Asp Gin 

Asp Val Met Lys 
2 5 

Ser Val Lys 
40 



Gly Leu Leu Ala 
-20 

Leu Ser Leu Gly 
-5 

Arg Lys Asn Ala 
10 

Asp Phe Glu Glu 



<210> 234 
<211> 108 
<212> PRT 

<213> Homo sapiens 



<220> 
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<221> SIGNAL 
<222> -40. .-1 



<400> 234 



Met Gin Lys 


Phe 


Arg 


Lys 


Met 


Ser 


Glu 


Thr 


His 


His 


Ser 


Val 


He 


Ser 


-40 






-35 










-30 










-25 


Val Lys Ala 


Ser 


Ser 


Pro 


Trp 


Leu 


Ser 


Ser 


Ser 


Val 


Thr 


Ala 


Pro 


Ser 






-20 










-15 










-10 




Met Val Ala 


Pro 


Val 


Thr 


Phe 


Ala 


Ser 


He 


Val 


Glu 


Glu 


Glu 


Leu 


Gin 




-5 










1 








5 








Gin Glu Ala 


Ala 


Leu 


He 


Arg 


Ser 


Arg 


Glu 


Lys 


Pro 


Leu 


Ala 


Leu 


He 


10 








15 










20 










Gin He Glu 


Glu 


His 


Ala 


He 


Gin 


Asp 


Leu 


Leu 


Val 


Phe 


Tyr 


Glu 


Ala 


25 






30 










35 










4 0 


Phe Gly Asn 


Pro 


Glu 


Glu 


Phe 


Val 


He 


Val 


Glu 


Arg 


Thr 


Pro 


Gin 


Gly 






45 










50 










55 




Pro Leu Ala 


Val 


Pro 


Met 


Trp 


Asn 


Lys 


His 


Gly 


Cys 












60 










65 
















<210> 235 




























<211> 127 




























<212> PRT 




























<213> Homo sapiens 
























<220> 




























<221> SIGNAL 


























<222> -47. .- 


-1 


























<400> 235 




























Met Phe Thr 


Pro 


Leu 


Thr 


Val 


Lys 


Tyr 


Ala 


Tyr 


Tyr 


Asp 


Thr 


Glu 


Arg 


-45 










-40 










-35 








He Gly Val 


Asp 


Leu 


He 


Met 


Lys 


Thr 


Cys 


Phe 


Ser 


Pro 


Asn 


Arg 


Val 


-30 








-25 










-20 










lie Gly Leu 


Ser 


Ser 


Asp 


Leu 


Gin 


Gin 


Val 


Gly 


Gly Ala 


Ser 


Ala 


Arg 


-15 






-10 










-5 










1 


lie Gin Asp 


Ala 


Leu 


Ser 


Thr 


Val 


Leu 


Gin 


Tyr 


Ala 


Glu 


Asp 


Val 


Leu 




5 










10 










15 






Ser Gly Lys 


Val 


Ser 


Ala 


Asp 


Asn 


Thr 


Val 


Gly 


Arg 


Phe 


Leu 


Met 


Ser 


20 










25 










30 








Leu Val Asn 


Gin 


Val 


Pro 


Lys 


He 


Val 


Pro 


Asp 


Asp 


Phe 


Glu 


Thr 


Met 


35 








4 U 










45 










Leu Asn Ser 


Asn 


He 


Asn 


Asp 


Leu 


Leu 


Met 


Val 


Thr 


Tyr 


Leu 


Ala 


Asn 


50 






55 










60 










65 


Leu Thr Gin 


Ser 


Gin 


He 


Ala 


Leu 


Asn 


Glu 


Lys 


Leu 


Val 


Asn 


Leu 








70 




















O U 




<210> 236 




























<211> 85 




























<212> PRT 




























<213> Homo : 


sapiens 
























<220> 




























<221> SIGNAL 


























<222> -32. . 


-1 


























<400> 236 




























Met Lys Ala 


Ala 


Leu 


Glu 


Asp 


Thr 


Leu 


Ala 


Glu 


Thr 


Glu 


Ala 


Arg 


Phe 


-30 










-25 










-20 








Gly Ala Gin 


Leu 


Ala 


His 


He 


Gin 


Ala 


Leu 


He 


Ser 


Gly 


He 


Glu 


Ala 


-15 








-10 










-5 










Gin Leu Gly Asp 


Val 


Arg 


Ala 


Asp 


Ser 


Glu 


Arg 


Gin 


Asn 


Gin 


Glu 


Tyr 


1 




5 










10 










15 




Gin Arg Leu 


Met 


Asp 


He 


Lys 


Ser 


Arg 


Leu 


Glu 


Gin 


Glu 


He 


Ala 


Thr 
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20 25 30 

Tyr Arg Ser Leu Leu Glu Gly Gin Glu Asp His Tyr Asn Asn Leu Ser 

35 40 4 5 

Ala Ser Lys Val Leu 
50 



<210> 237 
<211> 101 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -25. .-1 



<400> 237 




























Met 


Val 


Asp 


Arg 


Glu 


Leu 


Ala 


Asp 


He 


His 


Glu 


Asp 


Ala 


Lys 


Thr 


Cys 


-25 










-20 










-15 










-10 


Leu 


Val 


Leu 


Cys 


Ser 
-5 


Arg 


Val 


Leu 


Ser 


Val 
1 


He 


Ser 


Val 


Lys 
5 


Glu 


He 


Lys 


Thr 


Gin 
10 


Leu 


Ser 


Leu 


Gly 


Arg 
15 


His 


Pro 


He 


He 


Ser 
20 


Asn 


Trp 


Phe 


Asp 


Tyr 
25 


He 


Pro 


Ser 


Thr 


Arg 
30 


Tyr 


Lys 


Asp 


Pro 


Cys 
35 


Glu 


Leu 


Leu 


His 


Leu 


Cys 


Arg 


Leu 


Thr 


He 


Arg 


Asn 


Gin 


Leu 


Leu 


Thr 


Asn 


Asn 


Met 


Leu 


40 










45 










50 










55 


Pro 


Asp 


Gly 


He 


Phe 
60 


Ser 


Leu 


Leu 


He 


Pro 
65 


Ala 


Arg 


Leu 


Gin 


Asn 
70 


Tyr 


Leu 


Asn 


Leu 


Glu 
75 


He 

























<210> 238 

<211> 68 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -47. .-1 

<400> 238 
Met Val Lys Ser 
-45 

Arg Glu Lys Glu 
-30 

Thr Met Pro Leu 
-15 

Ser Ser Arg Val 
5 

Arg Trp Cys Ser 
20 



Ser Leu Gin Arg 
-40 

Gly Asp Lys Pro 
-25 

Leu Ser Leu His 
-10 

Ser Leu His Cys 



He Leu Asn Ser 

Ser Ala Thr He 
-20 

Ser Arg Gly Gly 
-5 

Cys Ser Asn Pro 
10 



His Cys Phe Ala 
-35 

His Ala Ser Arg 

Ser Ser Ser Glu 
1 

Gly Pro Gly Pro 
15 



<210> 239 

<211> 68 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -47 . . -1 



<400> 239 

Met Val Lys Ser Ser Leu Gin Arg He Leu Asn Ser His Cys Phe Ala 
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-45 -40 -35 

Arg Glu Lys Glu Gly Asp Lys Pro Ser Ala Thr lie His Ala Ser Arg 

-30 " * -25 -20 

Thr Met Pro Leu Leu Ser Leu His Ser Arg Gly Gly Ser Ser Ser Glu 
-15 -10 -5 1 

Ser Ser Arg Val Ser Leu His Cys Cys Ser Asn Pro Gly Pro Gly Pro 

5 10 15 

Arg Trp Cys Ser 
20 



<210> 240 

<211> 68 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -47 . .-1 

<223> Von Heijne matrix 

score 3.72491712201476 
seq PLLSLHSRGGSSS/ES 



<400> 240 
Met Val Lys Ser 
-45 

Arg Glu Lys Glu 
-30 

Thr Met Pro Leu 
-15 

Ser Ser Arg Val 
5 

Arg Trp Cys Ser 
20 



Ser Leu Gin Arg 
-40 

Gly Asp Lys Pro 
-25 

Leu Ser Leu His 
-10 

Ser Leu His Cys 



lie Leu Asn Ser 

Ser Ala Thr lie 
-20 

Ser Arg Gly Gly 
-5 

Cys Ser Asn Pro 
10 



His Cys Phe Ala 
-35 

His Ala Ser Arg 

Ser Ser Ser Glu 
1 

Gly Pro Gly Pro 
15 



<210> 241 

<211> 68 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -47. .-1 

<223> Von Heijne matrix 

score 3.72491712201476 
seq PLLSLHSRGGSSS/ES 



<220> 

<221> unsure 
<222> 12 

<223> Xaa = Lys, Asn 



<400> 241 



Met 


Val 


Lys 


Ser 


Ser 


Leu 


Gin 


Arg 


He 


Leu 


Asn 


Ser 


His 


Cys 


Phe 


Ala 






-45 










-40 










-35 








Arg 


Glu 


Lys 


Glu 


Gly 


Asp 


Lys 


Pro 


Ser 


Ala 


Thr 


He 


His 


Ala 


Ser 


Arg 




-30 










-25 










-20 










Thr 


Met 


Pro 


Leu 


Leu 


Ser 


Leu 


His 


Ser 


Arg 


Gly 


Gly 


Ser 


Ser 


Ser 


Glu 


-15 










-10 










-5 










1 


Ser 


Ser 


Arg 


Val 


Ser 


Leu 


Xaa 


Xaa 


Cys 


Ser 


Xaa 


Pro 


Gly 


Pro 


Gly 


Pro 



5 10 15 



Arg Trp Cys Ser 
20 
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<210> 242 
<211> 129 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -16. .-1 

<223> Von Heijne matrix 

score 3.61791418904325 
seq LLLLLRALRRGPG/PG 



<220> 

<221> unsure 
<222> 92 

<223> Xaa = His, Gin 
<220> 

<221> unsure 
<222> 84 

<223> Xaa = Phe, Tyr 



<400> 242 



Met 


Ala 
-15 


Val 


Leu 


Leu 


Leu 


Leu 
-10 


Leu 


Arg 


Ala 


Leu 


Arg 
-5 


Arg 


Gly 


Pro 


Gly 


Pro 


Gly 


Pro 


Arg 


Pro 


Leu 


Trp 


Gly 


Pro 


Gly 


Pro 


Ala 


Trp 


Ser 


Pro 


Gly 


1 








5 










10 










15 




Phe 


Pro 


Ala 


Arg 
20 


Pro 


Gly 


Arg 


Gly 


Arg 
25 


Pro 


Tyr 


Met 


Ala 


Ser 
30 


Arg 


Pro 


Pro 


Gly 


Asp 
35 


Leu 


Ala 


Glu 


Ala 


Gly 
40 


Gly 


Arg 


Ala 


Leu 


Gin 
45 


Ser 


Leu 


Gin 


Leu 


Arg 
50 


Leu 


Leu 


Thr 


Pro 


Thr 
55 


Phe 


Glu 


Gly 


lie 


Asn 
60 


Gly 


Leu 


Leu 


Leu 


Lys 


Gin 


His 


Leu 


Val 


Gin 


Asn 


Pro 


Val 


Arg 


Leu 


Trp 


Gin 


Leu 


Leu 


Gly 


65 










70 










75 










80 


Gly 


Thr 


Phe 


Xaa 


Phe 
85 


Asn 


Thr 


Ser 


Arg 


Leu 
90 


Lys 


Xaa 


Lys 


Asn 


Lys 
95 


Glu 


Lys 


Asp 


Lys 


Ser 
100 


Lys 


Gly 


Lys 


Ala 


Pro 
105 


Glu 


Glu 


Asp 


Glu 


Gly 
110 


He 


Phe 



He 



<210> 243 
<211> 129 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -16. .-1 

<223> Von Heijne matrix 

score 3.61791418904325 
seq LLLLLRALRRGPG/PG 



<400> 243 
Met Ala Val Leu 
-15 

Pro Gly Pro Arg 
1 

Phe Pro Ala Arg 
20 

Pro Gly Asp Leu 
35 

Leu Arg Leu Leu 



Leu Leu Leu Leu 
-10 

Pro Leu Trp Gly 
5 

Pro Gly Arg Gly 

Ala Glu Ala Gly 
40 

Thr Pro Thr Phe 



Arg Ala Leu Arg 
-5 

Pro Gly Pro Ala 
10 

Arg Pro Tyr Met 
25 

Gly Arg Ala Leu 
Glu Gly He Asn 



Arg Gly Pro Gly 

Trp Ser Pro Gly 
15 

Ala Ser Arg Pro 
30 

Gin Ser Leu Gin 
45 

Gly Leu Leu Leu 
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50 55 60 

Lys Gin His Leu Val Gin Asn Pro Val Arg Leu Trp Gin Leu Leu Gly 
65 70 75 80 

Gly Thr Phe Tyr Phe Asn Thr Ser Arg Leu Lys Gin Lys Asn Lys Glu 

85 90 95 

Lys Asp Lys Ser Lys Gly Lys Ala Pro Glu Glu Asp Glu Gly lie Phe 
100 105 110 

lie 



<210> 244 
<211> 109 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -33. .-1 



<220> 

<221> UNSURE 
<222> 43 

<223> Xaa = Asp, Gly, Asn, Ser 



<400> 244 




























Met 


Ala 


Ala 


Ser 


Gly 


Ala 


Pro 


Arg 


lie 


Leu 


Val 


Asp 


Leu 


Leu 


Lys 


Leu 








-30 










-25 










-20 






Asn 


Val 


Ala 


Pro 


Leu 


Ala 


Val 


Phe 


Gin 


Met 


Leu 


Lys 


Ser 


Met 


Cys 


Ala 






-15 










-10 










-5 








Gly 


Gin 


Arg 


Leu 


Ala 


Ser 


Glu 


Pro 


Gin 


Asp 


Pro 


Ala 


Ala 


Val 


Ser 


Leu 




1 








5 










10 










15 


Pro 


Thr 


Ser 


Ser 


Val 


Pro 


Glu 


Thr 


Arg 


Gly Arg 


Asn 


Lys 


Gly 


Ser 


Ala 










20 










25 










30 




Ala 


Leu 


Gly 


Gly 


Ala 


Leu 


Ala 


Leu 


Ala 


Glu 


Arg 


Xaa 


Ser 


Arg 


Glu 


Gly 








35 










40 










45 






Ser 


Ser 


Gin 


Arg 


Met 


Pro 


Arg 


Gin 


Pro 


Ser 


Ala 


Thr 


Arg 


Leu 


Pro 


Lys 






50 










55 










60 








Gly 


Gly 


Gly 


Pro 


Gly 


Lys 


Ser 


Pro 


Thr 


Arg 


Gly 


Ser 


Thr 









65 70 75 



<210> 245 
<211> 103 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -36. .-1 



<400> 245 
Met Ala Asn Leu 
-35 

Arg Arg His Thr 
-20 

Ser lie Arg Gly 

Arg Ser Leu Leu 
15 

Gly Phe Lys Asn 
30 

Leu Val Lys Arg 
45 

Arg His Lys Gin 



Phe lie Arg Lys 
-30 

Val Lys Pro Arg 
-15 

Ala Ala Pro Val 
1 

Ser Pro Gly Leu 
20 

Lys Thr Val Leu 
35 

Arg Gly Arg Trp 
50 

Arg Gin Met 
65 



Met Val Asn Pro 
-25 

Ala Leu Ser Thr 
-10 

Ala Val Glu Pro 
5 

Leu Pro His Leu 

Lys Lys Arg Cys 
40 

Tyr Val Tyr Cys 
55 



Leu Leu Tyr Leu 

Phe Leu Phe Gly 
-5 

Gly Ala Ala Val 
10 

Leu Pro Ala Leu 
25 

Lys Asp Cys Tyr 

Lys Thr His Pro 
60 
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<210> 246 
<211> 86 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -36. .-1 



<400> 246 

Met Gly Pro Ala Leu 
-35 

Ala Asp Pro Leu Pro 
-20 

Gly Thr Leu Ser Glu 
1 

Lys Glu Cys Pro lie 
15 

Asp Ala Leu Glu Asp 
30 

Asp Pro Phe Thr Phe 
45 



Arg Ser Leu Gin Val 
-30 

Phe Pro Ser Val Ser 
-15 

Thr Ser Thr Gly Gly 
5 

Cys Lys Glu Arg Phe 
20 

His Met Asp Gly His 
35 

Glu 
50 



Lys Lys Gly Thr Glu His 
-25 

Leu Ser Gly Phe Thr Val 
-10 -5 
Pro Ala Thr Pro Thr Trp 
10 

Pro Ala Glu Ser Asp Lys 
25 

Phe Phe Phe Ser Thr Gin 
40 



<210> 247 
<211> 52 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -30. .-1 



<400> 247 

Met Ser Ser Cys Gly 
-30 

Ala Val Lys Leu Leu 
-10 

Cys Cys Asn Gly Ser 
5 

lie Lys Asn Phe 
20 



lie Val Gly Ser Ser 
-25 

Leu Lys Met Val Ser 
-5 

Ala Asn Phe His Lys 
10 



Val Ser Phe Gin Leu Asp 
-20 -15 
Ser Ala Thr Thr Glu Arg 
1 

Asn Leu Cys Ala Thr Gly 

15 



<210> 248 
<211> 219 
<212> PRT 
<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -40. . -1 

<223> Von Heijne matrix 

score 6.4143157541603 
seq I TMS AT VVDAVNA/ AP 



<220> 

<221> unsure 
<222> 13 

<223> Xaa = Glu, Gin 
<220> 

<221> unsure 
<222> -39 
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<223> Xaa = Ile,Asn 



160 
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<400> 248 



Pro 


Xaa 


Arg 


Lys 


He 


Thr 


Arg 


Leu 


Ala 


Trp 


He 


Leu 


Ser 


Asp 


Gly 


Leu 


-40 










-35 










-30 










-25 


Gly 


Ala 


Ser 


Cys 


Gly 


Gly Ala 


Gly 


Gly 


Gly 


Gin 


He 


Thr 


Met 


Ser 


Ala 










-20 










-15 










-10 




Thr 


Val 


Val 


Asp 


Ala 


Val 


Asn 


Ala 


Ala 


Pro 


Leu 


Ser 


Gly 


Ser 


Lys 


Glu 








-5 










1 








5 








Met 


Ser 


Leu 


Glu 


Xaa 


Pro 


Lys 


Lys 


Met 


Thr 


Arg 


Glu 


Asp 


Trp 


Arg 


Lys 




10 










15 










20 










Lys 


Lys 


Glu 


Leu 


Glu 


Glu 


Gin 


Arg 


Lys 


Leu 


Gly 


Asn 


Ala 


Pro 


Ala 


Glu 


25 










30 










35 










40 


Val 


Asp 


Glu 


Glu 


Gly 


Lys 


Asp 


He 


Asn 


Pro 


His 


He 


Pro 


Gin 


Tyr 


He 










45 










50 










55 




Ser 


Ser 


Val 


Pro 


Trp 


Tyr 


He 


Asp 


Pro 


Ser 


Lys 


Arg 


Pro 


Thr 


Leu 


Lys 








60 










65 










70 






His 


Gin 


Arg 


Pro 


Gin 


Pro 


Glu 


Lys 


Gin 


Lys 


Gin 


Phe 


Ser 


Ser 


Ser 


Gly 






75 










80 










85 








Glu 


Trp 


Tyr 


Lys 


Arg 


Gly 


Val 


Lys 


Glu 


Asn 


Ser 


He 


He 


Thr 


Lys 


Tyr 




90 










95 










100 










Arg 


Lys 


Gly 


Ala 


Cys 


Glu 


Asn 


Cys 


Gly 


Ala 


Met 


Thr 


His 


Lys 


Lys 


Lys 


105 










110 










115 










12 0 


Asp 


Cys 


Phe 


Glu 


Arg 


Pro 


Arg 


Arg 


Val 


Gly 


Ala 


Lys 


Phe 


Thr 


Gly 


Thr 










125 










130 










135 




Asn 


He 


Ala 


Pro 


Asp 


Glu 


His 


Val 


Gin 


Pro 


Gin 


Leu 


Met 


Phe 


Asp 


Tyr 








140 










145 










150 






Asp 


Gly 


Lys 


Arg 


Asp 


Arg 


Trp 


Asn 


Gly 


Tyr 


Asn 


Pro 


Glu 


Glu 


His 


Met 






155 










160 










165 








Lys 


He 


Val 


Glu 


Glu 


Tyr 


Ala 


Lys 


Val 


Asp 


Leu 














170 










175 




















<210> 249 




























<211> 215 




























<212> PRT 




























<213> Homo sapiens 
























<220> 






























<221> SIGNAL 


























<222> -36. 


-1 


























<223> Von Heijne matrix 
























score 


5.52542399647426 




















seq I T M S AT VX D AVN A / A P 




















<400> 249 




























He 


Thr 


Arg 


Leu 


Ala 


Trp 


He 


Leu 


Xaa 


Asp 


Gly 


Leu 


Gly 


Ala 


Ser 


Cys 




-35 










-30 










-25 










Gly 


Gly 


Ala 


Gly 


Gly 


Gly 


Gin 


He 


Thr 


Met 


Ser 


Ala 


Thr 


Val 


Xaa 


Asp 


-20 










-15 










-10 










-5 


Ala 


Val 


Asn 


Ala 


Ala 


Pro 


Leu 


Ser 


Gly 


Ser 


Lys 


Glu 


Met 


Ser 


Leu 


Glu 










1 








5 










10 






Glu 


Pro 


Lys 


Lys 


Met 


Thr 


Arg 


Glu 


Asp 


Trp 


Arg 


Lys 


Lys 


Lys 


Glu 


Leu 






15 










20 










25 








Glu 


Glu 


Gin 


Arg 


Lys 


Leu 


Gly 


Asn 


Ala 


Pro 


Ala 


Glu 


Val 


Asp 


Glu 


Glu 




30 










35 










40 










Gly 


Lys 


Asp 


He 


Asn 


Pro 


His 


He 


Pro 


Gin 


Tyr 


He 


Ser 


Ser 


Val 


Pro 


45 










50 










55 










60 


Trp 


Tyr 


He 


Asp 


Pro 


Ser 


Lys 


Arg 


Pro 


Thr 


Leu 


Lys 


His 


Gin 


Arg 


Pro 










65 










70 










75 




Gin 


Pro 


Glu 


Lys 


Gin 


Lys 


Gin 


Phe 


Ser 


Ser 


Ser 


Gly 


Glu 


Trp 


Tyr 


Lys 








80 










85 










90 






Arg 


Gly 


Val 


Lys 


Glu 


Asn 


Ser 


He 


He 


Thr 


Lys 


Tyr 


Arg 


Lys 


Gly 


Ala 






95 










100 










105 
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Cys 


Glu 


Asn 


Cys 


Gly 


Ala 


Met 


Thr 


His 




110 










115 






Arg 


Pro 


Arg 


Arg 


Val 


Gly 


Ala 


Lys 


Phe 


125 










130 








Asp 


Glu 


His 


Val 


Gin 


Pro 


Gin 


Leu 


Met 










145 










Asp 


Arg 


Trp 


Asn 


Gly 


Tyr 


Asn 


Pro 


Glu 








160 










165 


Glu 


Tyr 


Ala 


Lys 


Val 


Asp 


Leu 







175 



<210> 250 
<211> 214 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -36. .-1 

<220> 

<221> UNSURE 
<222> 120 

<223> Xaa = Glu, Lys 
<400> 250 



He 


Thr 


Arg 


Leu 


Ala 


Trp 


He 


Leu 


Ser 




-35 










-30 






Gly 


Gly 


Ala 


Gly 


Gly 


Gly 


Gin 


He 


Thr 


-20 










-15 








Ala 


Val 


Asn 


Ala 


Ala 


Pro 


Leu 


Ser 


Gly 










1 








5 


Glu 


Pro 


Lys 


Lys 


Met 


Thr 


Arg 


Glu 


Asp 






15 










20 




Glu 


Glu 


Gin 


Arg 


Lys 


Leu 


Gly 


Asn 


Ala 




30 










35 






Gly 


Lys 


Asp 


He 


Asn 


Pro 


His 


He 


Pro 


45 










50 








Trp 


Tyr 


He 


Asp 


Pro 


Ser 


Lys 


Arg 


Pro 










65 










Gin 


Pro 


Glu 


Lys 


Gin 


Lys 


Gin 


Phe 


Ser 








80 










85 


Arg 


Gly 


Val 


Lys 


Glu 


Asn 


Ser 


He 


He 






95 










100 




Cys 


Glu 


Asn 


Cys 


Gly 


Ala 


Met 


Thr 


His 




110 










115 






Arg 


Pro 


Arg 


Arg 


Val 


Gly 


Ala 


Lys 


Phe 


125 










130 








Asp 


Glu 


His 


Val 


Gin 


Pro 


Gin 


Leu 


Met 








145 










Asp 


Arg 


Trp 


Asn 


Gly 


Tyr 


Asn 


Pro 


Glu 








160 










165 


Glu 


Tyr 


Ala 


Lys 


Val 


Asp 









175 



<210> 251 
<211> 147 
<212> PRT 

<213> Homo sapiens 



PCT/IB01/00914 

161 



Lys 


Lys 


Lys 


Asp 


Cys 


Phe 


Glu 






120 










Thr 


Gly 


Thr 


Asn 


He 


Ala 


Pro 




135 










140 


Phe 


Asp 


Tyr 


Asp 


Gly 


Lys 


Arg 


150 










155 




Glu 


His 


Met 


Lys 


He 


Val 


Glu 










170 






Asp 


Gly 


Leu 


Gly 


Ala 


Ser 


Cys 






-25 










Met 


Ser 


Ala 


Thr 


Val 


Val 


Asp 




-10 










-5 


Ser 


Lys 


Glu 


Met 


Ser 


Leu 


Glu 










10 






Trp 


Arg 


Lys 


Lys 


Lys 


Glu 


Leu 








25 








Pro 


Ala 


Glu 


Val 


Asp 


Glu 


Glu 






40 










Gin 


Tyr 


He 


Ser 


Ser 


Val 


Pro 




55 










60 


Thr 


Leu 


Lys 


His 


Gin 


Arg 


Pro 


70 










75 




Ser 


Ser 


Gly 


Glu 


Trp 


Tyr 


Lys 










90 






Thr 


Lys 


Tyr 


Arg 


Lys 


Gly 


Ala 








105 








Lys 


Lys 


Xaa 


Asp 


Cys 


Phe 


Glu 






120 










Thr 


Gly 


Thr 


Asn 


He 


Ala 


Pro 




135 










140 


Phe 


Asp 


Tyr 


Asp 


Gly 


Lys 


Arg 


150 










155 




Glu 


His 


Met 


Lys 


He 


Val 


Glu 










170 







<220> 

<221> SIGNAL 
<222> -15. .-1 
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<220> 

<221> UNSURE 
<222> -9 

<223> Xaa = Glu, Lys 



<400> 251 



Met 


Asp 


Val 


Leu 


-15 








Glu 


Glu 


He 


Lys 








5 


Gly 


Cys 


Leu 


Gly 






20 




Leu 


Lys 


Leu 


Lys 




35 






Glu 


Arg 


Asn 


Lys 


50 








Gin 


Glu 


Val 


Phe 


Asn 


Pro 


Pro 


Leu 








85 


Tyr 


Gin 


Cys 


Asn 






100 




Asp 


Ser 


Trp 


Leu 




115 






Leu 


Gin 


Lys 





130 



Val Ser Xaa Cys 
-10 

Ser Leu Thr Ala 

Ala Ser Pro Asn 
25 

Tyr Arg Leu Asn 
40 

Pro Thr Lys Asn 
55 

Gly His Ala He 
70 

Leu Val Thr Pro 

Ser Ala Met Gly 
105 

Phe Asp Phe Phe 
120 



Ser Ala Arg Leu 
-5 

Glu He Asp Arg 
10 

Leu Glu Gin Leu 

He Leu Arg Lys 
45 

Met He Asn He 
60 

Lys Ala Ala Tyr 
75 

Ser Gin Gin Ala 
90 

He Ser Gin Val 

Lys Tyr Tyr Tyr 
125 



Leu Gin Gin Glu 
1 

Leu Lys Asn Cys 
15 

Gin Glu Glu Asn 
30 

Ser Leu Gin Ala 

He Ser Arg Leu 
65 

Pro Asp Leu Glu 
80 

Lys Phe Gly Asp 
95 

Met Tyr Cys His 
110 

His His Cys His 



<210> 252 
<211> 117 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -43. .-1 

<223> Von Heijne matrix 

score 4.10872570449087 
seq LGQVAAAAAAANA/AS 



<400> 252 
Leu Ser Pro Ala 
-40 

Ser Gly Trp Ser 
-25 

Gin Val Ala Ala 
-10 

Ser Ala Asn Ser 

Thr Ser Arg Pro 
25 

Phe Pro Pro Ser 
40 

Cys Tyr Cys Phe 
55 

Lys Lys Lys Lys 



Gin Ala Ser Gly 

Glu Arg Gly Ser 
-20 

Ala Ala Ala Ala 
-5 

Leu Gly Ser Gly 
10 

Ser Gin Ser Arg 

Pro Leu Pro Pro 
45 

Leu Leu He Asn 
60 

Lys 



He Trp Cys Arg 
-35 

Ser Ser Arg Arg 

Ala Asn Ala Ala 
1 

Gly Arg Cys Lys 
15 

Pro Arg Ser Leu 
30 

Ser Pro Ala Gly 

Ala Asp Leu He 
65 



Asp Ser Ala Phe 
-30 

Gly Cys Leu Gly 
-15 

Ser Pro Pro Pro 
5 

Leu Arg Ala Pro 
20 

Pro Gin Ala Arg 
35 

Thr Ser Cys Thr 
50 

Lys Trp Ser Pro 



<210> 253 
<211> 88 
<212> PRT 

<213> Homo sapiens 
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<220> 

<221> SIGNAL 
<222> -36. .-1 

<400> 253 

Met Ala Ser Val Val Pro 
-35 

Leu Gly Glu Leu Pro Ser 
-20 -15 
Gly lie Phe Gly Ala Phe 
1 

Tyr lie Asn Val Lys Lys 
15 

Ala Cys Tyr Val Leu Phe 
30 

His Glu Arg Leu Arg Lys 
45 50 

<210> 254 
<211> 62 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -42. .-1 

<220> 

<221> UNSURE 
<222> -11 

<223> Xaa = Glu, Lys 
<400> 254 

Met Asp Pro Asn Cys Ser 
-40 

Ala Ser Ser Cys Lys Cys 
-25 

Ser Cys Cys Ser Cys Cys 
-10 -5 
Cys lie Cys Lys Gly Ala 
10 

<210> 255 
<211> 132 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> 118 

<223> Xaa = Gly, Val 
<400> 255 

Leu Phe Pro Ala Pro Ala 
1 5 
Pro Lys Val Pro Ser Pro 
20 

Ser Pro Gin Pro Ser Tyr 
35 

Pro Pro Pro Ala Cys Leu 
50 

Ala Met Asn Leu Phe Arg 
65 70 



Val Lys Asp Lys Lys 
-30 

Trp lie Leu Met Arg 
-10 

Gin Arg Gly Tyr Tyr 
5 

Gly Ser lie Ser Gly 
20 

Ser Tyr Ser Phe Ser 
35 

Tyr His 



Leu Leu Glu Val Lys 
-25 

Asp Phe Ser Pro Ser 
-5 

Arg Tyr Tyr Asn Lys 
10 

He Thr Met Val Leu 
25 

Tyr Lys His Leu Lys 
40 



Cys Ala Ala Ala Gly 
-35 

Lys Glu Cys Lys Cys 
-20 

Pro Val Gly Cys Ala 
1 

Ser Glu Lys Cys Ser 
15 



Val Ser Cys Thr Cys 
-30 

Thr Ser Cys Lys Xaa 
-15 

Lys Cys Ala Gin Gly 
5 

Cys Cys Ala 
20 



Pro Pro Pro Ala Pro 
10 

Glu Arg Ser Ala Pro 

25 

Pro Phe Arg Pro Ala 
40 

Pro Pro Ala Gin Pro 
55 

Phe Leu Gly Asp Leu 
7 5 



Ala Phe Ala Pro Pro 
15 

Arg Val Pro Leu Pro 
30 

Ala Ser Gly Gly Thr 
45 

Cys Gin Gly Ser Pro 
60 

Ser His Leu Leu Ala 
80 
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lie lie Leu Leu Leu Leu Lys lie Trp Lys Ser Arg Ser Cys Ala Ala 

85 90 95 

His Pro Gin Leu Pro Leu Ser Phe Cys Leu Ser Val Cys Leu Ser Val 

100 105 110 

Ser Leu Ser Leu Ser Xaa Ser Leu Ser Leu Ser Phe Ser Val Ser Lys 

115 120 125 

Lys Lys Lys Lys 
130 



<210> 256 
<211> 163 
<212> PRT 

<213> Homo sapiens 



<400> 256 

Pro His Arg Leu 

1 

Pro Ser Arg Ser 
20 

Val Ser Glu Leu 
35 

Gly Arg Lys His 
50 

Gly Cys Ser Pro 
65 

Arg Tyr Pro Arg 

Ser Ser Val Phe 
100 

Leu Ala Val Ala 
115 

His Ser Pro Ser 
130 

Pro Thr Phe Glu 
145 

Pro Asp Val 



Thr Leu Pro Thr 
5 

Glu Lys Leu Gly 

Gin Val Leu Leu 
40 

Asp Leu Leu Met 
55 

Ala Val Gin lie 
70 

Thr Leu Glu Gly 
85 

Ser Leu Asp Gly 

Gly lie His Ser 
120 

Ser Pro Val Gly 
135 

Met Gin Gin Pro 
150 



Leu Gin Ser Val 
10 

Lys Asn Met Val 
25 

Gly Phe Ala Gly 

Arg Ala Leu His 
60 

Lys lie Arg Glu 
75 

Leu Ser Asp Leu 
90 

Gly Ser Ser Pro 
105 

Leu Pro Ser Thr 

Ser Val Leu Leu 
140 

Ser Pro Pro lie 
155 



Thr Phe Arg Cys 
15 

Ser Ser Phe Arg 
30 

Arg Asn Lys Ser 
45 

Leu Leu Lys Ser 

Leu Tyr Arg Arg 
80 

Ser Thr lie Lys 

95 

Val Glu Pro Asp 
110 

Ser Val Thr Pro 
125 

Gin Asp Thr Lys 

Pro Pro Val His 
160 



<210> 257 
<211> 166 
<212> PRT 

<213> Homo sapiens 



<400> 257 



Pro 


His 


Arg 


Leu 


Thr 


Leu 


Pro 


Thr 


Leu 


Gin 


Ser 


Val 


Thr 


Phe 


Arg 


Cys 


1 








5 










10 










15 




Pro 


Ser 


Arg 


Ser 


Glu 


Lys 


Leu 


Gly 


Lys 


Asn 


Met 


Val 


Ser 


Ser 


Phe 


Arg 








20 










25 










30 






Val 


Ser 


Glu 


Leu 


Gin 


Val 


Leu 


Leu 


Gly 


Phe 


Ala 


Gly 


Arg 


Asn 


Lys 


Ser 






35 










40 










45 








Gly 


Arg 


Lys 


His 


Asp 


Leu 


Leu 


Met 


Arg 


Ala 


Leu 


His 


Leu 


Leu 


Lys 


Ser 




50 










55 










60 










Gly 


Cys 


Ser 


Pro 


Ala 


Val 


Gin 


He 


Lys 


He 


Arg 


Glu 


Leu 


Tyr 


Arg 


Arg 


65 










70 










75 










80 


Arg 


Tyr 


Pro 


Arg 


Thr 


Leu 


Glu 


Gly 


Leu 


Ser 


Asp 


Leu 


Ser 


Thr 


He 


Lys 










85 










90 










95 




Ser 


Ser 


Val 


Phe 


Ser 


Leu 


Asp 


Gly 


Gly 


Ser 


Ser 


Pro 


Val 


Glu 


Pro 


Asp 








100 










105 










110 






Leu 


Ala 


Val 


Ala 


Gly 


He 


His 


Ser 


Leu 


Pro 


Ser 


Thr 


Ser 


Val 


Thr 


Pro 






115 










120 










125 








His 


Ser 


Pro 


Ser 


Ser 


Pro 


Val 


Gly 


Ser 


Val 


Leu 


Leu 


Gin 


Asp 


Thr 


Lys 




130 










135 










140 










Pro 


Thr 


Phe 


Glu 


Met 


Gin 


Gin 


Pro 


Ser 


Pro 


Pro 


He 


Pro 


Pro 


Val 


His 


145 










150 










155 










160 
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Pro Asp Val Gin Leu Lys 
165 



<210> 258 
<211> 118 
<212> PRT 

<213> Homo sapiens 



<400> 258 



Met 


Ser 


Tyr 


Glu 


1 








Arg 


Leu 


Asn 


Arg 








20 


Leu 


Arg 


Lys 


Ala 






35 




Val 


Lys 


Thr 


His 




50 






Ser 


Met 


Val 


Gly 


65 








Lys 


Pro 


Glu 


Met 


Lys 


Pro 


Val 


Lys 








100 


Arg 


Phe 


He 


Pro 



115 



Gin 


Leu 


Met 


Gin 


5 








Gly 


Leu 


Arg 


Arg 


Lys 


Lys 


Glu 


Ala 
40 


Leu 


Arg 


Asp 
55 


Met 


Val 


Tyr 
70 


Asn 


Gly 


He 
85 


Gly 


His 


Tyr 


His 


Gly 


Arg 


Pro 


Leu 


Lys 







Leu Tyr Ser Ala 
10 

Lys Gin His Ser 
25 

Pro Pro Met Glu 

He He Leu Pro 
60 

Lys Thr Phe Asn 
75 

Leu Gly Glu Phe 
90 

Gly He Gly Ala 
105 



Arg Gin Arg Arg 
15 

Leu Leu Lys Arg 
30 

Lys Pro Glu Val 
45 

Glu Met Val Gly 

Gin Val Glu lie 
80 

Ser He Thr Tyr 
95 

Thr His Ser Ser 
110 



<210> 259 
<211> 101 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> unsure 
<222> 98 

<223> Xaa = He, Asn 
<220> 

<221> unsure 
<222> 19 

<223> Xaa = Ser, Thr 



<400> 259 




Arg 


Thr 


Ala 


Gly 


1 








Ala 


Arg 


Xaa 


Cys 








20 


Gly 


Ser 


Arg 


Ala 






35 




Pro 


Lys 


Pro 


Ser 




50 






Ala 


Glu 


Arg 


Leu 


65 








Phe 


Cys 


Thr 


Ala 


Lys 


Xaa 


Xaa 


Lys 



100 



Arg Gly Arg Pro 
5 

Ala Glu Asp Pro 

Met Ala Glu Glu 
40 

Val Leu Phe Leu 
55 

Val Leu Leu Pro 
70 

Pro Arg Ser Leu 

85 

Ser 



Ala Val Ala Ser 
10 

His Gin Gly Ala 
25 

Gin Gly Arg Glu 

His Pro Asp Leu 
60 

Phe Pro Thr Glu 
75 

Asn Ser Leu Pro 
90 



Trp Glu Leu Arg 
15 

Gly Cys Ser Cys 
30 

Arg Asp Ser Val 
45 

Gly Val Gly Gly 

Arg Arg Leu Leu 
80 

Leu Lys Lys Lys 
95 



<210> 260 
<211> 100 
<212> PRT 

<213> Homo sapiens 



<400> 260 



WO 02/083898 
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Val 


Arg 


Gin 


Gly 


1 








Arg 


Gly 


Ala 


Cys 








20 


Gly 


Ser 


Arg 


Ala 






35 




Pro 


Lys 


Pro 


Ser 




50 






Ala 


Glu 


Arg 


Leu 


65 








Phe 


Cys 


Thr 


Ala 


Lys 


Asn 


Ala 


Lys 



100 



Gly Val Gly Pro 
5 

Ala Glu Asp Pro 

Met Ala Glu Glu 
40 

Val Leu Phe Leu 
55 

Val Leu Leu Pro 
70 

Pro Arg Ser Leu 
85 



166 

Gin Trp Pro Ala 
10 

His Gin Gly Ala 
25 

Gin Gly Arg Glu 

His Pro Asp Leu 
60 

Phe Pro Thr Glu 
75 

Asn Ser Leu Pro 
90 



Gly 


Ser 


Tyr 


Glu 






15 




Gly 


Cys 


Ser 


Cys 




30 






Arg 


Asp 


Ser 


Val 


45 








Gly 


Val 


Gly 


Gly 


Arg 


Arg 


Leu 


Leu 








80 


Leu 


Lys 


Lys 


Lys 



95 



<210> 261 
<211> 147 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> unsure 
<222> 90 

<223> Xaa = Ala, Gly 



<400> 261 

Met Glu Tyr Val 

1 

Gin Phe Thr Val 

20 

Val Ser Arg Asp 
35 

Xaa Cys Tyr Lys 
50 

Leu lie Val lie 
65 

Val lie Gin Glu 

Leu Pro Gin Lys 
100 

Lys Leu Gly Leu 
115 

Ser Met Lys Ser 
130 

Glu Ala Glu 
145 



Leu Glu Val Lys 
5 

Cys Asp Val Pro 

Arg Cys Gin Glu 
40 

Lys Ala Val Pro 
55 

Lys Ala Gly Ala 
70 

Ser Arg Lys Glu 
85 

Ser Ser Glu Lys 

Lys Pro Ala Ser 
120 

Asp Glu Asp Lys 
135 



Asn Ser Pro Arg 
10 

Leu Tyr Asp lie 

25 

Leu Gly Cys Cys 

lie Tyr lie His 
60 

Phe Val lie Thr 
75 

Lys Xaa lie Pro 
90 

Ala Glu Leu Ala 
105 

Pro Gly Pro Pro 

Asp Asp Val Thr 
140 



His Leu Leu Lys 
15 

Cys Asp Tyr Asn 
30 

Xaa Tyr Glu Gly 
45 

Val Phe Ser Ala 

lie lie Tyr Arg 
80 

Val Tyr Val Ala 
95 

Ser Ser Ser Ser 
110 

Ser Ala Gly Pro 
125 

Gly Thr He Thr 



<210> 262 
<211> 147 
<212> PRT 

<213> Homo sapiens 



<400> 262 



Met 


Glu 


Tyr 


Val 


Leu 


Glu 


Val 


Lys 


Asn 


Ser 


Pro 


Arg 


His 


Leu 


Leu 


Lys 


1 








5 










10 










15 




Gin 


Phe 


Thr 


Val 


Cys 


Asp 


Val 


Pro 


Leu 


Tyr 


Asp 


He 


Cys 


Asp 


Tyr 


Asn 








20 










25 










30 






Val 


Ser 


Arg 


Asp 


Arg 


Cys 


Gin 


Glu 


Leu 


Gly 


Cys 


Cys 


Phe 


Tyr 


Glu 


Gly 






35 










40 










45 








Val 


Cys 


Tyr 


Lys 


Lys 


Ala 


Val 


Pro 


He 


Tyr 


lie 


His 


Val 


Phe 


Ser 


Ala 




50 










55 










60 










Leu 


He 


Val 


He 


He 


Ala 


Gly 


Ala 


Phe 


Val 


lie 


Thr 


lie 


lie 


Tyr 


Arg 


65 










70 










75 










80 
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Val lie Gin 


Glu 


Ser 


Arg 


Lys 


Glu 


Lys 


Ala 


He 


Pro 


Val 


Tyr 


Val 


Ala 






85 










90 










95 




Leu Pro Gin 


Lys 


Ser 


Ser 


Glu 


Lys 


Ala 


Glu 


Leu 


Ala 


Ser 


Ser 


Ser 


Ser 




100 










105 










110 






Lys Leu Gly 


Leu 


Lys 


Pro 


Ala 


Ser 


Pro 


Gly 


Pro 


Pro 


Ser 


Ala 


Gly 


Pro 


115 










120 










125 








Ser Met Lys 


Ser 


Asp 


Glu 


Asp 


Lys 


Asp 


Asp 


Val 


Thr 


Gly 


Thr 


He 


Thr 


130 








135 










140 










Glu Ala Glu 




























145 




























<210> 263 




























<211> 149 




























<212> PRT 




























<213> Homo sapiens 
























<400> 263 




























Met Arg Tyr 


Asn 


Glu 


Lys 


Glu 


Leu 


Gin 


Ala 


Leu 


Ser 


Arg 


Gin 


Pro 


Ala 


1 




5 










10 










15 




Glu Met Ala 


Ala 


Glu 


Leu 


Gly 


Met 


Arg 


Gly 


Pro 


Lys 


Lys 


Gly 


Ser 


Val 




20 










25 










30 






Leu Lys Arg 


Arg 


Leu 


Val 


Lys 


Leu 


Val 


Val 


Asn 


Phe 


Leu 


Phe 


Tyr 


Phe 


35 










40 










45 








Arg Thr Asp 


Glu 


Ala 


Glu 


Pro 


Val 


Gly 


Ala 


Leu 


Leu 


Leu 


Glu 


Arg 


Cys 


50 








55 










60 










Arg Val Val 


Arg 


Glu 


Glu 


Pro 


Gly 


Thr 


Phe 


Ser 


He 


Ser 


Phe 


He 


Glu 


65 






70 










75 










80 


Asp Pro Glu 


Arg 


Lys 


Tyr 


His 


Phe 


Glu 


Cys 


Ser 


Ser 


Glu 


Glu 


Gin 


Cys 






85 










90 










95 




Gin Glu Trp 


Met 


Glu 


Ala 


Leu 


Arg 


Arg 


Ala 


Ser 


Tyr 


Glu 


Phe 


Met 


Arg 




100 










105 










110 






Arg Ser Leu 


He 


Phe 


Tyr 


Arg 


Asn 


Glu 


He 


Arg 


Lys 


Val 


Thr 


Gly 


Lys 


115 










120 










125 








Asp Pro Leu 


Glu 


Gin 


Phe 


Gly 


He 


Ser 


Glu 


Glu 


Ala 


Arg 


Phe 


Gin 


Leu 


130 








135 










140 










Ser Gly Leu 


Gin 


Ala 
























145 




























<210> 264 




























<211> 103 




























<212> PRT 




























<213> Homo sapiens 
























<400> 264 




























Met Pro Val 


Val 


Pro 


Ala 


Leu 


Gly 


Arg 


Pro 


Arg 


Trp 


Ala 


Asp 


His 


Leu 


1 




5 










10 










15 




Arg Ser Gly 


Val 


Arg 


Asp 


Gin 


Pro 


Gly 


Gin 


Pro 


Gly 


Glu 


Ala 


Pro 


Pro 




20 










25 










30 






Ser Leu Leu 


Lys 


He 


Gin 


Lys 


Leu 


Ala 


Gly 


Tyr 


Gly 


Gly 


Gly 


Cys 


Leu 


35 










40 










45 








Trp Ser Gin 


Leu 


Leu 


Gly Arg 


Leu 


Arg 


Arg 


Glu 


Asn 


His 


Leu 


Ser 


Pro 


50 








55 










60 










Gly Gly Gly 


Gly 


Cys 


Ser 


Glu 


Pro 


Arg 


Leu 


Cys 


His 


Cys 


Thr 


Pro 


Ala 


65 






70 










75 










80 


Trp Val Thr 


Glu 


Gin 


Asp 


Ser 


He 


Ser 


Lys 


He 


Glu 


Lys 


He 


Ser 


Gin 






85 










90 










95 





Leu Pro Lys Lys Lys Lys Lys 
100 



<210> 265 
<211> 78 
<212> PRT 

<213> Homo sapiens 
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<400> 265 




























Met Asp Pro 


Asn 


Cys 


Ser 


Cys 


Ala 


Ala 


Gly 


Val 


Ser 


Cys 


Thr 


Cys 


Ala 


1 




5 










10 










15 




Ser Ser Cys 


Lys 


Cys 


Lys 


Glu 


Cys 


Lys 


Cys 


Thr 


Ser 


Cys 


Lys 


Lys 


Ser 




20 










25 










30 






Cys Cys Ser 


Trp 


Cys 


Thr 


Gly 


Pro 


Thr 


Leu 


Val 


Asp 


Thr 


Cys 


Pro 


Ala 


35 










40 










45 








Leu Pro Pro 


Val 


Cys 


Leu 


Ser 


Gin 


Arg 


Ser 


Ser 


Gly 


Met 


Asn 


Leu 


Arg 


50 








55 










60 










Thr His Val 


Gin 


Trp 


Glu 


Val 


Arg 


Pro 


Pro 


Leu 


Asn 


He 


Gin 






65 






70 










75 












<210> 266 




























<211> 92 




























<212> PRT 




























<213> Homo : 


sapiens 
























<400> 266 




























Met Ser Pro 


Gin 


Thr 


His 


Ser 


Gin 


Thr 


Cys 


He 


Arg 


Asn 


Leu 


Val 


Thr 


1 




5 










10 










15 




Cys lie Asn 


Tyr 


Pro 


Arg 


Thr 


Ser 


Thr 


Gly 


Cys 


Lys 


Gly 


Thr 


Thr 


Thr 




20 










25 










30 






Gin Arg lie 


Met 


Glu 


Pro 


Val 


Glu 


Leu 


Glu 


Val 


Glu 


Gly 


Thr 


Glu 


Gin 


35 










40 










4 5 








Asp Asn Ala 


Lys 


Thr 


Cys 


Gly 


Ser 


Leu 


Gly 


Arg 


Gly 


Asn 


Glu 


Asn 


Thr 


50 








55 










60 










Met Leu Arg 


Gly 


Gly 


Phe 


Ser 


Met 


Asn 


Thr 


Thr 


Val 


Gly 


Gin 


Gly 


He 


65 






70 










75 










80 


Ser Lys Gin 


Thr 


His 


His 


Thr 


Ser 


Thr 


Thr 


Ser 


Ser 














85 










90 














<210> 267 




























<211> 51 




























<212> PRT 




























<213> Homo ; 


sapiens 
























<400> 267 




























Met Leu Ala 


Lys 


Thr 


Gly 


Val 


His 


His 


Tyr 


Ser 


Gly 


Asn 


Asn 


He 


Glu 


1 




5 










10 










15 




Leu Gly Thr 


Ala 


Cys 


Gly 


Lys 


Tyr 


Tyr 


Arg 


Val 


Cys 


Thr 


Leu 


Ala 


He 




20 










25 










30 






lie Asp Pro 


Gly 


Asp 


Ser 


Asp 


He 


He 


Arg 


Ser 


Met 


Pro 


Glu 


Gin 


Thr 


35 










40 










45 








Gly Glu Lys 




























50 




























<210> 268 




























<211> 99 




























<212> PRT 




























<213> Homo 


sapiens 
























<400> 268 




























Met Lys Lys 


Lys 


Glu 


Glu 


Thr 


Thr 


Leu 


Ser 


Glu 


Met 


Glu 


Pro 


Val 


Glu 


1 




5 










10 










15 




Pro Gin Tyr 


Gin 


Leu 


Val 


Asn 


Ala 


Glu 


Ser 


Thr 


Ser 


Pro 


Phe 


Leu 


His 




20 










25 










30 






Cys Leu Arg 


Glu 


Val 


He 


Gly 


Glu 


Tyr 


Ser 


Val 


His 


Glu 


Phe 


Ser 


Leu 


35 










40 










45 








Leu Gly Lys 


Thr 


Glu 


Ser 


Gin 


Gly 


He 


Gly 


Leu 


Trp 


He 


Ala 


Leu 


Val 


50 








55 










60 










Val Phe Leu 


Ser 


Phe 


Leu 


He 


Phe 


Ser 


Thr 


Ser 


Phe 


Tyr 


He 


Ser 


Asn 


65 






70 










75 










80 
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Ala Glu Gin Pro Phe Phe Lys Glu Pro Pro Thr Glu Ala Ala Lys Glu 
85 90 95 

Leu Ser Leu 

<210> 269 
<211> 128 
<212> PRT 

<213> Homo sapiens 
<400> 269 

Met Ala Gly Asn Leu Leu Ser Gly Ala Gly Arg Arg Leu Trp Asp Trp 

15 10 15 

Val Pro Leu Ala Cys Arg Ser Phe Ser Leu Gly Val Pro Arg Leu lie 

20 25 30 

Gly lie Arg Leu Thr Leu Pro Pro Pro Lys Val Val Asp Arg Trp Asn 

35 40 45 

Glu Lys Arg Ala Met Phe Gly Val Tyr Asp Asn lie Gly lie Leu Gly 

50 55 60 

Asn Phe Glu Lys His Pro Lys Glu Leu lie Arg Gly Pro lie Trp Leu 
65 70 75 80 

Arg Gly Trp Lys Gly Asn Glu Leu Gin Arg Cys lie Arg Lys Arg Lys 

85 90 95 

Met Val Gly Ser Arg Met Phe Ala Asp Asp Leu His Asn Leu Asn Lys 

100 105 110 

Arg lie Arg Tyr Leu Tyr Lys His Phe Asn Arg His Gly Lys Phe Arg 
115 120 125 



<210> 270 
<211> 52 
<212> PRT 

<213> Homo sapiens 



<400> 270 

Met Phe Leu Thr Leu Ala Asp Thr 
1 5 

Leu Leu Asn Val Tyr Glu Gly Glu 
20 

Leu Leu Glu Ser Trp lie lie Val 

35 40 
lie Ser Ser His 
50 



Cys Lys Leu Arg Gly Met Ser Phe 

10 15 

Ala Thr Val Ser Ser Val Leu Glu 
25 30 

Gly Asn Glu Arg Tyr Phe Asp Gly 
45 



<210> 271 
<211> 84 
<212> PRT 

<213> Homo sapiens 



<400> 271 



Met 


Lys 


Val 


Lys 


He 


Lys 


Cys 


Trp 


Asn Gly 


Val 


Ala 


Thr 


Trp 


Leu 


Trp 


1 








5 








10 










15 




Val 


Ala 


Asn 


Asp 


Glu 


Asn 


Cys 


Gly 


He Cys 


Arg 


Met 


Ala 


Phe 


Asn 


Gly 








20 










25 








30 






Cys 


Cys 


Pro 


Asp 


Cys 


Lys 


Val 


Pro 


Gly Asp 


Asp 


Cys 


Pro 


Leu 


Val 


Trp 






35 










40 








45 








Gly 


Gin 


Cys 


Ser 


His 


Cys 


Phe 


His 


Met His 


Cys 


He 


Leu 


Lys 


Trp 


Leu 




50 










55 








60 










His 


Ala 


Gin 


Gin 


Val 


Gin 


Gin 


His 


Cys Pro 


Met 


Cys 


Arg 


Gin 


Glu 


Trp 


65 










70 








75 










80 


Lys 


Phe 


Lys 


Glu 

























<210> 272 
<211> 84 
<212> PRT 
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<213> Homo sapiens 
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<400> 272 



Met Lys Val 


Lys 


lie 


Lys 


Cys 


Trp 


Asn 


Gly 


Val 


Ala 


Thr 


Trp 


Leu 


Trp 


1 




5 










10 










15 




t / 1 TV "1 — » TV ~ „ 

vai A±a Asn 


Asp 


Li -LU 


As n 


Cys 




l JLe 


Cys 


Arg 


Met 


a j. a 


jrne 


Asn 


kd± y 




20 










25 










30 






Cys Cys Pro 


Asp 


Cys 


Lys 


Val 


Pro 


Gly 


Asp 


Asp 


Cys 


Pro 


Leu 


Val 


Trp 


35 










40 










45 








Gly Gin Cys 


Ser 


His 


Cys 


Phe 


His 


Met 


His 


Cys 


He 


Leu 


Lys 


Trp 


Leu 


50 








55 










60 










His Ala Gin 


Gin 


Val 


Gin 


Gin 


His 


Cys 


Pro 


Met 


Cys 


Arg 


Gin 


Glu 


Trp 


65 






70 










75 










80 


Lys Phe Lys 


Glu 


























<210> 273 




























<211> 84 




























<212> PRT 




























<213> Homo , 


sapiens 
























<400> 273 




























Met Lys Val 


Lys 


lie 


Lys 


Cys 


Trp 


Asn 


Gly Val 


Ala 


Thr 


Trp 


Leu 


Trp 


1 




5 










1U 










15 




Val Ala Asn 


Asp 


Glu 


Asn 


Cys 


Gly 


lie 


Cys 


Arg 


Met 


TV 1 -» 

Ala 


Pne 


Asn 


(aly 




20 










25 










30 






Cys Cys Pro 


Asp 


Cys 


Lys 


Val 


Pro 


Gly 


Asp 


Asp 


Cys 


Pro 


Leu 


Val 


Trp 


35 










40 










45 








Gly Gin Cys 


Ser 


His 


Cys 


Phe 


His 


Met 


His 


Cys 


He 


Leu 


Lys 


Trp 


Leu 


50 








55 










60 










His Ala Gin 


Gin 


Val 


Gin 


Gin 


His 


Cys 


Pro 


Met 


Cys 


Arg 


Gin 


Glu 


Trp 


65 






70 










75 










80 


Lys Phe Lys 


Glu 


























<210> 274 




























<211> 84 




























<212> PRT 




























<213> Homo . 


sapiens 
























<400> 274 




























Met Lys Val 


Lys 


He 


Lys 


Cys 


Trp 


Asn 


Gly 


Val 


Ala 


Thr 


Trp 


Leu 


Trp 


1 




5 










10 










15 




vai ax a Asn 


Asp 


Glu 


Asn 


Cys 




ire 


Cys 


Arg 


ixie c. 


AJ_a 


tr ne 


Asn 


ij_L y 




20 










25 










30 






Cys Cys Pro 


Asp 


Cys 


Lys 


Val 


Pro 


Gly 


Asp 


Asp 


Cys 


Pro 


Leu 


Val 


Trp 


35 










40 










45 








Gly Gin Cys 


Ser 


His 


Cys 


Phe 


His 


Met 


His 


Cys 


He 


Leu 


Lys 


Trp 


Leu 


OU 








JJ 










DU 










His Ala Gin 


Gin 


Val 


Gin 


Gin 


His 


Cys 


Pro 


Met 


Cys 


Arg 


Gin 


Glu 


Trp 


65 






70 










75 










80 


Lys Phe Lys 


Glu 


























<210> 275 




























<211> 84 




























<212> PRT 




























<213> Homo 


sapiens 
























<400> 275 




























Met Lys Val 


Lys 


He 


Lys 


Cys 


Trp 


Asn 


Gly 


Val 


Ala 


Thr 


Trp 


Leu 


Trp 


1 




5 










10 










15 




Val Ala Asn 


Asp 


Glu 


Asn 


Cys 


Gly 


He 


Cys 


Arg 


Met 


Ala 


Phe 


Asn 


Gly 




20 










25 










30 






Cys Cys Pro 


Asp 


Cys 


Lys 


Val 


Pro 


Gly 


Asp 


Asp 


Cys 


Pro 


Leu 


Val 


Trp 


35 










40 










45 
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Gly Gin Cys Ser His Cys Phe His Met His Cys lie Leu Lys Trp Leu 

50 55 60 

His Ala Gin Gin Val Gin Gin His Cys Pro Met Cys Arg Gin Glu Trp 
65 70 75 80 

Lys Phe Lys Glu 



<210> 276 
<211> 80 
<212> PRT 

<213> Homo sapiens 



<400> 276 




























Met 


Ala 


Thr 


Pro 


Thr 


Gin 


Thr 


Pro 


Thr 


Lys 


Ala 


Pro 


Glu 


Glu 


Pro 


Asp 


1 








5 










10 










15 




Pro 


Phe 


Tyr 


Tyr 


Asp 


Tyr 


Asn 


Thr 


Val 


Gin 


Thr 


Val 


Gly 


Met 


Thr 


Leu 








20 










25 










30 






Ala 


Thr 


He 


Leu 


Phe 


Leu 


Leu 


Gly 


He 


Leu 


He 


Val 


He 


Ser 


Lys 


Lys 






35 










40 










45 








Val 


Lys 


Cys 


Arg 


Lys 


Ala 


Asp 


Ser 


Arg 


Ser 


Glu 


Ser 


Pro 


Thr 


Cys 


Lys 




50 










55 










60 










Ser 


Cys 


Lys 


Ser 


Glu 


Leu 


Pro 


Ser 


Ser 


Ala 


Pro 


Gly 


Gly 


Gly 


Gly 


Val 



65 70 75 80 



<210> 277 
<211> 127 
<212> PRT 

<213> Homo sapiens 



<400> 277 

Met Ser Phe Ser 

1 

Ala Phe Met Lys 
20 

Lys Asp He Lys 
35 

Lys Phe Thr He 
50 

Val Gly Glu Glu 
65 

Thr Val Val Gin 

Asn He Lys Ser 
100 

Met Thr Leu Gly 
115 



Gly 


Lys 


Tyr 


Gin 


5 








Ala 


He 


Gly 


Leu 


Gly 


Val 


Ser 


Glu 








40 


Thr 


Ala 


Gly 


Ser 






55 




Cys 


Glu 


Leu 


Glu 




70 






Leu 


Glu 


Gly 


Asp 


85 








Val 


Thr 


Glu 


Leu 


Asp 


He 


Val 


Phe 








120 



Leu Gin Ser Gin 
10 

Pro Glu Glu Leu 
25 

He Val Gin Asn 

Lys Val He Gin 
60 

Thr Met Thr Gly 
75 

Asn Lys Leu Val 
90 

Asn Gly Asp He 
105 

Lys Arg He Ser 



Glu Asn Phe Glu 
15 

He Gin Lys Gly 
30 

Gly Lys His Phe 
4 5 

Asn Glu Phe Thr 

Glu Lys Val Lys 
80 

Thr Thr Phe Lys 
95 

lie Thr Asn Thr 

110 
Lys Arg He 
125 



<210> 278 
<211> 87 
<212> PRT 

<213> Homo sapiens 



<400> 278 

Asp Ala Val Ala Val 
1 5 
Pro Ala Asp Ala Leu 
20 

Gly Ala Phe Tyr Asn 
35 

Tyr Ser Ser Ala Ser 
50 

Lys Ser Gin Ser Ser 
65 

Gly Lys Gin Glu Glu 



Gly Lys Phe Ser Gin Trp 
10 

Cys Asp Gly Lys Lys Ala 
25 

Glu Leu Arg Ala Leu Glu 
40 

Thr Val Val Leu Leu Trp 
55 

Arg Asn Ser Gin Cys Ser 
70 75 
His Glu 



Pro Pro Glu Arg Ser 
15 

Ala Gly Pro Thr Ser 
30 

Ser Leu Pro Gly Asn 
45 

He Ala Ser Arg Gin 
60 

Arg Lys Asp Lys Gly 
80 
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85 



172 
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<210> 279 
<211> 96 
<212> PRT 

<213> Homo sapiens 



<400> 279 



Met 


Pro 


Gly 


Pro 


Thr 


Pro 


Ser 


Gly 


Thr 


Asn 


Val 


Gly 


Ser 


Ser 


Gly Arg 


1 








5 










10 










15 




Ser 


Pro 


Ser 


Lys 
20 


Ala 


Val 


Ala 


Ala 


Arg 

25 


Ala 


Ala 


Gly 


Ser 


Thr 
30 


Val 


Arg 


Gin 


Arg 


Lys 
35 


Asn 


Ala 


Ser 


Cys 


Gly 
40 


Thr 


Arg 


Ser 


Ala 


Gly 
45 


Arg 


Thr 


Thr 


Ser 


Ala 
50 


Gly 


Thr 


Gly 


Gly 


Met 
55 


Trp 


Arg 


Phe 


Tyr 


Thr 
60 


Glu 


Asp 


Ser 


Pro 


Gly 


Leu 


Lys 


Val 


Gly 


Pro 


Val 


Pro 


Val 


Leu 


Val 


Met 


Ser 


Leu 


Leu 


Phe 


65 










70 










75 










80 


He 


Ala 


Ser 


Val 


Phe 
85 


Met 


Leu 


His 


He 


Trp 
90 


Gly 


Lys 


Tyr 


Thr 


Arg 
95 


Ser 



<210> 280 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<400> 280 

Met Ser Ser Trp Ser Leu Met Cys 
1 5 

Ala Arg Leu Leu Tyr Glu Ala Asn 
20 

Gin Thr Asp Gin Ala Val Glu Phe 

35 40 
Lys Lys Tyr Tyr Glu Lys Met Pro 
50 55 



Met Gin Asp Met Gly Asn Thr Lys 

10 15 
Leu Pro Glu Asn Phe Arg Arg Pro 
25 30 
Phe He Arg Asp Lys Tyr Glu Lys 
45 



<210> 281 
<211> 56 
<212> PRT 

<213> Homo sapiens 



<400> 281 

Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met Gly Asn Thr Lys 

1 5 " 10 ' 15 

Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn Phe Arg Arg Pro 

20 25 30 

Gin Thr Asp Gin Ala Val Glu Phe Phe He Arg Asp Lys Tyr Glu Lys 

35 40 45 

Lys Lys Tyr Tyr Glu Lys Met Pro 
50 55 



<210> 282 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<400> 282 

Met Ser Ser Trp Ser Leu Met Cys 
1 5 

Ala Arg Leu Leu Tyr Glu Ala Asn 
20 

Gin Thr Asp Gin Ala Val Glu Phe 
35 40 



Met Gin Asp Met Gly Asn Thr Lys 

10 " 15 
Leu Pro Glu Asn Phe Arg Arg Pro 
25 30 
Phe He Arg Asp Lys Tyr Glu Lys 
45 
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Lys Lys Tyr Tyr Glu Lys Met Pro 
50 ^ 55 

<210> 283 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<400> 283 

Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met Gly Asn Thr Lys 

15 10 15 

Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn Phe Arg Arg Pro 

20 25 30 

Gin Thr Asp Gin Ala Val Glu Phe Phe lie Arg Asp Lys Tyr Glu Lys 

35 40 45 

Lys Lys Tyr Tyr Glu Lys Met Pro 
50 55 

<210> 284 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<400> 284 

Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met Gly Asn Thr Lys 

1 5 10 15 

Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn Phe Arg Arg Pro 

20 25 30 

Gin Thr Asp Gin Ala Val Glu Phe Phe lie Arg Asp Lys Tyr Glu Lys 

35 4 0 4 5 

Lys Lys Tyr Tyr Glu Lys Met Pro 
50 55 

<210> 285 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<400> 285 

Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met Gly Asn Thr Lys 

1 5 10 ~ 15 

Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn Phe Arg Arg Pro 

20 25 30 

Gin Thr Asp Gin Ala Val Glu Phe Phe lie Arg Asp Lys Tyr Glu Lys 

35 40 45 

Lys Lys Tyr Tyr Glu Lys Met Pro 
50 55 

<210> 286 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<400> 286 

Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met Gly Asn Thr Lys 

15 10 15 

Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn Phe Arg Arg Pro 

20 25 30 

Gin Thr Asp Gin Ala Val Glu Phe Phe lie Arg Asp Lys Tyr Glu Lys 

35 40 45 

Lys Lys Tyr Tyr Glu Lys Met Pro 
50 55 
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<210> 287 

<211> 56 

<212> PRT 

<213> Homo sapiens 



<400> 287 

Met Ser Ser Trp 

1 

Ala Arg Leu Leu 

20 

Gin Thr Asp Gin 
35 

Lys Lys Tyr Tyr 
50 



Ser Leu Met Cys 

5 

Tyr Glu Ala Asn 

Ala Val Glu Phe 
40 

Glu Lys Met Pro 

55 



Met Gin Asp Met 
10 

Leu Pro Glu Asn 
25 

Phe lie Arg Asp 



Gly Asn Thr Lys 
15 

Phe Arg Arg Pro 
30 

Lys Tyr Glu Lys 
4 5 



<210> 288 
<211> 56 
<212> PRT 

<213> Homo sapiens 



<400> 288 

Met Ser Ser Trp 

1 

Ala Arg Leu Leu 
20 

Gin Thr Asp Gin 
35 

Lys Lys Tyr Tyr 
50 



Ser Leu Met Cys 
5 

Tyr Glu Ala Asn 

Ala Val Glu Phe 
40 

Glu Lys Met Pro 
55 



Met Gin Asp Met 
10 

Leu Pro Glu Asn 
25 

Phe lie Arg Asp 



Gly Asn Thr Lys 
15 

Phe Arg Arg Pro 
30 

Lys Tyr Glu Lys 
4 5 



<210> 289 

<211> 56 

<212> PRT 

<213> Homo sapiens 



<400> 289 

Met Ser Ser Trp 

1 

Ala Arg Leu Leu 
20 

Gin Thr Asp Gin 
35 

Lys Lys Tyr Tyr 
50 



Ser Leu Met Cys 
5 

Tyr Glu Ala Asn 

Ala Val Glu Phe 
40 

Glu Lys Met Pro 
55 



Met Gin Asp Met 
10 

Leu Pro Glu Asn 
25 

Phe lie Arg Asp 



Gly Asn Thr Lys 
15 

Phe Arg Arg Pro 
30 

Lys Tyr Glu Lys 
45 



<210> 290 
<211> 56 
<212> PRT 

<213> Homo sapiens 



<400> 290 

Met Ser Ser Trp 

1 

Ala Arg Leu Leu 
20 

Gin Thr Asp Gin 
35 

Lys Lys Tyr Tyr 
50 



Ser Leu Met Cys 
5 

Tyr Glu Ala Asn 

Ala Val Glu Phe 
40 

Glu Lys Met Pro 
55 



Met Gin Asp Met 
10 

Leu Pro Glu Asn 
25 

Phe lie Arg Asp 



Gly Asn Thr Lys 
15 

Phe Arg Arg Pro 
30 

Lys Tyr Glu Lys 
45 



<210> 291 
<211> 56 
<212> PRT 



WO 02/083898 

<213> Homo sapiens 



175 
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<400> 291 

Met Ser Ser Trp 

1 

Ala Arg Leu Leu 
20 

Gin Thr Asp Gin 
35 

Lys Lys Tyr Tyr 
50 



Ser Leu Met Cys 
5 

Tyr Glu Ala Asn 

Ala Val Glu Phe 
40 

Glu Lys Met Pro 
55 



Met Gin Asp Met 
10 

Leu Pro Glu Asn 
25 

Phe lie Arg Asp 



Gly Asn Thr Lys 
15 

Phe Arg Arg Pro 
30 

Lys Tyr Glu Lys 
45 



<210> 292 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<400> 292 

Met Ser Ser Trp Ser Leu Met Cys 
1 5 

Ala Arg Leu Leu Tyr Glu Ala Asn 
20 

Gin Thr Asp Gin Ala Val Glu Phe 

35 40 
Lys Lys Tyr Tyr Glu Lys Met Pro 
50 55 



Met Gin Asp Met Gly Asn Thr Lys 

10 15 
Leu Pro Glu Asn Phe Arg Arg Pro 
25 30 
Phe lie Arg Asp Lys Tyr Glu Lys 
45 



<210> 293 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<400> 293 

Met Ser Ser Trp Ser Leu Met Cys 
1 5 

Ala Arg Leu Leu Tyr Glu Ala Asn 
20 

Gin Thr Asp Gin Ala Val Glu Phe 

35 40 
Lys Lys Tyr Tyr Glu Lys Met Pro 
50 55 



Met Gin Asp Met Gly Asn Thr Lys 

10 ~ 15 

Leu Pro Glu Asn Phe Arg Arg Pro 
25 30 
Phe lie Arg Asp Lys Tyr Glu Lys 
45 



<210> 294 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<400> 294 

Met Ser Ser Trp Ser Leu Met Cys 
1 5 

Ala Arg Leu Leu Tyr Glu Ala Asn 
20 

Gin Thr Asp Gin Ala Val Glu Phe 

35 40 
Lys Lys Tyr Tyr Glu Lys Met Pro 
50 55 



Met Gin Asp Met Gly Asn Thr Lys 

10 15 
Leu Pro Glu Asn Phe Arg Arg Pro 
25 30 
Phe lie Arg Asp Lys Tyr Glu Lys 
45 



<210> 295 
<211> 122 
<212> PRT 

<213> Homo sapiens 



<220> 
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<221> UNSURE 
<222> 113 

<223> Xaa = Ala, Asp 



<400> 295 



Met 


Glu 


Ala 


Ser 


Ala 


1 








5 


Val 


Val 


Arg 


Val 


Ala 








20 




lie 


Ala 


Thr 


Val 


Val 






35 






Met 


Val 


Leu 


Ser 


Ala 




50 








Ser 


He 


Ala 


Ala 


Lys 


65 










Gly 


Val 


Ala 


Ser 


Gly 










85 


Thr 


Gly 


Leu 


Ser 


Gly 








100 




Xaa 


He 


Ala 


Ala 


Val 



Leu Thr Ser Ser Ala 
10 

Ser Gly Ser Ala Val 
25 

He Gly Gly Val Val 
40 

Met Gly Phe Thr Ala 
55 

Met Met Ser Ala Ala 
70 

Ser Leu Val Ala Thr 
90 

Leu Thr Lys Phe He 
105 

He Ala Arg Phe Tyr 
120 



Val Thr Ser Val Ala Lys 
15 

Val Leu Pro Leu Ala Arg 
30 

Ala Met Ala Ala Val Pro 
45 

Ala Gly He Ala Ser Ser 
60 

Ala He Ala Asn Gly Gly 
75 80 
Leu Gin Ser Leu Gly Ala 
95 

Leu Gly Ser He Gly Ser 
110 



<210> 296 
<211> 122 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> UNSURE 

<222> 79 

<223> Xaa = Gly, Arg 



<400> 296 

Met Glu Ala Ser 

1 

Val Val Arg Val 
20 

He Ala Thr Val 
35 

Met Val Leu Ser 
50 

Ser He Ala Ala 
65 

Gly Val Ala Ser 

Thr Gly Leu Ser 
100 

Ala He Ala Ala 
115 



Ala Leu Thr Ser 
5 

Ala Ser Gly Ser 

Val He Gly Gly 
40 

Ala Met Gly Phe 
55 

Lys Met Met Ser 
70 

Gly Ser Leu Val 
85 

Gly Leu Thr Lys 

Val He Ala Arg 
120 



Ser Ala Val Thr 
10 

Ala Val Val Leu 
25 

Val Val Ala Met 

Thr Ala Ala Gly 
60 

Ala Ala Ala He 
75 

Ala Thr Leu Gin 
90 

Phe He Leu Gly 
105 

Phe Tyr 



Ser Val Ala Lys 
15 

Pro Leu Ala Arg 
30 

Ala Ala Val Pro 
45 

He Ala Ser Ser 

Ala Asn Xaa Gly 
80 

Ser Leu Gly Ala 
95 

Ser He Gly Ser 
110 



<210> 297 

<211> 74 

<212> PRT 

<213> Homo sapiens 

<400> 297 

Met Ala Ser Ser Thr Val Pro Val Ser Ala Ala Gly Ser Ala Asn Glu 

15 10 15 

Thr Pro Glu He Pro Asp Asn Val Gly Asp Trp Leu Arg Gly Val Tyr 

20 25 30 

Arg Phe Ala Thr Asp Arg Asn Asp Phe Arg Arg Asn Leu He Leu Asn 

35 4 0 4 5 

Leu Gly Leu Phe Ala Ala Gly Val Trp Leu Ala Arg Asn Leu Ser Asp 
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50 










55 










60 










lie 


Asp 


Leu 


Met 


Ala 


Pro 


Gin 


Pro 


Gly 


Val 














65 










70 






















<210> 298 




























<211> 74 




























<212> PRT 




























<213> Homo sapiens 
























<400> 298 




























Met 


Ala 


Ser 


Ser 


Thr 


Val 


Pro 


Val 


Ser 


Ala 


Ala 


Gly 


Ser 


Ala 


Asn 


Glu 


1 








5 










10 










15 




Thr 


Pro 


Glu 


He 


Pro 


Asp 


Asn 


Val 


Gly 


Asp 


Trp 


Leu 


Arg 


Gly 


Val 


Tyr 








20 










25 










30 






Arg 


Phe 


Ala 


Thr 


Asp 


Arg 


Asn 


Asp 


Phe 


Arg 


Arg 


Asn 


Leu 


He 


Leu 


Asn 






35 










40 










45 








Leu 


Gly 


Leu 


Phe 


Ala 


Ala 


Gly 


Val 


Trp 


Leu 


Ala 


Arg 


Asn 


Leu 


Ser 


Asp 




50 










55 










60 










lie 


Asp 


Leu 


Met 


Ala 


Pro 


Gin 


Pro 


Gly 


Val 














65 










70 






















<210> 299 




























<211> 147 




























<212> PRT 




























<213> Homo sapiens 
























<400> 299 




























Met 


Asp 


Val 


Leu 


Val 


Ser 


Glu 


Cys 


Ser 


Ala 


Arg 


Leu 


Leu 


Gin 


Gin 


Glu 


1 








5 










10 










15 




Glu 


Glu 


He 


Lys 


Ser 


Leu 


Thr 


Ala 


Glu 


He 


Asp 


Arg 


Leu 


Lys 


Asn 


Cys 








20 










25 










30 






Gly 


Cys 


Leu 


Gly 


Ala 


Ser 


Pro 


Asn 


Leu 


Glu 


Gin 


Leu 


Gin 


Glu 


Glu 


Asn 






35 










40 










45 








Leu 


Lys 


Leu 


Lys 


Tyr 


Arg 


Leu 


Asn 


He 


Leu 


Arg 


Lys 


Ser 


Leu 


Gin 


Ala 




50 










55 










60 










Glu 


Arg 


Asn 


Lys 


Pro 


Thr 


Lys 


Asn 


Met 


He 


Asn 


He 


He 


Ser 


Arg 


Leu 


65 










70 










75 










80 


Gin 


Glu 


Val 


Phe 


Gly 


His 


Ala 


He 


Lys 


Ala 


Ala 


Tyr 


Pro 


Asp 


Leu 


Glu 










85 










90 










95 




Asn 


Pro 


Pro 


Leu 


Leu 


Val 


Thr 


Pro 


Ser 


Gin 


Gin 


Ala 


Lys 


Phe 


Gly 


Asp 








100 










105 










110 






Tyr 


Gin 


Cys 


Asn 


Ser 


Ala 


Met 


Gly 


He 


Ser 


Gin 


Val 


Met 


Tyr 


Cys 


His 






115 










120 










125 








Asp 


Ser 


Trp 


Leu 


Phe 


Asp 


Phe 


Phe 


Lys 


Tyr 


Tyr 


Tyr 


His 


His 


Cys 


His 




130 










135 










140 










Leu 


Gin 


Lys 




























145 
































<210> 300 




























<211> 97 




























<212> PRT 




























<213> Homo ; 


sapiens 
























<400> 300 




























Met 


Ser 


Ser 


Ala 


Pro 


Glu 


Pro 


Pro 


Thr 


Phe 


Lys 


Lys 


Glu 


Pro 


Pro 


Lys 


1 








5 










10 










15 




Glu 


Lys 


Glu 


Phe 


Gin 


Ser 


Pro 


Gly 


Leu 


Arg 


Gly 


Val 


Arg 


Thr 


Thr 


Thr 








20 










25 










30 






Leu 


Phe 


Arg 


Ala 


Val 


Asn 


Pro 


Glu 


Leu 


Phe 


He 


Lys 


Pro 


Asn 


Lys 


Pro 






35 










40 










45 








Val 


Met 


Ala 


Phe 


Gly 


Leu 


Val 


Thr 


Leu 


Ser 


Leu 


Cys 


Val 


Ala 


Tyr 


He 




50 










55 










60 










Gly 


Tyr 


Leu 


His 


Ala 


He 


Gin 


Glu 


Asn 


Lys 


Lys 


Asp 


Leu 


Tyr 


Glu 


Ala 
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65 70 75 80 

He Asp Ser Glu Gly His Ser Tyr Met Arg Arg Lys Thr Ser Lys Trp 
85 " 90 95 

Asp 

<210> 301 
<211> 98 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 30 

<223> Xaa = Lys,Asn 
<400> 301 

Met Ala Ala Glu Pro Leu Thr Glu Leu Glu Glu Ser He Glu Asn Val 

15 10 15 

Val Thr Thr Phe Phe Thr Phe Ala Arg Gin Glu Gly Arg Xaa Asp Ser 

20 25 30 

Leu Ser Val Asn Glu Phe Lys Glu Leu Val Thr Gin Gin Leu Pro His 

35 40 . 45 

Leu Leu Lys Asp Val Gly Ser Leu Asp Glu Lys Met Lys Ser Leu Asp 

50 " ~ 55 60 

Val Asn Gin Asp Ser Glu Leu Lys Phe Asn Glu Tyr Trp Arg Leu He 
65 70 J 75 80 

Gly Glu Leu Ala Lys Glu He Arg Lys Lys Lys Asp Leu Lys He Arg 
85 " 90 95 

Lys Lys 

<210> 302 
<211> 115 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 103 

<223> Xaa = Asp, Tyr 
<220> 

<221> UNSURE 
<222> 100 

<223> Xaa = Gly, Ser 



<400> 302 



Met 


Val 


Ala 


Ala 


Lys 


Lys 


Thr 


Lys 


Lys 


Ser 


Leu 


Glu 


Ser 


He 


Lys 


Ser 


1 








5 










10 










15 




Arg 


Leu 


Gin 


Leu 
20 


Val 


Met 


Lys 


Ser 


Gly 
25 


Lys 


Tyr 


Val 


Leu 


Gly 
30 


Tyr 


Lys 


Gin 


Thr 


Leu 
35 


Lys 


Met 


He 


Arg 


Gin 
40 


Gly 


Lys 


Ala 


Lys 


Leu 
45 


Val 


He 


Leu 


Ala 


Asn 
50 


Asn 


Cys 


Pro 


Ala 


Leu 
55 


Arg 


Lys 


Ser 


Glu 


He 
60 


Glu 


Tyr 


Tyr 


Ala 


Met 


Leu 


Ala 


Lys 


Thr 


Gly 


Val 


His 


His 


Tyr 


Ser 


Gly Asn 


Asn 


He 


Glu 


65 










70 










75 










80 


Leu 


Gly 


Thr 


Ala 


Cys 
85 


Gly 


Lys 


Tyr 


Tyr 


Arg 
90 


Val 


Cys 


Thr 


Leu 


Ala 
95 


He 


He 


Asp 


Pro 


Xaa 
100 


Asp 


Ser 


Xaa 


He 


He 
105 


Arg 


Ser 


Met 


Pro 


Glu 
110 


Gin 


Thr 



Gly Glu Lys 
115 
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<210> 303 
<211> 102 
<212> PRT 

<213> Homo sapiens 



Met 


Ala 


Gly 


Gin 


Ala 


Phe 


Arg 


Lys 


Phe 


Leu 


Pro 


Leu 


Phe 


Asp 


Arg 


Val 


1 








5 










10 










15 




Leu 


Val 


Glu 


Arg 
20 


Ser 


Pro 


Ala 


Glu 


Thr 
25 


Val 


Thr 


Lys 


Gly 


Gly 
30 


He 


Met 


Leu 


Pro 


Glu 

35 


Lys 


Ser 


Gin 


Gly 


Lys 
40 


Val 


Leu 


Gin 


Ala 


Thr 
45 


Val 


Val 


Ala 


Val 


Gly 
50 


Ser 


Gly 


Ser 


Lys 


Gly 
55 


Lys 


Gly 


Gly 


Glu 


He 
60 


Gin 


Pro 


Val 


Ser 


Val 


Lys 


Val 


Gly 


Asp 


Lys 


Val 


Leu 


Leu 


Pro 


Glu 


Tyr 


Gly 


Gly 


Thr 


Lys 


65 










70 










75 










80 


Val 


Val 


Leu 


Asp 


Asn 
85 


Lys 


Asp 


Tyr 


Phe 


Leu 
90 


Phe 


Arg 


Asp 


Gly 


Asp 
95 


He 


Leu 


Gly 


Lys 


Tyr 
100 


Val 


Asp 























<210> 304 
<211> 102 
<212> PRT 

<213> Homo sapiens 



<400> 304 

Met Ala Gly Gin 

1 

Leu Val Glu Arg 
20 

Leu Pro Glu Lys 
35 

Val Gly Ser Gly 
50 

Val Lys Val Gly 
65 

Val Val Leu Asp 

Leu Gly Lys Tyr 
100 



Ala Phe Arg Lys 
5 

Ser Ala Ala Glu 

Ser Gin Gly Lys 
40 

Ser Lys Gly Lys 
55 

Asp Lys Val Leu 
70 

Asp Lys Asp Tyr 
85 

Val Asp 



Phe Leu Pro Leu 
10 

Thr Val Thr Lys 
25 

Val Leu Gin Ala 

Gly Gly Glu He 
60 

Leu Pro Glu Tyr 
75 

Phe Leu Phe Arg 
90 



Phe Asp Arg Val 
15 

Gly Gly He Met 
30 

Thr Val Val Ala 
45 

Gin Pro Val Ser 

Gly Gly Thr Lys 
80 

Asp Gly Asp lie 
95 



<210> 305 
<211> 116 
<212> PRT 

<213> Homo sapiens 



<400> 305 

Met Ser Ala Thr 

1 

Gly Gly Pro Pro 
20 

Gin Gin Thr Gin 
35 

Asn Val Asp Lys 
50 

Asp Arg Ala Asp 
65 

Ala Ala Lys Leu 

He He Leu Gly 
100 

Tyr Phe Ser Thr 



Ala Ala Thr Ala 
5 

Ala Pro Pro Pro 

Ala Gin Val Asp 
40 

Val Leu Glu Arg 
55 

Ala Leu Gin Ala 
70 

Lys Arg Lys Tyr 
85 

Val He Cys Ala 



Pro Pro Ala Ala 
10 

Asn Leu Thr Ser 
25 

Glu Val Val Asp 

Asp Gin Lys Leu 
60 

Gly Pro Ser Gin 
75 

Trp Trp Lys Asn 
90 

He He Leu He 
105 



Pro Ala Gly Glu 
15 

Asn Arg Arg Leu 
30 

He Met Arg Val 
45 

Ser Glu Leu Asp 

Phe Glu Thr Ser 
80 

Leu Lys Met Met 
95 

He lie lie Val 
110 
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<210> 306 

<211> 97 

<212> PRT 

<213> Homo sapiens 



<400> 306 



Met Ser 


Leu 


Ser 


Leu 


Val 


Phe 


Arg 


Ala 


Ala 


Ser 


Tyr 


Phe 


Lys 


Leu 


Val 


1 






5 










10 










15 




Pro Phe 


His 


Ser 


Ser 


Ser 


Ser 


Asn 


Gin 


Phe 


Leu 


Gin 


Pro 


Pro 


Gly 


Trp 






20 










25 










30 






Val Val 


Leu 


Thr 


Gin 


Thr 


Leu 


Val 


Leu 


Leu 


His 


Phe 


Glu 


Arg 


Phe 


Ser 




35 










40 










45 








Tyr Gin 


Asn 


Val 


Pro 


Lys 


Ser 


Ala 


Gin 


Gly 


Lys 


Gly 


Asn 


Leu 


Gin 


Pro 


50 










55 










60 










Glu Thr 


Asn 


He 


His 


Leu 


Phe 


His 


Phe 


Leu 


Thr 


Phe 


Pro 


Lys 


Gin 


He 


65 








70 










75 










80 


Ser Arg 


Asn 


Leu 


Phe 


Asn 


Ser 


Leu 


Leu 


Cys 


Leu 


Met 


Cys 


Leu 


Thr 


Tyr 








85 










90 










95 




Phe 






























<210> 307 




























<211> 110 




























<212> PRT 




























<213> Homo sapiens 
























<400> 307 




























Met Ser 


Met 


Thr 


Asp 


Leu 


Leu 


Asn 


Ala 


Glu 


Asp 


He 


Lys 


Lys 


Ala 


Val 


1 






5 










10 










15 




Gly Ala 


Phe 


Ser 


Ala 


Thr 


Asp 


Ser 


Phe 


Asp 


His 


Lys 


Lys 


Phe 


Phe 


Gin 






20 










25 










30 






Met Val 


Gly 


Leu 


Lys 


Lys 


Lys 


Ser 


Ala 


Asp 


Asp 


Val 


Lys 


Lys 


Val 


Phe 




35 










40 










45 








His Met 


Leu 


Asp 


Lys 


Asp 


Lys 


Ser 


Gly 


Phe 


He 


Glu 


Glu 


Asp 


Glu 


Leu 


50 










55 










60 










Gly Phe 


He 


Leu 


Lys 


Gly 


Phe 


Ser 


Pro 


Asp 


Ala 


Arg 


Asp 


Leu 


Ser 


Ala 


65 








70 










75 










80 


Lys Glu 


Thr 


Lys 


Met 


Leu 


Met 


Ala 


Ala 


Gly Asp 


Lys 


Asp 


Gly 


Asp 


Gly 








85 










90 










95 




Lys lie 


Gly 


Val 


Asp 


Glu 


Phe 


Ser 


Thr 


Leu 


Val 


Ala 


Glu 


Ser 










100 










105 










110 







<210> 308 
<211> 110 
<212> PRT 

<213> Homo sapiens 



<400> 308 



Met 


Ser 


Met 


Thr 


Asp 


Leu 


Leu 


Asn 


Ala 


Glu 


Asp 


He 


Lys 


Lys 


Ala 


Val 


1 








5 










10 










15 




Gly 


Ala 


Phe 


Ser 


Ala 


Thr 


Asp 


Ser 


Phe 


Asp 


His 


Lys 


Lys 


Phe 


Phe 


Gin 






20 










25 










30 






Met 


Val 


Gly 

35 


Leu 


Lys 


Lys 


Lys 


Ser 
40 


Ala 


Asp 


Asp 


Val 


Lys 
45 


Lys 


Val 


Phe 


His 


Met 
50 


Leu 


Asp 


Lys 


Asp 


Lys 
55 


Ser 


Gly 


Phe 


He 


Glu 
60 


Glu 


Asp 


Glu 


Leu 


Gly 


Phe 


He 


Leu 


Lys 


Gly 


Phe 


Ser 


Pro 


Asp 


Ala 


Arg 


Asp 


Leu 


Ser 


Ala 


65 










70 










75 










80 


Lys 


Glu 


Thr 


Lys 


Met 
85 


Leu 


Met 


Ala 


Ala 


Gly 
90 


Asp 


Lys 


Asp 


Gly 


Asp 
95 


Gly 


Lys 


He 


Gly 


Val 
100 


Asp 


Glu 


Phe 


Ser 


Thr 
105 


Leu 


Val 


Ala 


Glu 


Ser 
110 
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<210> 309 
<211> 110 
<212> PRT 

<213> Homo sapiens 



<400> 309 

Met Ser Met Thr 

1 

Gly Ala Phe Ser 
20 

Met Val Gly Leu 
35 

His Met Leu Asp 
50 

Gly Phe lie Leu 
65 

Lys Glu Thr Lys 

Lys lie Gly Val 
100 



Asp Leu Leu Asn 

5 

Ala Thr Asp Ser 

Lys Lys Lys Ser 
40 

Lys Asp Lys Ser 
55 

Lys Gly Phe Ser 
70 

Met Leu Met Ala 
85 

Asp Glu Phe Ser 



Ala Glu Asp lie 
10 

Phe Asp His Lys 
25 

Ala Asp Asp Val 

Gly Phe He Glu 
60 

Pro Asp Ala Arg 
75 

Ala Gly Asp Lys 
90 

Thr Leu Val Ala 
105 



Lys 


Lys 


Ala 


Val 






15 




Lys 


Phe 


Phe 


Gin 




30 






Lys 


Lys 


Val 


Phe 


45 








Glu 


Asp 


Glu 


Leu 


Asp 


Leu 


Ser 


Ala 








80 


Asp 


Gly 


Asp 


Gly 






95 




Glu 


Ser 








110 







<210> 310 
<211> 112 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> UNSURE 

<222> 8,33 

<223> Xaa = Asp, Glu 



<400> 310 




Met 


Asp 


Pro 


Arg 


1 








Lys 


Gin 


Asp 


Pro 








20 


Xaa 


Trp 


Val 


Glu 






35 




Ala 


Lys 


Ser 


Glu 




50 






Val 


Glu 


Glu 


Asp 


65 








Leu 


Glu 


He 


Asp 


Gin 


Glu 


Met 


Gly 



100 



Lys Val Asn Xaa 
5 

Ser Val Leu His 

Ser Met Gly Gly 
40 

Glu Asn Thr Lys 
55 

Leu Lys Ala Asp 
70 

Lys Glu Gly Val 
85 

Asp Glu Asn Ala 



Leu Arg Ala Phe 
10 

Thr Glu Glu Met 
25 

Lys Val Pro Pro 

Glu Glu Lys Pro 
60 

Glu Pro Ser Ser 
75 

He Glu Pro Asp 
90 

Glu He Thr Ala 
105 



Val Lys Met Cys 
15 

Arg Phe Leu Arg 
30 

Ala Thr Gin Lys 
45 

Asp Ser Lys Lys 

Glu Glu Ser Asp 
80 

Thr Asp Ala Pro 
95 

Gin His Phe Leu 
110 



<210> 311 
<211> 118 
<212> PRT 
<213> Homo sapiens 

<400> 311 

Met Val Leu Leu Glu Ser Glu Gin 
1 5 

Phe Gin Lys Cys Arg Thr Ser Gly 
20 

Tyr Asp Gly Arg Thr Lys Pro He 

35 40 
Phe Glu Pro Ala Asp Asn Lys Cys 

50 55 
Lys Lys He Ser Thr Val Val Ser 



Phe Leu Thr Glu Leu Thr Arg Leu 

10 15 
Ser Val Tyr He Thr Leu Lys Lys 
25 30 
Pro Lys Lys Gly Thr Val Glu Gly 
45 

Leu Leu Arg Ala Thr Asp Gly Lys 
60 

Ser Lys Glu Val Asn Lys Phe Gin 
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65 70 75 80 

Met Ala Tyr Ser Asn Leu Leu Arg Ala Asn Met Asp Gly Leu Lys Lys 

85 " 90 95 

Arg Asp Lys Lys Asn Glu Thr Lys Lys Thr Lys Ala Ala Ala Ala Ala 

100 105 110 

Ala Ala Thr Ala Ala Gin 
115 



<210> 312 
<211> 109 
<212> PRT 

<213> Homo sapiens 
<400> 312 

Met Asp Ser Leu Pro Ser Gly Lys lie His Arg Lys Val Lys lie lie 

15 10 15 

Leu Gly Arg Asn Arg Lys Glu Asn Leu Glu Pro Asn Ala Glu Phe Asp 

20 25 30 

Lys Arg Thr Glu Phe lie Thr Gin Glu Glu Asn Arg lie Cys Ser Ser 

35 40 45 

Pro Val Gin Ser Leu Leu Asp Leu Phe Gin Thr Ser Glu Glu Lys Ser 

50 55 60 

Glu Phe Leu Gly Phe Thr Ser Tyr Thr Glu Lys Ser Gly lie Cys Asn 
65 70 75 80 

Val Leu Asp lie Trp Glu Glu Glu Asn Ser Asp Asn Leu Leu Thr Ala 

85 90 95 

Phe Phe Ser Ser Pro Ser Thr Ser Thr Phe Thr Gly Phe 
100 105 



<210> 313 
<211> 103 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> unsure 
<222> 99 

<223> Xaa = Arg, Thr 
<220> 

<221> unsure 
<222> 103 

<223> Xaa = His, Asn 



<400> 313 




Met 


Ser 


Gly 


Asp 


1 








Lys 


Val 


Lys 


Lys 








20 


Asp 


Ala 


Trp 


Ser 






35 




Lys 


Ala 


Arg 


Lys 




50 






Gly 


Arg 


Phe 


Trp 


65 








Tyr 


Phe 


Phe 


Leu 


Gin 


Val 


Xaa 


Asn 



100 



Gly Ala Thr Glu 
5 

Ala Glu Lys Lys 

lie Leu lie Arg 
40 

Thr Tyr Glu Arg 
55 

Lys Leu Tyr lie 
70 

Tyr Gin Tyr Cys 
85 

lie Ala Xaa 



Gin Ala Ala Glu 
10 

Leu Glu Glu Asn 
25 

Glu Ala Gin Asn 

Leu Val Ala Gin 
60 

Glu Ala Glu Val 
75 

Ser lie His Cys 
90 



Tyr Val Pro Glu 
15 

Pro Tyr Asp Leu 
30 

Gin Pro lie Asp 
45 

Phe Pro Ser Ser 

Thr lie Leu Phe 
80 

Ser Asp Arg Lys 
95 



<210> 314 
<211> 72 
<212> PRT 
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<400> 314 



Met Ser Gly 


Asp 


Gly 


Ala 


Thr 


Glu 


Gin 


Ala 


Ala 


Glu 


Tyr 


Val 


Ser 


Glu 


1 




5 










10 










15 




Lys Val Lys 


Lys 


Ala 


Glu 


Lys 


Lys 


Leu 


Glu 


Glu 


Asn 


Pro 


Tyr 


Asp 


Leu 




20 










25 










30 






Asp Ala Trp 


Ser 


He 


Leu 


He 


Arg 


Glu 


Ala 


Gin 


Asn 


Gin 


Pro 


He 


Asp 


35 










40 










45 








Lys Ala Arg 


Lys 


Thr 


Tyr 


Glu 


Arg 


Leu 


Val 


Ala 


Gin 


Phe 


Pro 


Ser 


Ser 


50 








55 










60 










Gly Arg Phe 


Trp 


Lys 


Leu 


Tyr 


He 


















65 






70 






















<210> 315 




























<211> 106 




























<212> PRT 




























<213> Homo ; 


sapiens 
























<400> 315 




























Met Leu Lys 


Ser 


Asn 


Gly 


Glu 


Arg 


Arg 


Ser 


Arg 


Asn 


Ala 


Leu 


Pro 


Ala 


1 




5 










10 










15 




Val Tyr Ala 


Arg 


Lys 


Met 


Ala 


Ala 


Ser 


Gin 


Gin 


Gin 


Ala 


Ser 


Ala 


Ala 




20 










25 










30 






Ser Ser Ala 


Ala 


Gly 


Val 


Ser 


Gly 


Pro 


Ser 


Ser 


Ala 


Gly 


Gly 


Pro 


Gly 


35 










40 










4 5 








Pro Gin Gin 


Gin 


Pro 


Gin 


Pro 


Pro 


Ala 


Gin 


Leu 


Val 


Gly 


Pro 


Ala 


Gin 


50 








55 










60 










Ser Gly Leu 


Leu 


Gin 


Gin 


Gin 


Gin 


Gin 


Asp 


Phe 


Asp 


Pro 


Val 


Gin 


Arg 


65 






70 










75 










80 


Tyr Lys Met 


Leu 


He 


Pro 


Gin 


Leu 


Lys 


Glu 


Ser 


Leu 


Gin 


Val 


He 


Gly 






85 










90 










95 




Leu Lys Gin 


Arg 


Glu 


Ala 


Asn 


Trp 


He 


Trp 
















100 










105 
















<210> 316 




























<211> 86 




























<212> PRT 




























<213> Homo ; 


sapiens 
























<400> 316 




























Met Ala Glu 


Asp 


Met 


Glu 


Thr 


Lys 


lie 


Lys 


Asn 


Tyr 


Lys 


Thr 


Ala 


Pro 


1 




5 










10 










15 




Phe Asp Ser 


Arg 


Phe 


Pro 


Asn 


Gin 


Asn 


Gin 


Thr 


Arg 


Asn 


Cys 


Trp 


Gin 




20 










25 










30 






Asn Tyr Leu 


Asp 


Phe 


His 


Arg 


Cys 


Gin 


Lys 


Ala 


Met 


Thr 


Ala 


Lys 


Gly 


35 










40 










45 








Gly Asp lie 


Ser 


Val 


Cys 


Glu 


Trp 


Tyr 


Gin 


Arg 


Val 


Tyr 


Gin 


Ser 


Leu 


50 








55 










60 










Cys Pro Thr 


Ser 


Trp 


Val 


Thr 


Asp 


Trp 


Asp 




Gin 


Arg 


Ala 




Caiy 


65 






70 










75 










80 


Thr Phe Pro 


Gly 


Lys 


He 


























85 
























<210> 317 




























<211> 105 




























<212> PRT 




























<213> Homo 


sapiens 
























<400> 317 




























Met Cys Val 


Gly 


Leu 


Leu 


Met 


Val 


Ser 


He 


Leu 


Thr 


Asp 


He 


Asn 


Leu 


1 




5 










10 










15 




Ser Asn Leu 


Val 


Ala 


Val 


Gin 


Tyr 


Ser 


Pro 


Asp 


Tyr 


Cys 


Asn 


Phe 


Arg 
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20 25 30 

Lys Arg Ser Asp Lys Asn Gin Asp Ala Ser Thr Phe Cys His Asn Cys 

35 ~ 40 45 

Asn Gin Phe His Leu Cys Leu Gin Tyr His His Lys lie Val Leu Pro 

50 55 60 

Trp Ser Val Leu Ala lie Leu Ser Gin Leu Phe Leu His lie Thr Ser 
65 70 75 80 

Arg lie Arg Pro Thr lie Tyr Lys Ser Asn Lys Pro Lys Ser lie Glu 

85 90 95 

lie Leu lie Gly Phe Val Leu Asn Leu 
100 105 

<210> 318 
<211> 101 
<212> PRT 
<213> Homo sapiens 

<400> 318 

Met Glu Arg Pro Asp Lys Ala Ala Leu Asn Ala Leu Gin Pro Pro Glu 

15 10 15 

Phe Arg Asn Glu Ser Ser Leu Ala Ser Thr Leu Lys Thr Leu Leu Phe 

20 25 30 

Phe Thr Ala Leu Met lie Thr Val Pro lie Gly Leu Tyr Phe Thr Thr 

35 40 45 

Lys Ser Tyr lie Phe Glu Gly Ala Leu Gly Met Ser Asn Arg Asp Ser 

50 " 55 60 

Tyr Phe Tyr Ala Ala lie Val Ala Val Val Ala Val His Val Val Leu 
65 70 75 80 

Ala Leu Phe Val Tyr Val Ala Trp Asn Glu Gly Ser Arg Gin Trp Arg 

85 90 95 

Glu Gly Lys Gin Asp 
100 



<210> 319 

<211> 92 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> 29 

<223> Xaa = Gly, Arg 



<220> 

<221> UNSURE 
<222> 28 

<223> Xaa = Lys , Asn, Arg, Ser 



<400> 319 

Met Phe Arg Asp 

1 

Val Asp Ala Arg 
20 

Lys Pro Lys Pro 
35 

Tyr Ala Val Trp 
50 

Phe Gin Val Cys 
65 

lie Cys Arg His 



Phe Gly Arg Arg 
5 

Leu Arg Leu Ser 

Val Glu Val Gin 
40 

Phe Gly Gly Ser 
55 

His Thr Lys Lys 
70 

Asn Pro Val Phe 
85 



Leu Gin Arg Asp 
10 

Glu Glu Leu Xaa 
25 

Val Val Thr His 

Met Leu Ala Ser 
60 

Asp Tyr Glu Glu 
75 

Gly Val Met Ser 
90 



Leu Lys Arg Val 
15 

Xaa Gly Arg lie 
30 

His Met Gin Arg 
4 5 

Thr Pro Glu Phe 

Tyr Gly Pro Ser 
80 



<210> 320 



WO 02/083898 
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185 

<211> 92 
<212> PRT 

<213> Homo sapiens 
<400> 320 



Met Phe Arg 


Asp 


Phe 


Gly 


Arg 


Arg 


Leu 


Gin 


Arg 


Asp 


Leu 


Lys 


Arg 


Val 


1 




5 










10 










15 




Val Asp Ala 


Arg 


Leu 


Arg 


Leu 


Ser 


Glu 


Glu 


Leu 


Ser 


Gly 


Gly 


Arg 


He 




20 










25 










30 






Lys Pro Lys 


Pro 


Val 


Glu 


Val 


Gin 


Val 


Val 


Thr 


His 


His 


Met 


Gin 


Arg 


35 










40 










45 








Tyr Ala Val 


Trp 


Phe 


Gly 


Gly 


Ser 


Met 


Leu 


Ala 


Ser 


Thr 


Pro 


Glu 


Phe 


50 








55 










60 










Phe Gin Val 


Cys 


His 


Thr 


Lys 


Lys 


Asp 


Tyr 


Glu 


Glu 


Tyr 


Gly 


Pro 


Ser 


65 






70 










75 










80 


lie Cys Arg 


His 


Asn 


Pro 


Val 


Phe 


Gly 


Val 


Met 


Ser 














85 










90 














<210> 321 




























<211> 98 




























<212> PRT 




























<213> Homo sapiens 
























<400> 321 




























Met Ser Val 


Ser 


Phe 


His 


Thr 


His 


Thr 


Lys 


Glu 


Leu 


Trp 


Thr 


Trp 


Met 


1 




5 










10 










15 




Glu Asp Leu 


Gin 


Lys 


Glu 


Met 


Leu 


Glu 


Asp 


Val 


Cys 


Ala 


Asp 


Ser 


Val 




20 










25 










30 






Asp Ala Val 


Gin 


Glu 


Leu 


He 


Lys 


Gin 


Phe 


Gin 


Gin 


Gin 


Gin 


Thr 


Ala 


35 










40 










45 








Thr Leu Asp 


Ala 


Thr 


Leu 


Asn 


Val 


He 


Lys 


Glu 


Gly 


Glu 


Asp 


Leu 


He 


50 








55 










60 










Gin Gin Leu 


Arg 


Ser 


Ala 


Pro 


Pro 


Ser 


Leu 


Gly 


Glu 


Pro 


Ser 


Glu 


Ala 


65 






70 










75 










80 


Arg Ser Ala 


Trp 


Ala 


Glu 


Leu 


Ser 


Ser 


Gly 


Lys 


Cys 


Leu 


Gly 


Leu 


Asp 






85 










90 










95 




Val Arg 




























<210> 322 




























<211> 89 




























<212> PRT 




























<213> Homo : 


sapiens 
























<400> 322 




























Met Val Ser 


Gly 


Asp 


Gly 


Phe 


Leu 


Val 


Ser 


Arg 


Pro 


Glu 


Ala 


He 


His 


1 




5 










10 










15 




Leu Gly Pro 


Arg 


Gin 


Ala 


Val 


Arg 


Pro 


Ser 


Val 


Arg 


Ala 


Glu 


Ser 


Arg 




20 










25 










30 






Arg Val Asp 


Gly 


Gly 


Gly 


Arg 


Ser 


Pro 


Arg 


Glu 


Pro 


Asp 


Gly 


Arg 


Gly 


35 










40 










45 








Arg Ser Arg 


Gin 


Ala 


Arg 


Phe 


Ser 


Pro 


Tyr 


Pro 


He 


Pro 


Ala 


Val 


Glu 


50 








55 










60 










Pro Asp Leu 


Leu 


Arg 


Ser 


Val 


Leu 


Gin 


Gin 


Arg 


Leu 


He 


Ala 


Leu 


Gly 


65 






70 










75 










80 


Gly Val lie 


Ala 


Ala 


Arg 


He 


Ser 


Val 

















85 



<210> 323 
<211> 123 
<212> PRT 

<213> Homo sapiens 



<400> 323 



WO 02/083898 



Met 


Ala 


Leu 


Asn 


Asn 


Val 


Ser 


Leu 


Ser 


1 








5 










Val 


Ala 


Tyr 


Arg 


Ser 


Ser 


His 


Gly Asp 








20 










25 


Leu 


Ala 


Met 


Val 


Ser 


Gly Asp 


Gly 


Phe 






35 










40 




lie 


His 
50 


Leu 


Gly 


Pro 


Arg 


Gin 
55 


Ala 


Val 


Ser 


Arg 


Arg 


Val 


Asp 


Gly 


Gly 


Gly Arg 


65 










70 








Arg 


Gly 


Arg 


Ser 


Arg 
85 


Gin 


Ala 


Arg 


Phe 


Val 


Glu 


Pro 


Asp 
100 


Leu 


Leu 


Arg 


Ser 


Val 
105 


Leu 


Gly 


Gly 
115 


Val 


He 


Ala 


Ala 


Arg 
120 


He 



PCT/IB01/00914 

186 

Ser Gly Asp Gin Arg Ser Arg 
10 15 
Leu Arg Pro Arg Ala Ser Ala 
30 

Leu Val Ser Arg Pro Glu Ala 
45 

Arg Pro Ser Val Arg Ala Glu 
60 

Ser Pro Arg Glu Pro Asp Gly 

75 80 
Ser Pro Tyr Pro lie Pro Ala 
90 95 
Leu Gin Gin Arg Leu He Ala 
110 

Ser Val 



<210> 324 
<211> 172 
<212> PRT 

<213> Homo sapiens 
<400> 324 

Met Ala Gly Ala Ala Thr Gin Ala Ser Leu Glu Ser Ala Pro Arg lie 

15 10 15 

Met Arg Leu Val Ala Glu Cys Ser Arg Ser Arg Ala Arg Ala Gly Glu 

20 ^ 25 30 

Leu Trp Leu Pro His Gly Thr Val Ala Thr Pro Val Phe Met Pro Val 

35 40 45 

Gly Thr Gin Ala Thr Met Lys Gly He Thr Thr Glu Gin Leu Asp Ala 

50 55 60 

Leu Gly Cys Arg He Cys Leu Gly Asn Thr Tyr His Leu Gly Leu Arg 
65 70 75 80 

Pro Gly Pro Glu Leu He Gin Lys Ala Asn Gly Leu His Gly Phe Met 

85 90 95 

Asn Trp Pro His Asn Leu Leu Thr Asp Ser Gly Gly Phe Gin Met Val 

100 105 110 

Ser Leu Val Ser Leu Ser Glu Val Thr Glu Glu Gly Val Arg Phe Arg 

115 120 125 

Ser Pro Tyr Asp Gly Asn Glu Thr Leu Leu Ser Pro Glu Lys Ser Val 

130 ~ 135 140 

Gin He Gin Asn Ala Leu Gly Ser Asp He lie Met Gin Leu Asp Asp 
145 150 155 160 

Val Val Ser Ser Thr Val Thr Gly Pro Arg Val Glu 
165 170 



<210> 325 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 325 

Met Ala He Lys Phe Leu Glu Val He Lys Pro Phe Cys Val He Leu 
15 10 15 

Gin Leu Thr Ser 
20 



<210> 326 

<211> 23 

<212> PRT 

<213> Homo sapiens 



<400> 326 
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Phe Leu Asn Asn Ser Ser Pro Gin Glu Pro Leu Lys Leu Pro Pro Gly 
15 10 15 

Gin Leu Ser Ser Thr His Arg 
20 

<210> 327 
<211> 45 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 27 

<223> Xaa = Phe, Ser 
<400> 327 

Met Glu Ser Asp Asp 
1 5 
Asp Cys Tyr Lys Pro 
20 

lie Arg Gly Met Gin 
35 

<210> 328 
<211> 109 
<212> PRT 

<213> Homo sapiens 
<400> 328 



Met 


Ala 


Ala 


Ser 


Ala 


Ala 


Arg 


Gly Ala 


Ala 


Ala 


Leu 


Arg 


Arg 


Ser 


He 


1 








5 










10 










15 




Asn 


Gin 


Pro 


Val 
20 


Ala 


Phe 


Val 


Arg 


Arg 
25 


He 


Pro 


Trp 


Thr 


Ala 
30 


Ala 


Ser 


Ser 


Gin 


Leu 
35 


Lys 


Glu 


His 


Phe 


Ala 
40 


Gin 


Phe 


Gly 


His 


Val 
45 


Arg 


Arg 


Cys 


He 


Leu 
50 


Pro 


Phe 


Asp 


Lys 


Glu 
55 


Thr 


Gly 


Phe 


His 


Arg 
60 


Gly 


Leu 


Gly 


Trp 


Val 


Gin 


Phe 


Ser 


Ser 


Glu 


Glu 


Gly 


Leu 


Arg 


Asn 


Ala 


Leu 


Gin 


Gin 


Glu 


65 










70 










75 










80 


Asn 


His 


He 


He 


Asp 
85 


Gly 


Val 


Lys 


Val 


Gin 
90 


Val 


His 


Thr 


Arg 


Arg 
95 


Pro 


Lys 


Leu 


Pro 


Gin 
100 


Thr 


Ser 


Asp 


Asp 


Glu 
105 


Lys 


Lys 


Asp 


Phe 









<210> 329 
<211> 51 
<212> PRT 

<213> Homo sapiens 
<400> 329 

Met Ala Ala Ser Ala Ala Arg Gly Ala Ala Ala Leu His Lys Tyr Gin 

15 10 15 

Ser Ala Gly Cys Phe Cys Glu Lys Asn Ser Leu Asp Cys Gly Val Glu 

20 25 30 

Ser Ala Glu Arg Thr Leu Cys Thr Val Arg Pro Cys Gin Lys Val His 

35 " 40 45 

Phe Thr Phe 
50 

<210> 330 
<211> 99 
<212> PRT 

<213> Homo sapiens 



Ala Arg Ala Ala Arg Val Lys Glu Leu Leu Val 

10 15 
Thr Glu Ala Phe He Xaa Gly Leu Leu Asp Lys 

25 30 
Lys Leu Ser Thr Pro Gin Lys Lys 
40 45 
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<400> 330 

Pro Val Arg Gly Leu Asn Met Ser 
1 5 
Arg Pro Tyr Val Tyr Asp Leu lie 
20 

Met Gly Val Gly His Tyr 
35 



Glu Phe Val Cys Asn Leu Ser Ala 

10 15 
Ala Val Ser Asn His Tyr Gly Ala 
25 30 



<210> 331 
<211> 13 
<212> PRT 

<213> Homo sapiens 



<400> 331 

Met Leu Gly Phe Ala Lys Asn Trp lie Phe Ser lie Thr 
1 5 10 



<210> 332 
<211> 13 
<212> PRT 

<213> Homo sapiens 



<400> 332 

Met Leu Gly Phe Ala Lys Asn Trp lie Phe Ser lie Thr 
1 5 10 



<210> 335 
<211> 13 
<212> PRT 

<213> Homo sapiens 



<400> 335 

Met Leu Gly Phe Ala Lys Asn Trp lie Phe Ser lie Thr 
1 5 10 



<210> 336 
<211> 13 
<212> PRT 

<213> Homo sapiens 



<400> 336 

Met Leu Gly Phe Ala Lys Asn Trp lie Phe Ser lie Thr 
1 5 * 10 



<210> 337 
<211> 109 
<212> PRT 

<213> Homo sapiens 



<400> 337 

Met Ala Ala Ser 

1 

Asn Gin Pro Val 
20 

Ser Gin Leu Lys 
35 

He Leu Pro Phe 
50 

Val Gin Phe Ser 
65 

Asn His lie He 



Ala Ala Arg Gly 
5 

Ala Phe Val Arg 

Glu His Phe Ala 
40 

Asp Lys Glu Thr 
55 

Ser Glu Glu Gly 
70 

Asp Gly Val Lys 
85 



Ala Ala Ala Leu 
10 

Arg Xaa Pro Trp 
25 

Gin Phe Gly His 

Gly Phe His Arg 
60 

Leu Arg Asn Ala 
75 

Val Gin Val His 
90 



Arg Arg Ser He 
15 

Thr Ala Ala Ser 
30 

Val Arg Arg Cys 
45 

Gly Leu Gly Trp 

Leu Gin Gin Glu 
80 

Thr Arg Arg Pro 
95 
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Lys Leu Pro Gin Thr Ser Asp Asp Glu Lys Lys Asp Phe 
100 105 



<210> 338 
<211> 59 
<212> PRT 

<213> Homo sapiens 



<400> 338 

Met Lys Phe Gly 

1 

Leu Asn Gly lie 
20 

Ser Gin Arg Asn 
35 

Glu Gly Asp Ala 
50 



Cys Leu Ser Phe 
5 

Lys Thr Val Glu 

Cys Thr lie Ala 
40 

Cys Arg Glu Leu 
55 



Arg Gin Pro Tyr 
10 

Thr Arg Trp Arg 

25 

Val His lie Ala 
Leu Val Glu 



Ala Gly Phe Val 
15 

Pro Leu Leu Ser 
30 

His Arg Asp Trp 
45 



<210> 339 
<211> 506 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 88 . . 471 



<220> 

<221> sig__peptide 

<222> 88 . . 189 

<223> Von Heijne matrix 

score 13.6000003814697 
seq VLLLLSLLLTALA/SS 



<400> 339 

acttatctgc agacttgtag gcagcaactc accctcactc agaggtcttc tggttctgga 60 
aacaactcta gctcagcctt ctccacc atg age etc aga ctt gat acc acc cct 114 

Met Ser Leu Arg Leu Asp Thr Thr Pro 
-30 



tec 


tgt 


aac 


agt 


gcg 


aga 


cca 


ctt 


cat 


gee 


ttg 


cag 


gtg 


ctg 


ctg 


ctt 


162 


Ser 


Cys 


Asn 


Ser 


Ala 


Arg 


Pro 


Leu 


His 


Ala 


Leu 


Gin 


Val 


Leu 


Leu 


Leu 




-25 










-20 










-15 










-10 




ctg 


tea 


ttg 


ctg 


ctg 


act 


get 


ctg 


get 


tec 


tec 


acc 


aaa 


gga 


caa 


act 


210 


Leu 


Ser 


Leu 


Leu 


Leu 
-5 


Thr 


Ala 


Leu 


Ala 


Ser 
1 


Ser 


Thr 


Lys 


Gly 

5 


Gin 


Thr 




aag 


aga 


aac 


ttg 


gcg 


aaa 


ggc 


aaa 


gag 


gaa 


agt 


eta 


gac 


agt 


gac 


ttg 


258 


Lys 


Arg 


Asn 


Leu 


Ala 


Lys 


Gly 


Lys 


Glu 


Glu 


Ser 


Leu 


Asp 


Ser 


Asp 


Leu 








10 










15 










20 










tat 


get 


gaa 


etc 


cgc 


tgc 


atg 


tgt 


ata 


aag 


aca 


acc 


tct 


gga 


att. 


cat 


306 


Tyr 


Ala 


Glu 


Leu 


Arg 


Cys 


Met 


Cys 


He 


Lys 


Thr 


Thr 


Ser 


Gly 


He 


His 






25 










30 










35 












ccc 


aaa 


aac 


ate 


caa 


agt 


ttg 


gaa 


gtg 


ate 


ggg 


aaa 


gga 


acc 


cat 


tgc 


354 


Pro 


Lys 


Asn 


He 


Gin 


Ser 


Leu 


Glu 


Val 


He 


Gly 


Lys 


Gly 


Thr 


His 


Cys 




40 










45 










50 










55 




aac 


caa 


gtc 


gaa 


gtg 


ata 


gec 


aca 


ctg 


aag 


gat 


ggg 


agg 


aaa 


ate 


tgc 


402 


Asn 


Gin 


Val 


Glu 


Val 


He 


Ala 


Thr 


Leu 


Lys 


Asp 


Gly 


Arg 


Lys 


He 


Cys 












60 










65 










70 






ctg 


gac 


cca 


gat 


get 


ccc 


aga 


ate 


aag 


aaa 


att 


gta 


cag 


aaa 


aaa 


ttg 


450 


Leu 


Asp 


Pro 


Asp 


Ala 


Pro 


Arg 


He 


Lys 


Lys 


He 


Val 


Gin 


Lys 


Lys 


Leu 










75 










80 










85 








gca 


ggt 


gat 


gaa 


tct 


get 


gat 


taatttgttc tgtttctgcc aaacttcttt 


501 


Ala 


Gly 


Asp 


Glu 


Ser 


Ala 


Asp 























90 
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506 

<210> 340 
<211> 523 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 41 . .235 

<220> 

<221> sig_peptide 

<222> 41 . .94 

<223> Von Heijne matrix 

score 12.1999998092651 

seq LTLAVLFLTGSQA/RH 



WO 02/083898 

190 

aactc 



<400> 340 

ttagagactg cgagaaggag gtcccccacg gcccttcagg 



ctg acc ttg 
Leu Thr Leu 

tgg cag caa 
Trp Gin Gin 

5 

ctg gcc act 
Leu Ala Thr 
20 

gtg tec cag 
Val Ser Gin 

gctccttgac 
ccctgtgacc 
gatgagcaag 
gaagaagtgg 
aacaagcett 



gcc gtg etc ttc ctg acg ggg age 
Ala Val Leu Phe Leu Thr Gly Ser 
-10 -5 
gat gaa ccc ccc cag age ccc tgg 
Asp Glu Pro Pro Gin Ser Pro Trp 
10 

gtg tac gtg gat gtg etc aaa gac 
Val Tyr Val Asp Val Leu Lys Asp 

25 30 
ttt gaa ggc tec get tgg gaa aac 
Phe Glu Gly Ser Ala Trp Glu Asn 
40 45 
aactgggaca gcgtracctc crccttcagc 
caggagttct gggataacct ggaaaaggag 
gatctggagg aggtgaaggc caasgtgcag 
caggaggaga tggagctcta ccgccagaag 
ggaccaaatg gaaataaagc tttttgat 



atg 
Met 

cag 
Gin 

gat 

Asp 

15 

age 

Ser 

age 
Ser 



aaa get 
Lys Ala 

get egg 
Ala Arg 
1 

cga gtg 
Arg Val 



gcg gtg 
Ala Val 
-15 

cat ttc 
His Phe 

aag gac 
Lys Asp 



ggc aga gac tat 
Gly Arg Asp Tyr 
35 

taaacctaaa 



aakctgcgcg aacagctegg 
acagagggee tgaggcagga 
ccctacctgg acgacttcca 
gcagctttct taactatcct 



55 



103 



151 



199 



245 



305 
365 
425 
485 
523 



<210> 341 
<211> 580 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 134 . . 373 



<220> 

<221> sig_peptide 
<222> 134 . .211 
<223> Von Heijne matrix 
score 12 

seq LLLALLLPTQIYS/SE 



<400> 341 

atttgtggct ttcctggtat ataaggtctc gccggctcgc cgcgctcccc accttgcctg 60 
cgcccgcccg gagecagegg ttctccaagc acccagcatc ctgctagacg cgccgcgcac 120 
egaeggaggg gac atg ggc aga gca atg gtg gcc agg etc ggg ctg ggg 169 
Met Gly Arg Ala Met Val Ala Arg Leu Gly Leu Gly 
-25 -20 -15 

ctg ctg ctg ctg gca ctg etc eta ccc acg cag att tat tec agt gaa 217 
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191 

Leu Leu Leu Leu Ala Leu Leu Leu Pro Thr Gin lie Tyr Ser Ser Glu 

-10 -5 1 

aca aca act gga act tea agt aac tec tec cag agt act tec aac tct 265 

Thr Thr Thr Gly Thr Ser Ser Asn Ser Ser Gin Ser Thr Ser Asn Ser 

5 ~ 10 15 

ggg ttg gec cca aat cca act aat gec ace ace aag gyg get ggt ggt 313 

Gly Leu Ala Pro Asn Pro Thr Asn Ala Thr Thr Lys Xaa Ala Gly Gly 

20 25 30 

gec ctg cag tea aca gec agt etc ttc gtg gtc tea etc tct ctt ctg 361 

Ala Leu Gin Ser Thr Ala Ser Leu Phe Val Val Ser Leu Ser Leu Leu 
35 40 45 50 
cat etc tac tct taagagactc aggecaagaa aegtcttcta aatttcccca 413 
His Leu Tyr Ser 

tcttctaaac ccaatccaaa tggcgtctgg aagtccaatg tggcaaggaa aaacaggtct 473 

tcatcgaatc tactaattcc acacctttta ttgacacaga aaatgttgag aatcccaaat 533 

ttgattgatt tgaagaacat gtgagaggkt tgactagatg atggatg 580 



<210> 342 
<211> 508 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 76. . 378 



<220> 

<221> sig_peptide 

<222> 76. . 156 

<223> Von Heijne matrix 

score 11.8000001907349 
seq LFLGLLLLPLVVA/FA 



<400> 342 

aattggccac agagacccag cccgagtttc ccatcgcact gagcactgag atectgetgg 60 
aagctctgcc gcagc atg age tec gca gec ggg ttc tgc gec tea cgc ccc 111 
Met Ser Ser Ala Ala Gly Phe Cys Ala Ser Arg Pro 
-25 -20 



ggg 


ctg 


ctg 


ttc 


ctg 


ggg 


ttg 


ctg 


etc 


ctg 


cca 


ctt 


gtg 


gtc 


gec 


ttc 


159 


Gly 


Leu 


Leu 


Phe 


Leu 


Gly 


Leu 


Leu 


Leu 


Leu 


Pro 


Leu 


Val 


Val 


Ala 


Phe 




-15 










-10 










-5 










1 




gee 


age 


get 


gaa 


get 


gaa 


gaa 


gat 


ggg 


gac 


ctg 


cag 


tgc 


ctg 


tgt 


gtg 


207 


Ala 


Ser 


Ala 


Glu 
5 


Ala 


Glu 


Glu 


Asp 


Gly 
10 


Asp 


Leu 


Gin 


Cys 


Leu 
15 


Cys 


Val 




aag 


acc 


acc 


tec 


cag 


gtc 


cgt 


ccc 


agg 


cac 


ate 


acc 


age 


ctg 


gag 


gtg 


255 


Lys 


Thr 


Thr 
20 


Ser 


Gin 


Val 


Arg 


Pro 
25 


Arg 


His 


He 


Thr 


Ser 
30 


Leu 


Glu 


Val 




ate 


aag 


gee 


gga 


ccc 


cac 


tgc 


ccc 


act 


gee 


caa 


ctg 


ata 


gee 


acg 


ctg 


303 


He 


Lys 
35 


Ala 


Gly 


Pro 


His 


Cys 
40 


Pro 


Thr 


Ala 


Gin 


Leu 
45 


He 


Ala 


Thr 


Leu 




aag 


aat 


gga 


agg 


aaa 


att 


tgc 


ttg 


gac 


ctg 


caa 


gee 


ccg 


ctg 


tac 


aag 


351 


Lys 


Asn 


Gly 


Arg 


Lys 


He 


Cys 


Leu 


Asp 


Leu 


Gin 


Ala 


Pro 


Leu 


Tyr 


Lys 




50 










55 










60 










65 




aaa 


ata 


att 


aag 


aaa 


ctt 


ttg 


gag 


agt 


tagctactag ctgcctacgt 




398 


Lys 


He 


He 


Lys 


Lys 
70 


Leu 


Leu 


Glu 


Ser 



















gtgtgcattt gctatatagc atacttcttt tttccagttt caatctaact gtgaaagaac 458 
ttctgatatt tgtgttatcc ttatgatttt aaataaacaa aataaatcta 508 



<210> 343 
<211> 459 
<212> DNA 
<213> Homo sapiens 
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192 



<220> 
<221> CDS 
<222> 75. .332 



<220> 

<221> sig_peptide 

<222> 75. . 128 

<223> Von Heijne matrix 

score 11.6999998092651 
seq VLLLLLLVEQAAA/LG 



<400> 343 

gactctttgg tcagggaact gctcgctgag cacagctgca cagtgctgtc agaacggccg 60 

110 



158 



atctccagcc caag 


atg 


att 


cca 


gca 


gtg 


gtc 


ttg 


etc 


tta 


etc 


ctt 


ttg 










Met 


He 


Pro 


Ala 


Val 


Val 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 
















-15 










-10 








gtt 


gaa 


caa 


gca 


gcg 


gec 


ctg 


gga 


gag 


cct 


cag 


etc 


tgc 


tat 


ate 


ctg 


Val 


Glu 


Gin 


Ala 


Ala 


Ala 


Leu 


Gly 


Glu 


Pro 


Gin 


Leu 


Cys 


Tyr 


He 


Leu 




-5 










1 








5 










10 


gat 


gec 


ate 


ctg 


ttt 


ctg 


tat 


gga 


att 


gtc 


etc 


acc 


etc 


etc 


tac 


tgt 


Asp 


Ala 


He 


Leu 


Phe 


Leu 


Tyr 


Gly 


He 


Val 


Leu 


Thr 


Leu 


Leu 


Tyr 


Cys 










15 










20 










25 




cga 


ctg 


aag 


ate 


caa 


gtg 


cga 


aag 


gca 


get 


ata 


acc 


age 


tat 


gag 


aaa 


Arg 


Leu 


Lys 


He 


Gin 


Val 


Arg 


Lys 


Ala 


Ala 


He 


Thr 


Ser 


Tyr 


Glu 


Lys 








30 










35 










40 






tea 


gat 


ggt 


gtt 


tac 


acg 


ggc 


ctg 


age 


acc 


agg 


aac 


cag 


gag 


act 


tac 


Ser 


Asp 


Gly 


Val 


Tyr 


Thr 


Gly 


Leu 


Ser 


Thr 


Arg 


Asn 


Gin 


Glu 


Thr 


Tyr 






45 










50 










55 








gag 


act 


ctg 


aag 


cat 


gag 


aaa 


cca 


cca 


cag 


tagecttaga atagatgegg 


Glu 


Thr 


Leu 


Lys 


His 


Glu 


Lys 


Pro 


Pro 


Gin 















206 



254 



302 



352 



60 65 
tcatattctt ctttggcttc tggttcttcc agccctcatg tttggcatca catatgectg 4 12 
catgecatta acaccagctg gccctacccc tataatgatc ctgtgtc 459 



<210> 344 
<211> 553 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 160. .435 



<220> 

<221> sig_peptide 

<222> 160. .231 

<223> Von Heijne matrix 

score 11.6999998092651 
seq FVLGLGLTPPTLA/QD 



<400> 344 

agatgeaaag caggattcaa aagaacatct ttgcgttttc taccggctcc ccatcatcgt 60 
actagggagg aagaagcggg tgagaaacaa aacttctttc cattgtcctg cccgtttctg 120 
cggacttgtt ctgaggcega ggagcctgtg ttggaagag atg gtg atg ggc ctg 174 

Met Val Met Gly Leu 
-20 

ggc gtt ttg ttg ttg gtc ttc gtg ctg ggt ctg ggt ctg acc cca ccg 222 
Gly Val Leu Leu Leu Val Phe Val Leu Gly Leu Gly Leu Thr Pro Pro 

-15 -10 -5 

acc ctg get cag gat aac tec agg tac aca cac ttc ctg acc cag cac 270 
Thr Leu Ala Gin Asp Asn Ser Arg Tyr Thr His Phe Leu Thr Gin His 
1 5 " 10 
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tat 


gat 


gcc 


aaa 


cca 


cag 


ggc 


egg 


gat 


gac 


aga 


tac tgt 


gaa 


age 


ate 


318 


Tyr 


Asp 


Ala 


Lys 


Pro 


Gin 


Gly 


Arg 


Asp 


Asp 


Arg 


Tyr Cys 


Glu 


Ser 


lie 






15 










20 










25 










atg 


agg 


aga 


egg 


ggc 


ctg 


acc 


tea 


ccc 


tgc 


aaa 


gac ate 


aac 


aca 


ttt 


366 


Met 


Arg 


Arg 


Arg 


Gly 


Leu 


Thr 


Ser 


Pro 


Cys 


Lys 


Asp lie 


Asn 


Thr 


Phe 




30 










35 










40 








45 




att 


cat 


ggc 


aac 


aag 


cgc 


aga 


tea 


agg 


cca 


tct 


gtg aaa 


aca 


aga 


atg 


414 


lie 


His 


Gly 


Asn 


Lys 


Arg 


Arg 


Ser 


Arg 


Pro 


Ser 


Val Lys 


Thr 


Arg 


Met 












50 










55 








60 






gaa 


acc 


etc 


aca 


gag 


aaa 


acc 


taagaataag < 


caagtcttct ttccaggtca 


465 


Glu 


Thr 


Leu 


Thr 


Glu 


Lys 


Thr 





















65 



ccacttgcaa gctacatgga ggttcccctg gcctccatgc cagtaccgag ccacagcggg 525 
gttcagaaac gttgttgttg cttgtgaa 553 



<210> 345 
<211> 499 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 39. .371 



<220> 

<221> sig_jpeptide 
<222> 39. . 86 
<223> Von Heijne matrix 
score 11.5 

seq ILLSVALLAFSSA/QD 



<400> 345 

agcataaagt tgggagtgac accagagcct tetgeaag atg ctt ctg att ctg ctg 56 

Met Leu Leu lie Leu Leu 
-15 



tea 


gtg 


gcc 


ctg 


ctg 


gcc 


ttc 


age 


tea 


get 


cag 


gac 


tta 


gat 


gaa 


gat 


104 


Ser 


Val 


Ala 


Leu 


Leu 


Ala 


Phe 


Ser 


Ser 


Ala 


Gin 


Asp 


Leu 


Asp 


Glu 


Asp 




-10 










-5 










1 








5 






gtc 


age 


caa 


gaa 


gac 


gtt 


ccc 


ttg 


gta 


ata 


tea 


gat 


gga 


gga 


gac 


tct 


152 


Val 


Ser 


Gin 


Glu 


Asp 


Val 


Pro 


Leu 


Val 


He 


Ser 


Asp 


Gly 


Gly 


Asp 


Ser 










10 










15 










20 








gag 


cag 


ttc 


ata 


gat 


gag 


gag 


cgt 


cag 


gga 


cca 


cct 


ttg 


gga 


gga 


cag 


200 


Glu 


Gin 


Phe 


He 


Asp 


Glu 


Glu 


Arg 


Gin 


Gly 


Pro 


Pro 


Leu 


Gly 


Gly 


Gin 








25 










30 










35 










caa 


tct 


caa 


ccc 


tct 


get 


ggt 


gat 


ggg 


aac 


cag 


rat 


gat 


ggc 


cct 


cag 


248 


Gin 


Ser 


Gin 


Pro 


Ser 


Ala 


Gly 


Asp 


Gly 


Asn 


Gin 


Xaa 


Asp 


Gly 


Pro 


Gin 






40 










45 










50 












cag 


gga 


cca 


ccc 


caa 


caa 


gga 


ggc 


cag 


cag 


caa 


caa 


ggt 


cca 


cca 


cct 


296 


Gin 


Gly 


Pro 


Pro 


Gin 


Gin 


Gly 


Gly 


Gin 


Gin 


Gin 


Gin 


Gly 


Pro 


Pro 


Pro 




55 










60 










65 










70 




cct 


cag 


gga 


aag 


cca 


car 


gga 


cca 


cct 


ccc 


caa 


ggg 


ggc 


cgc 


cca 


caa 


344 


Pro 


Gin 


Gly 


Lys 


Pro 


Gin 


Gly 


Pro 


Pro 


Pro 


Gin 


Gly 


Gly 


Arg 


Pro 


Gin 












75 










80 










85 






gga 


cct 


cca 


cag 


ggg 


cag 


tct 


cct 


cag 


taatctagga ttcaatgaca 




391 


Gly 


Pro 


Pro 


Gin 


Gly 


Gin 


Ser 


Pro 


Gin 






















90 










95 



















ggaagtgaat aagaagatga cagtgtttca aatgccttga aacataatgt gatcatgetc 451 
taacttcaat ataccaataa aataatcagc ttgcaaaaaa aaaaaaaa 499 



<210> 346 
<211> 398 
<212> DNA 
<213> Homo sapiens 
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<220> 
<221> CDS 
<222> 36. .353 



<220> 

<221> sig_peptide 
<222> 36. .83 
<223> Von Heijne matrix 
score 11.5 

seq ILLSVALLAFSSA/QD 



<400> 346 

ataaagttgg gagtgacacc agagccttct gcaag atg ctt ctg att ctg ctg 53 

Met Leu Leu lie Leu Leu 
-15 



tea 


gtg 


gec 


ctg 


ctg 


gec 


ttc 


age 


tea 


get 


cag 


gac 


tta 


gat 


gaa 


gat 


101 


Ser 


Val 


Ala 


Leu 


Leu 


Ala 


Phe 


Ser 


Ser 


Ala 


Gin 


Asp 


Leu 


Asp 


Glu 


Asp 




-10 










-5 










1 








5 






gtc 


age 


caa 


gaa 


gac 


gtt 


ccc 


ttg 


gta 


ata 


tea 


gat 


gga 


gga 


gac 


tct 


149 


Val 


Ser 


Gin 


Glu 


Asp 


Val 


Pro 


Leu 


Val 


He 


Ser 


Asp 


Gly 


Gly 


Asp 


Ser 










10 










15 










20 








gag 


cag 


ttc 


ata 


gat 


gag 


gag 


cgt 


cag 


gga 


cca 


cct 


ttg 


gga 


gga 


cag 


197 


Glu 


Gin 


Phe 


He 


Asp 


Glu 


Glu 


Arg 


Gin 


Gly 


Pro 


Pro 


Leu 


Gly 


Gly 


Gin 








25 










30 










35 










caa 


tct 


caa 


ccc 


tct 


get 


ggt 


gat 


ggg 


aac 


cag 


gat 


gat 


ggc 


cct 


cag 


245 


Gin 


Ser 


Gin 


Pro 


Ser 


Ala 


Gly 


Asp 


Gly 


Asn 


Gin 


Asp 


Asp 


Gly 


Pro 


Gin 






40 










45 










50 












cag 


gga 


cca 


ccc 


caa 


caa 


gga 


ggc 


cag 


cag 


caa 


caa 


ggt 


cca 


cca 


cct 


293 


Gin 


Gly 


Pro 


Pro 


Gin 


Gin 


Gly 


Gly 


Gin 


Gin 


Gin 


Gin 


Gly 


Pro 


Pro 


Pro 




55 










60 










65 










70 




cct 


cag 


gga 


aag 


cca 


caa 


gga 


cca 


ccc 


caa 


cag 


gga 


ggc 


cag 


tct 


tgt 


341 


Pro 


Gin 


Gly 


Lys 


Pro 


Gin 


Gly 


Pro 


Pro 


Gin 


Gin 


Gly 


Gly 


Gin 


Ser 


Cys 





75 80 85 



tgc tgt gac aag taacagcett ttttttaaat tgttttttat acaaaaaaaa aaaaa 398 
Cys Cys Asp Lys 
90 



<210> 347 
<211> 525 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 48. .470 



<220> 

<221> sig_pepticie 

<222> 48 . . 119 

<223> Von Heijne matrix 

score 10.8999996185303 
seq LAGLCCLVPVSLA/ED 



<400> 347 

ctcctcagct tcaggcacca ccactgacct gggacagtga atcgaca atg ccg tct 56 

Met Pro Ser 

tct gtc teg tgg ggc ate etc ctg ctg gca ggc ctg tgc tgc ctg gtc 104 
Ser Val Ser Trp Gly He Leu Leu Leu Ala Gly Leu Cys Cys Leu Val 

-20 -15 -10 

cct gtc tec ctg get gag gat ccc cag gga gat get gec cag aag aca 152 
Pro Val Ser Leu Ala Glu Asp Pro Gin Gly Asp Ala Ala Gin Lys Thr 
-5 15 10 
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gat 


aca 


tec 


cac 


cat 


gat 


cag 


gat 


cac 


cca 


acc 


ttc 


aac 


aag 


ate 


acc 


200 


Asp 


Thr 


Ser 


His 
15 


His 


Asp 


Gin 


Asp 


His 
20 


Pro 


Thr 


Phe 


Asn 


Lys 

25 


He 


Thr 




ccc 


aac 


ctg 


get 


gag 


ttc 


gcc 


ttc 


age 


eta 


tac 


cgc 


cag 


ctg 


gca 


cac 


248 


Pro 


Asn 


Leu 
30 


Ala 


Glu 


Phe 


Ala 


Phe 

35 


Ser 


Leu 


Tyr 


Arg 


Gin 
40 


Leu 


Ala 


His 




cag 


tec 


aac 


age 


ace 


aat 


ate 


ttc 


ttc 


tec 


cca 


gtg 


age 


ate 


get 


aca 


296 


Gin 


Ser 
45 


Asn 


Ser 


Thr 


Asn 


He 
50 


Phe 


Phe 


Ser 


Pro 


Val 
55 


Ser 


lie 


Ala 


Thr 




gcc 


ttt 


gca 


atg 


etc 


tec 


ctg 


ggg 


ace 


aag 


get 


gac 


act 


cac 


gat 


gaa 


344 


Ala 


Phe 


Ala 


Met 


Leu 


Ser 


Leu 


Gly 


Thr 


Lys 


Ala 


Asp 


Thr 


His 


Asp 


Glu 




60 










65 










70 










75 




ate 


ctg 


gag 


ggc 


ctg 


aat 


ttc 


aac 


etc 


acg 


gag 


att 


ccg 


gag 


get 


cag 


392 


He 


Leu 


Glu 


Gly 


Leu 
80 


Asn 


Phe 


Asn 


Leu 


Thr 
85 


Glu 


He 


Pro 


Glu 


Ala 
90 


Gin 




ate 


cat 


gaa 


ggc 


ttc 


cag 


gaa 


etc 


etc 


cgt 


acc 


etc 


aac 


cag 


cca 


gac 


440 


He 


His 


Glu 


Gly 
95 


Phe 


Gin 


Glu 


Leu 


Leu 
100 


Arg 


Thr 


Leu 


Asn 


Gin 
105 


Pro 


Asp 




age 


cag 


etc 


cag 


ctg 


ace 


ace 


ggc 


aag 


aat 


taatgettte ttaactaagt 


490 


Ser 


Gin 


Leu 
110 


Gin 


Leu 


Thr 


Thr 


Gly 
115 


Lys 


Asn 

















ttactttgag tattaacaag tagcttatac tgagg 



525 



<210> 348 
<211> 500 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 91 . .462 



<220> 

<221> sig_peptide 

<222> 91. .180 

<223> Von Heijne matrix 

score 9.89999961853027 
seq LLFGFTLVSGTGA/EK 



<400> 348 

acecctaccc cacgccccct cccgcgcgcg eggttaaate cccgcacctg ageategget 60 
cacacctgca ccccgcccgg gcatagcacc atg cct get tgt cgc eta ggc ccg 114 

Met Pro Ala Cys Arg Leu Gly Pro 
-30 -25 



eta 


gcc 


gcc 


gcc 


etc 


etc 


etc 


age 


ctg 


ctg 


ctg 


ttc 


ggc 


ttc 


acc 


eta 


162 


Leu 


Ala 


Ala 


Ala 


Leu 


Leu 


Leu 


Ser 


Leu 


Leu 


Leu 


Phe 


Gly 


Phe 


Thr 


Leu 








-20 










-15 










-10 










gtc 


tea 


ggc 


aca 


gga 


gca 


gag 


aag 


act 


ggc 


gtg 


tgc 


ccc 


gag 


etc 


cag 


210 


Val 


Ser 


Gly 


Thr 


Gly 


Ala 


Glu 


Lys 


Thr 


Gly 


Val 


Cys 


Pro 


Glu 


Leu 


Gin 






-5 










1 








5 










10 




get 


gac 


cag 


aac 


tgc 


acg 


caa 


gag 


tgc 


gtc 


teg 


gac 


age 


gaa 


tgc 


gcc 


258 


Ala 


Asp 


Gin 


Asn 


Cys 


Thr 


Gin 


Glu 


Cys 


Val 


Ser 


Asp 


Ser 


Glu 


Cys 


Ala 












15 










20 










25 






gac 


aac 


etc 


aag 


tgc 


tgc 


age 


gcg 


ggc 


tgt 


gcc 


acc 


ttc 


tgc 


tct 


ctg 


306 


Asp 


Asn 


Leu 


Lys 


Cys 


Cys 


Ser 


Ala 


Gly 


Cys 


Ala 


Thr 


Phe 


Cys 


Ser 


Leu 










30 










35 










40 








ccc 


aat 


gat 


aag 


gag 


ggt 


tec 


tgc 


ccc 


cag 


gtg 


aac 


att 


aac 


ttt 


ccc 


354 


Pro 


Asn 


Asp 


Lys 


Glu 


Gly 


Ser 


Cys 


Pro 


Gin 


Val 


Asn 


He 


Asn 


Phe 


Pro 








45 










50 










55 










cag 


etc 


ggc 


etc 


tgt 


egg 


gac 


cag 


tgc 


cag 


gtg 


gac 


age 


cag 


tgt 


cct 


402 


Gin 


Leu 


Gly 


Leu 


Cys 


Arg 


Asp 


Gin 


Cys 


Gin 


Val 


Asp 


Ser 


Gin 


Cys 


Pro 






60 










65 










70 












ggc 


cag 


atg 


aaa 


tgc 


tgc 


cgc 


aat 


ggc 


tgt 


ggg 


aag 


gtg 


tec 


tgt 


gtc 


450 
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Gly Gin Met Lys Cys Cys Arg Asn Gly Cys Gly Lys Val Ser Cys Val 

75 80 85 90 

act ccc aat ttc tgagctccag ccaccaccag gctgagcagt gaggagag 500 

Thr Pro Asn Phe 



<210> 349 
<211> 519 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 28. .399 



<220> 

<221> sig_peptide 

<222> 28 . .117 

<223> Von Heijne matrix 

score 9.89999961853027 
seq LLFGFTLVSGTGA/EK 



<400> 349 

acctgcaccc cgcccgggca tagcacc atg cct get tgt cgc eta ggc ccg eta 54 

Met Pro Ala Cys Arg Leu Gly Pro Leu 
-30 -25 



gec 


gee 


gee 


etc 


etc 


etc 


age 


ctg 


ctg 


ctg 


ttc 


ggc 


ttc 


ace 


eta 


gtc 


102 


Ala 


Ala 
-20 


Ala 


Leu 


Leu 


Leu 


Ser 
-15 


Leu 


Leu 


Leu 


Phe 


Gly 
-10 


Phe 


Thr 


Leu 


Val 




tea 


ggc 


aca 


gga 


gca 


gag 


aag 


act 


ggc 


gtg 


tgc 


ccc 


gag 


etc 


cag 


get 


150 


Ser 


Gly 


Thr 


Gly 


Ala 


Glu 


Lys 


Thr 


Gly 


Val 


Cys 


Pro 


Glu 


Leu 


Gin 


Ala 




-5 










1 








5 










10 






gac 


cag 


aac 


tgc 


acg 


caa 


gag 


tgc 


gtc 


teg 


gac 


age 


gaa 


tgc 


gee 


gac 


198 


Asp 


Gin 


Asn 


Cys 
15 


Thr 


Gin 


Glu 


Cys 


Val 

20 


Ser 


Asp 


Ser 


Glu 


Cys 
25 


Ala 


Asp 




aac 


ate 


aag 


tgc 


tgc 


age 


gcg 


ggc 


tgt 


gee 


ace 


ttc 


tgc 


tct 


ctg 


ccc 


246 


Asn 


He 


Lys 
30 


Cys 


Cys 


Ser 


Ala 


Gly 
35 


Cys 


Ala 


Thr 


Phe 


Cys 
40 


Ser 


Leu 


Pro 




aat 


gat 


aag 


gag 


ggt 


tec 


tgc 


ccc 


cag 


gtg 


aac 


att 


aac 


ttt 


ccc 


cag 


294 


Asn 


Asp 
45 


Lys 


Glu 


Gly 


Ser 


Cys 
50 


Pro 


Gin 


Val 


Asn 


He 
55 


Asn 


Phe 


Pro 


Gin 




etc 


ggc 


etc 


tgt 


egg 


gac 


cag 


tgc 


cag 


gtg 


gac 


age 


cag 


tgt 


cct 


ggc 


342 


Leu 


Gly 


Leu 


Cys 


Arg 


Asp 


Gin 


Cys 


Gin 


Val 


Asp 


Ser 


Gin 


Cys 


Pro 


Gly 




60 










65 










70 










75 




cag 


atg 


aaa 


tgc 


tgc 


cgc 


aat 


ggc 


tgt 


ggg 


aag 


gtg 


tec 


tgt 


gtc 


act 


390 


Gin 


Met 


Lys 


Cys 


Cys 
80 


Arg 


Asn 


Gly 


Cys 


Gly 
85 


Lys 


Val 


Ser 


Cys 


Val 
90 


Thr 





ccc aat ttc tgagctccag ccacaggctg agcagtgagg agagaaagtt 439 
Pro Asn Phe 

tctgcctggc cctgcatctg gttccagccc acctgccctc ccctttttcg gggactctgt 4 99 

attccctctt gggctgacca 519 



<210> 350 
<211> 482 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 40. .246 



<220> 

<221> sig_peptide 
<222> 40. . 90 



WO 02/083898 



PCT/IB01/00914 



197 

<223> Von Heijne matrix 
score 9.5 

seq FVFLLFVISLAAA/AH 



<400> 350 

aaaaagtagg acaccggagc tagcagtggc aacccactt atg ttt cct ggc ttt 

Met Phe Pro Gly Phe 
-15 

gtt ttt ttg etc ttt gta ata agt ctt get get get get cat ctt tgg 
Val Phe Leu Leu Phe Val lie Ser Leu Ala Ala Ala Ala His Leu Trp 

-10 -5 1 

gtc ctg gec gee ttt atg ggc cgt ate ace gtg aag gtc tgc age ttc 
Val Leu Ala Ala Phe Met Gly Arg lie Thr Val Lys Val Cys Ser Phe 
5 10 15 20 

act cct gag gee age aag acc gtg age cca cca gaa gga gcg aac aac 
Thr Pro Glu Ala Ser Lys Thr Val Ser Pro Pro Glu Gly Ala Asn Asn 

2 5 30 35 

teg aga cgc act get ttt aag age tgt aac act cac cac aaa ggt ctg 
Ser Arg Arg Thr Ala Phe Lys Ser Cys Asn Thr His His Lys Gly Leu 

40 45 50 

tagcatcact cctgaagtca gcgggaccac gaacctacca gaaagaagaa actccataca 
gatctgaaaa tctgaaggaa caaactccgg acacaccatc tttaagaact gtaacactca 
ctgcgagggt ccacggcttc atttttgaag tcagccagac caagaaccca ccaattccgc 
acacacttct gctgtttcgt tccattctca aggaaaactg tgatgggtcc attttt 



54 



102 



150 



198 



246 



306 
366 
426 
482 



<210> 351 
<211> 462 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 114 . .344 



<220> 

<221> sig_peptide 

<222> 114 . .212 

<223> Von Heijne matrix 

score 9.30000019073486 
seq MVLLS FALTLCSA/ FW 



<400> 351 

wctgctcact gctgggtact gttctgctgt gggtgcccag gaagggacta cacctcttcg 60 
cagtgtttta tacctttggt aatategcat caattgggag taccatcttc etc atg 116 

Met 

164 



212 



gga 


cca 


gtg 


aaa 


cag 


ctg 


aag 


cga 


atg 


ttt 


gag 


cct 


act 


cgt 


ttg 


att 


Gly 


Pro 


Val 
-30 


Lys 


Gin 


Leu 


Lys 


Arg 
-25 


Met 


Phe 


Glu 


Pro 


Thr 
-20 


Arg 


Leu 


He 


gca 


act 


ate 


atg 


gtg 


ctg 


ttg 


agt 


ttt 


gca 


ctt 


acc 


ctg 


tgt 


tct 


gee 


Ala 


Thr 
-15 


He 


Met 


Val 


Leu 


Leu 
-10 


Ser 


Phe 


Ala 


Leu 


Thr 
-5 


Leu 


Cys 


Ser 


Ala 


ttt 


tgg 


tgg 


cat 


aac 


atg 


gga 


ctt 


gca 


ctt 


ate 


ttc 


tgc 


att 


ttg 


cag 


Phe 


Trp 


Trp 


His 


Asn 


Met 


Gly 


Leu 


Ala 


Leu 


He 


Phe 


Cys 


He 


Leu 


Gin 


1 








5 










10 










15 




tct 


ttg 


gca 


ttg 


acg 


tgg 


tac 


age 


ctt 


tec 


ttc 


ata 


cca 


ttt 


gca 


agg 


Ser 


Leu 


Ala 


Leu 
20 


Thr 


Trp 


Tyr 


Ser 


Leu 
25 


Ser 


Phe 


He 


Pro 


Phe 
30 


Ala 


Arg 


gat 


get 


gtg 


aag 


aag 


tgt 


ttt 


gee 


gtg 


tgt 


ctt 


gca 


taattcatgg 




Asp 


Ala 


Val 
35 


Lys 


Lys 


Cys 


Phe 


Ala 
40 


Val 


Cys 


Leu 


Ala 











260 



308 



354 



ccagttttat gaagctttgg aaggcactat ggacagaagc tggtggacag ttttgtaact 414 
atcttcgaaa cctctgtctt acagacatgt gecttttate ttgeagea 462 



WO 02/083898 



PCT/IB01/00914 



<210> 352 
<211> 496 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 29. .466 



<220> 

<221> sig_peptide 

<222> 29. . 106 

<223> Von Heijne matrix 

score 8.89999961853027 
seq GLLWAFCAPGARA/EE 



<400> 352 

cgtctgcttc ggagaccgta agatattg atg acc atg aga tec ctg etc aga 52 

Met Thr Met Arg Ser Leu Leu Arg 
-25 -20 
acc ccc ttc ctg tgt ggc ctg etc tgg gec ttt tgt gec cca ggc gec 100 
Thr Pro Phe Leu Cys Gly Leu Leu Trp Ala Phe Cys Ala Pro Gly Ala 

-15 -10 -5 

agg get gag gag cct gca gec age ttc tec caa ccc ggc age atg ggc 148 
Arg Ala Glu Glu Pro Ala Ala Ser Phe Ser Gin Pro Gly Ser Met Gly 

15 10 
ctg gat aag aac aca gtg cac gac caa gag cat ate atg gag cat eta 196 
Leu Asp Lys Asn Thr Val His Asp Gin Glu His lie Met Glu His Leu 
15 20 25 30 

gaa ggt gtc ate aac aaa cca gag gcg gag atg teg cca caa gaa ttg 244 
Glu Gly Val lie Asn Lys Pro Glu Ala Glu Met Ser Pro Gin Glu Leu 

35 40 45 

cag etc cat tac ttc aaa atg cat gat tat gat ggc aat aat ttg ctt 292 
Gin Leu His Tyr Phe Lys Met His Asp Tyr Asp Gly Asn Asn Leu Leu 

50 55 60 

gat ggc tta gaa etc tec aca gec ate act cat gtc cat aag gag gaa 340 
Asp Gly Leu Glu Leu Ser Thr Ala lie Thr His Val His Lys Glu Glu 

65 70 75 

ggg agt gaa cag gca cca eta atg agt gaa gat gaa ctg att aac ata 388 
Gly Ser Glu Gin Ala Pro Leu Met Ser Glu Asp Glu Leu lie Asn lie 

80 85 90 

ata gat ggt gtt ttg aga gat gat gac aag aac aat gat gga tac att 436 
lie Asp Gly Val Leu Arg Asp Asp Asp Lys Asn Asn Asp Gly Tyr lie 
95 100 105 110 

gac tat get gaa ttt gca aaa tea ctg cag tagatgttat ttggcatctc 486 
Asp Tyr Ala Glu Phe Ala Lys Ser Leu Gin 
115 120 
ctggttatat 496 



<210> 353 
<211> 499 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 155. .340 



<220> 

<221> sig_peptide 

<222> 155. .292 

<223> Von Heijne matrix 

score 8.60000038146973 



WO 02/083898 

seq AVLLLILFAIVFG/LL 



199 



PCT/IB01/00914 



<400> 353 

cttttcctct caacagttgc ttctttgagt cagggtgcag ctctggtcac ctggcggcct 60 
cttcagctca gccctccaca aagtgtgagc ctgaaggacc accctgaatt gcccttgtag 120 
gacccagaac agcwaccagc agaatcagat tctc atg gac caa ytg gta ttc aaa 175 

Met Asp Gin Leu Val Phe Lys 
-45 -40 



gag 


aca 


ate 


tgg 


aat 


gat 


gcg 


ttc 


tgg 


cag 


aac 


ccc 


tgg 


gac 


cag 


ggg 


223 


Glu 


Thr 


He 


Trp 


Asn 


Asp 


Ala 


Phe 


Trp 


Gin 


Asn 


Pro 


Trp 


Asp 


Gin 


Gly 












-35 










-30 










-25 






ggc 


ctg 


gca 


gtg 


att 


ate 


tta 


ttc 


ate 


ace 


get 


gtc 


ctg 


ctt 


etc 


ate 


271 


Gly 


Leu 


Ala 


Val 


He 


He 


Leu 


Phe 


He 


Thr 


Ala 


Val 


Leu 


Leu 


Leu 


He 










-20 










-15 










-10 








tta 


ttt 


gec 


ate 


gtg 


ttt 


ggt 


tta 


etc 


act 


tec 


aca 


gaa 


aac 


act 


cag 


319 


Leu 


Phe 


Ala 
-5 


He 


Val 


Phe 


Gly 


Leu 
i 


Leu 


Thr 


Ser 


Thr 


Glu 


Asn 


Thr 


Gin 




tgt 


gaa 


gcg 


ggt 


gaa 


gag 


gag 


tgacctgact 


tgctggggac tgagatggca 


370 


Cys 


Glu 


Ala 


Gly 


Glu 


Glu 


Glu 






















10 










15 

























gcaggggagg cgagctgacc tgcccccatt ccagtggtgg gcccttcgcg gttccctctg 430 
gctcaggggc caagcctggt gtcttccttt cccaccagga aaaagtctag taaaatactg 490 
tatctggct 4 99 



<210> 354 
<211> 537 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 13. .498 



<220> 

<221> sig_peptide 

<222> 13. .153 

<223> Von Heijne matrix 

score 8.30000019073486 
seq LALSSLLSLLLFA/GM 



<400> 354 

tagegctgae gc atg cgc ata get aac cgc ace egg ttc age teg cct ttc 51 
Met Arg He Ala Asn Arg Thr Arg Phe Ser Ser Pro Phe 
-45 -40 -35 

ttg gee aga ggc gee ggt tgg act cac ggg egg ggc atg atg gtg gtg 99 
Leu Ala Arg Gly Ala Gly Trp Thr His Gly Arg Gly Met Met Val Val 

-30 -25 -20 

ggt acg ggc ace teg ctg gcg etc tec tec etc ctg tec ctg ctg etc 147 
Gly Thr Gly Thr Ser Leu Ala Leu Ser Ser Leu Leu Ser Leu Leu Leu 

-15 -10 -5 

ttt get ggg atg cag atg tac age cgt cag ctg gee tec ace gag tgg 195 
Phe Ala Gly Met Gin Met Tyr Ser Arg Gin Leu Ala Ser Thr Glu Trp 

15 10 
etc ace ate cag ggc ggc ctg ctt ggt teg ggt etc ttc gtg ttc teg 243 
Leu Thr lie Gin Gly Gly Leu Leu Gly Ser Gly Leu Phe Val Phe Ser 
15 20 25 30 

etc act gee ttc aat aat ctg gag aat ctt gtc ttt ggc aaa gga ttc 291 
Leu Thr Ala Phe Asn Asn Leu Glu Asn Leu Val Phe Gly Lys Gly Phe 

35 40 45 

caa gca aag ate ttc cct gag att etc ctg tgc etc ctg ttg get etc 339 
Gin Ala Lys He Phe Pro Glu He Leu Leu Cys Leu Leu Leu Ala Leu 

50 55 60 

ttt gca tct ggc etc ate cac cga gtc tgt gtc ace ace tgc ttc ate 387 



WO 02/083898 

Phe Ala Ser Gly Leu 
65 

ttc tec atg gtt ggt 
Phe Ser Met Val Gly 
80 

tac cag gca gca get 
Tyr Gin Ala Ala Ala 
95 

age aag aag aga aac 
Ser Lys Lys Arg Asn 
115 

<210> 355 
<211> 514 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 132. .401 

<220> 

<221> sig_peptide 
<222> 132. .236 
<223> Von Heijne matrix 

score 8.30000019073486 

seq LGLGLVTLTASWA/AL 

<400> 355 

ccttcagckc ccggccgaga gcccggtggc tgcgactgak agcctggtta ctgccggcac 60 
ctttggatcc teggecaatt ctctgcttcg cctacacttg gcgtgcggtg taccagcctc 120 
cccgcgtcac c atg gaa aca gga gec tct gca tec ate cca gag ctg ate 170 
Met Glu Thr Gly Ala Ser Ala Ser lie Pro Glu Leu lie 
-35 -30 -25 



tgt 


gaa 


get 


atg 


aga 


aga 


ate 


tgg 


age 


ctg 


gga 


ttg 


ggg 


ttg 


gtg 


act 


218 


Cys 


Glu 


Ala 


Met 


Arg 


Arg 


He 


Trp 


Ser 


Leu 


Gly 


Leu 


Gly 


Leu 


Val 


Thr 








-20 










-15 










-10 










ctg 


acg 


gee 


age 


tgg 


gca 


get 


ctt 


ttc 


cac 


gat 


ggc 


ttt 


gcg 


gtt 


ctt 


266 


Leu 


Thr 


Ala 


Ser 


Trp 


Ala 


Ala 


Leu 


Phe 


His 


Asp 


Gly 


Phe 


Ala 


Val 


Leu 






-5 










1 








5 










10 




gga 


gga 


aac 


att 


gtg 


age 


gat 


etc 


age 


aca 


gta 


aga 


ttt 


gtt 


gca 


caa 


314 


Gly 


Gly 


Asn 


He 


Val 


Ser 


Asp 


Leu 


Ser 


Thr 


Val 


Arg 


Phe 


Val 


Ala 


Gin 












15 










20 










25 






cag 


cag 


cac 


ttc 


cag 


etc 


ctt 


gac 


gtg 


tgg 


ace 


agg 


aat 


ttc 


egg 


aag 


362 


Gin 


Gin 


His 


Phe 


Gin 


Leu 


Leu 


Asp 


Val 


Trp 


Thr 


Arg 


Asn 


Phe 


Arg 


Lys 










30 










35 










40 








cca 


ctg 


ggc 


age 


atg 


tgc 


ttt 


gtt 


ttc 


ttg 


ttg 


etc 


cca 


taatcaatgt 


411 


Pro 


Leu 


Gly 


Ser 


Met 


Cys 


Phe 


Val 


Phe 


Leu 


Leu 


Leu 


Pro 











45 50 55 



tgggcatcaa gatctaaccc ttgaaccttc cacgaaccct gttgtcaata cctctgggtt 471 
tccgccggtt aegcttaatt ttgacatatc ggtcagactg gtg 514 

<210> 356 
<211> 589 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 41. .223 



PCT/IB01/00914 

200 

He His Arg Val Cys Val Thr Thr Cys Phe He 

70 75 
ctg tac tac ate aac aag ate tec tec acc ctg 435 
Leu Tyr Tyr lie Asn Lys He Ser Ser Thr Leu 

85 90 
cca gtc etc aca cca gee aag gtc aca ggc aag 483 
Pro Val Leu Thr Pro Ala Lys Val Thr Gly Lys 
100 105 110 

tgaccctgaa tgttcaataa agttgattct ttgcaaaaa 537 



<220> 

<221> sig_peptide 



WO 02/083898 PCT/IB01/00914 

201 

<222> 41 . .115 

<223> Von Heijne matrix 

score 8.10000038146973 

seq ATLLLVLCLQLGA/NK 



<400> 356 

accacttctc tgggacacat tgccttctgt tttctccagc 



etc ctg ttc agg gec age cct gec acc ctg etc 
Leu Leu Phe Arg Ala Ser Pro Ala Thr Leu Leu 
-20 -15 -10 

cag ttg ggg gec aac aaa get cag gac aac act 
Gin Leu Gly Ala Asn Lys Ala Gin Asp Asn Thr 

1 5 
aag aat ttt gac att ccc aag tea gta cgt cca 
Lys Asn Phe Asp lie Pro Lys Ser Val Arg Pro 

15 20 
cag tgc ttg cag ttc aaa cag aat tgaaagaatg 
Gin Cys Leu Gin Phe Lys Gin Asn 

30 35 
tcattagcag catccctcta caaggtgeat ttaactataa 
acgacaatcc aaaaaccttc tactgggact tttacaccaa 
ccgtcgttga tgttattcgg gaattaggca tetgectgat 
caaaaacaac eggttttata ctattgaaat cctaaaggta 
gtttgecaca cccaggtgat ttcctctaaa gaaacttggc 
ataaaataaa cttcttaaca tacaaaaaaa aaaaaa 



atg cgc ttg etc cag 
Met Arg Leu Leu Gin 
-25 

ctg gtt etc tgc ctg 
Leu Val Leu Cys Leu 
-5 

egg aag ate ata ata 
Arg Lys lie lie lie 
10 

aat gac gaa gya ctg 
Asn Asp Glu Xaa Leu 
25 

catggtggtt aaaacttacc 



gtatactgee 
cagaactgtg 
gatgetgetg 
gaataatgga 
tggaatttct 



tgcctatgtg 
caaattgeag 
taatccccat 
agccctgtct 
gctgtggtct 



55 



103 



151 



199 



253 



313 
373 
433 
4 93 
553 
589 



<210> 357 
<211> 465 
<212> DMA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 19. .378 



<220> 

<221> sig_peptide 

<222> 19. . 114 

<223> Von Heijne matrix 

score 7.59999990463257 
seq VTLGIGFFALASA/LW 

<400> 357 



ectaategmw aaagtcat 


atg 


get 


tea 


cct 


aaa 


ggt 


ttt 


ttt 


aat 


tac 


ttg 


51 












Met 


Ala 


Ser 
-30 


Pro 


Lys 


Gly 


Phe 


Phe 
-25 


Asn 


Tyr 


Leu 




acc 


tat 


ttt 


ctt 


get 


get 


ggt 


get 


gtc 


act 


ttg 


gga 


att 


ggt 


ttc 


ttt 


99 


Thr 


Tyr 
-20 


Phe 


Leu 


Ala 


Ala 


Gly 
-15 


Ala 


Val 


Thr 


Leu 


Gly 
-10 


He 


Gly 


Phe 


Phe 




get 


ttg 


gca 


tea 


get 


ttg 


tgg 


ttc 


ctg 


att 


tgc 


aaa 


cga 


aga 


gaa 


ata 


147 


Ala 


Leu 


Ala 


Ser 


Ala 


Leu 


Trp 


Phe 


Leu 


He 


Cys 


Lys 


Arg 


Arg 


Glu 


He 




-5 










1 








5 










10 






ttt 


caa 


aat 


tec 


aaa 


ttt 


aaa 


gca 


att 


gat 


gag 


aga 


tgc 


agg 


caa 


aga 


195 


Phe 


Gin 


Asn 


Ser 
15 


Lys 


Phe 


Lys 


Ala 


He 
20 


Asp 


Glu 


Arg 


Cys 


Arg 
25 


Gin 


Arg 




cca 


tea 


atg 


gcg 


aaa 


att 


aaa 


tct 


cat 


tct 


cag 


tgt 


gtt 


ttt 


att 


tct 


243 


Pro 


Ser 


Met 
30 


Ala 


Lys 


He 


Lys 


Ser 
35 


His 


Ser 


Gin 


Cys 


Val 
40 


Phe 


He 


Ser 




cga 


aat 


ttt 


cat 


act 


ggg 


aga 


ttc 


caa 


tta 


cag 


caa 


tta 


aag 


ate 


att 


291 


Arg 


Asn 
45 


Phe 


His 


Thr 


Gly 


Arg 
50 


Phe 


Gin 


Leu 


Gin 


Gin 
55 


Leu 


Lys 


He 


He 
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202 

eta aag atg aac ccc aac ttg caa caa aaa ata tea ttt gtg ate cct 339 

Leu Lys Met Asn Pro Asn Leu Gin Gin Lys He Ser Phe Val He Pro 

60 "* 65 70 75 

cag aga cca get cca caa caa ate gca gca gtg tta cat taagcttatc 388 

Gin Arg Pro Ala Pro Gin Gin He Ala Ala Val Leu His 

80 85 
aacattacca tctgattctt attacageca aagtatagaa gcagctgatg actggttttc 448 
tgatgattct ctagtga 465 



<210> 358 
<211> 438 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 84 . .317 



<220> 

<221> sig_peptide 

<222> 84 . . 140 

<223> Von Heijne matrix 

score 7.59999990463257 
seq ALVLALMI SMI SA/ DS 



<400> 358 

atagaaaagg acatctcttg agacttcact tcagcttcac tgacttcttg actctcctct 60 
tgagtaaaag gactcagcca act atg aag ttt ttt gtc ttt get tta gtc ttg 113 

Met Lys Phe Phe Val Phe Ala Leu Val Leu 
-15 -10 



get 


etc 


atg 


att 


tec 


atg 


att 


age 


get 


gat 


tea 


cat 


gaa 


aag 


aga 


cat 


161 


Ala 


Leu 


Met 


He 


Ser 
-5 


Met 


He 


Ser 


Ala 


Asp 
1 


Ser 


His 


Glu 


Lys 
5 


Arg 


His" 




cat 


ggg 


tat 


aga 


aga 


aaa 


ttc 


cat 


gaa 


aag 


cat 


cat 


tea 


tac 


cat 


ate 


209 


His 


Gly 


Tyr 


Arg 


Arg 


Lys 


Phe 


His 


Glu 


Lys 


His 


His 


Ser 


Tyr 


His 


He 








10 










15 










20 










aca 


eta 


eta 


cca 


ctt 


ttt 


gaa 


gaa 


tea 


tea 


aag 


age 


aat 


gca 


aat 


gaa 


257 


Thr 


Leu 


Leu 


Pro 


Leu 


Phe 


Glu 


Glu 


Ser 


Ser 


Lys 


Ser 


Asn 


Ala 


Asn 


Glu 






25 










30 










35 












aaa 


cac 


tat 


aat 


tta 


ctg 


tat 


act 


ctt 


tgt 


ttc 


agg 


ata 


ctt 


gee 


ttt 


305 


Lys 


His 


Tyr 


Asn 


Leu 


Leu 


Tyr 


Thr 


Leu 


Cys 


Phe 


Arg 


He 


Leu 


Ala 


Phe 




40 










45 










50 










55 





tea att gtc act tgatgatata attgeaattt aaactgttaa gctgtgttca 357 
Ser He Val Thr 

gtactgtttc tgaataatag aaatcacttc tctaaaagca ataaatttca agcacatttt 417 

taaataaaaa aaatmtmaaa a 438 



<210> 359 
<211> 537 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 57. .317 



<220> 

<221> sig_peptide 

<222> 57. .116 

<223> Von Heijne matrix 

score 7.40000009536743 
seq LLVFLAGFPVLDA/ND 



WO 02/083898 



PCT/IB01/00914 



203 



<400> 359 

gatttctccc ggaacctctg ctcagcctgg tgaaccacac aggccagcgc tctgac atg 

Met 
-20 

cag aag gtg acc ctg ggc ctg ctt gtg ttc ctg gca ggc ttt cct gtc 
Gin Lys Val Thr Leu Gly Leu Leu Val Phe Leu Ala Gly Phe Pro Val 

-15 -10 -5 

ctg gac gcc aat gac eta gaa gat aaa aac agt cct ttc tac tat gac 
Leu Asp Ala Asn Asp Leu Glu Asp Lys Asn Ser Pro Phe Tyr Tyr Asp 

15 10 
tgg cac age etc cag gtt ggc ggg etc ate tgc get ggg gtt ctg tgc 
Trp His Ser Leu Gin Val Gly Gly Leu lie Cys Ala Gly Val Leu Cys 

15 20 25 

gcc atg ggc ate ate ate gtc atg agt gca aaa tgc aaa tgc aag ttt 
Ala Met Gly lie lie lie Val Met Ser Ala Lys Cys Lys Cys Lys Phe 
30 35 40 45 

ggc cag aag tec ggt cac cat cca ggg gag act cca cct etc ate acc 
Gly Gin Lys Ser Gly His His Pro Gly Glu Thr Pro Pro Leu lie Thr 

50 55 60 

cca ggc tea gcc caa age tgatgaggac agaccagctg aaattgggtg 
Pro Gly Ser Ala Gin Ser 

65 

gaggacegtt ctctgtcccc aggtcctgtc tetgeacaga aacttgaact ccaggatgga 
attcttcctc ctctgctggg actcctttgc atggcagggc tcatctcacc tetegcaaga 
gggtctcttt gttcaatttt ttttaatcta aaatgattgt gcctctgccc aaaaaaaaaa 
aaaaaaagct 



59 



107 



155 



203 



251 



299 



347 



407 
4 67 
527 
537 



<210> 360 
<211> 499 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 69. .383 



<220> 

<221> sig_peptide 

<222> 69. . 125 

<223> Von Heijne matrix 

score 7.19999980926514 
seq VSIMLLLVTVSDC/AV 



<400> 360 

acaaggctga gcgggaggaa gegagaggea tctaagcagg cagtgttttg ccttcacccc 60 
aagtgacc atg aga ggt gcc acg cga gtc tea ate atg etc etc eta gta 110 
Met Arg Gly Ala Thr Arg Val Ser lie Met Leu Leu Leu Val 
-15 -10 



act 


gtg 


tct 


gac 


tgt 


get 


gtg 


ate 


aca 


ggg 


gcc 


tgt 


gag 


cga 


gat 


gtc 


158 


Thr 


Val 


Ser 


Asp 


Cys 


Ala 


Val 


He 


Thr 


Gly 


Ala 


Cys 


Glu 


Arg 


Asp 


Val 




-5 










1 








5 










10 






cag 


tgt 


ggg 


gca 


ggc 


acc 


tgc 


tgt 


gcc 


ate 


age 


ctg 


tgg 


ctt 


cga 


ggg 


206 


Gin 


Cys 


Gly 


Ala 


Gly 


Thr 


Cys 


Cys 


Ala 


He 


Ser 


Leu 


Trp 


Leu 


Arg 


Gly 










15 










20 










25 








ctg 


egg 


atg 


tgc 


acc 


ccg 


ctg 


ggg 


egg 


waa 


ggc 


gag 


gag 


tgc 


cac 


ccc 


254 


Leu 


Arg 


Met 


Cys 


Thr 


Pro 


Leu 


Gly 


Arg 


Xaa 


Gly 


Glu 


Glu 


Cys 


His 


Pro 








30 










35 










40 










ggc 


age 


cac 


aag 


ate 


ccc 


ttc 


ttc 


agg 


aaa 


cgc 


aag 


cac 


cac 


acc 


tgt 


302 


Gly 


Ser 


His 


Lys 


He 


Pro 


Phe 


Phe 


Arg 


Lys 


Arg 


Lys 


His 


His 


Thr 


Cys 






45 










50 










55 












cct 


tgc 


ttg 


ccc 


aac 


ctg 


ctg 


tgc 


tec 


agg 


ttc 


ccg 


gac 


ggc 


agg 


tac 


350 


Pro 


Cys 


Leu 


Pro 


Asn 


Leu 


Leu 


Cys 


Ser 


Arg 


Phe 


Pro 


Asp 


Gly 


Arg 


Tyr 




60 










65 










70 










75 





WO 02/083898 



PCT/IB01/00914 



204 



cgc tgc tec atg gac ttg aag aac ate aat ttt taggegcttg cctggtctca 
Arg Cys Ser Met Asp Leu Lys Asn lie Asn Phe 

80 " 85 

ggatacccac catccttttc ctgagcacag cctggatttt tatttctgee atgaaaccca 
gctcccatga ctctcccagt ccctasactg actacc 



403 



463 
499 



<210> 361 
<211> 479 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 129. . 434 



<220> 

<221> sig_peptide 
<222> 129. .236 
<223> Von Heijne matrix 
score 7 

seq PFFLITLVGVVVA/VV 



<400> 361 

tccggggccc gggagecaac egagggegtt cctgtcgggg ctgcagcggc gggagggagc 60 
ccagtggagg cgccctcccg aagcgccact geycatgetg accacccagc cctccggctg 120 
ctgatgtc atg agt aac acc act gtg ccc aat gec ccc cag gee aac age 170 
Met Ser Asn Thr Thr Val Pro Asn Ala Pro Gin Ala Asn Ser 
-35 -30 -25 



gac 


tec 


atg 


gtg 


ggc 


tat 


gtg 


ttg 


ggg 


ccc 


ttc 


ttc 


etc 


ate 


acc 


ctg 


218 


Asp 


Ser 


Met 
-20 


Val 


Gly 


Tyr 


Val 


Leu 
-15 


Gly 


Pro 


Phe 


Phe 


Leu 
-10 


He 


Thr 


Leu 




gtc 


ggg 


gtg 


gtg 


gtg 


get 


gtg 


gta 


atg 


tat 


gta 


cag 


aag 


aaa 


aag 


egg 


266 


Val 


Gly 
-5 


Val 


Val 


Val 


Ala 


Val 
1 


Val 


Met 


Tyr 


Val 
5 


Gin 


Lys 


Lys 


Lys 


Arg 
10 




gtg 


gac 


egg 


ctg 


cgc 


cat 


cac 


ctg 


etc 


ccc 


atg 


tac 


age 


tat 


gac 


cca 


314 


Val 


Asp 


Arg 


Leu 


Arg 
15 


His 


His 


Leu 


Leu 


Pro 
20 


Met 


Tyr 


Ser 


Tyr 


Asp 
25 


Pro 




get 


gag 


gaa 


ctg 


crt 


gag 


get 


gag 


cag 


gag 


ctg 


etc 


tct 


gac 


atg 


gga 


362 


Ala 


Glu 


Glu 


Leu 
30 


Xaa 


Glu 


Ala 


Glu 


Gin 
35 


Glu 


Leu 


Leu 


Ser 


Asp 
40 


Met 


Gly 




gac 


ccc 


aag 


gtg 


gta 


cat 


ggc 


tgg 


cag 


agt 


ggc 


tac 


cag 


cac 


aag 


egg 


410 


Asp 


Pro 


Lys 
45 


Val 


Val 


His 


Gly 


Trp 

50 


Gin 


Ser 


Gly 


Tyr 


Gin 
55 


His 


Lys 


Arg 




atg 


cca 


ctg 


ctg 


gat 


gtc 


aag 


acg 


tgacctgacc cccttgcccc acccttcaga 


464 


Met 


Pro 
60 


Leu 


Leu 


Asp 


Val 


Lys 
65 


Thr 





















gcctggggtc ctgga 479 



<210> 362 
<211> 453 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 30. .440 



<220> 

<221> sig__peptide 
<222> 30. . 83 
<223> Von Heijne matrix 
score 7 

seq VLGLVLLSVTVQG/KV 



WO 02/083898 



PCT/IB01/00914 



205 



<400> 362 

cagcctagca ctctgaccta gcagtcaac atg aag get etc att gtt ctg ggg 53 



















Met 


Lys 


Ala 


Leu 
-15 


He 


Val 


Leu 


Gly 




ctt 


gtc 


etc 


ctt 


tct 


gtt 


acg 


gtc 


cag 


ggc 


aag 


gtc 


ttt 


gaa 


agg 


tgt 


101 


Leu 


Val 


Leu 


Leu 


Ser 


Val 


Thr 


Val 


Gin 


Gly 


Lys 


Val 


Phe 


Glu 


Arg 


Cys 




-10 










-5 










1 








5 






gag 


ttg 


gee 


aga 


act 


ctg 


aaa 


aga 


ttg 


gga 


atg 


gat 


ggc 


tac 


agg 


gga 


149 


Glu 


Leu 


Ala 


Arg 


Thr 


Leu 


Lys 


Arg 


Leu 


Gly 


Met 


Asp 


Gly 


Tyr 


Arg 


Gly 










10 










15 










20 








ate 


age 


eta 


gca 


aac 


tgg 


atg 


tgt 


ttg 


gee 


aaa 


tgg 


gag 


agt 


ggt 


tac 


197 


He 


Ser 


Leu 


Ala 


Asn 


Trp 


Met 


Cys 


Leu 


Ala 


Lys 


Trp 


Glu 


Ser 


Gly 


Tyr 








25 










30 










35 










aac 


aca 


cga 


get 


aca 


aac 


tac 


aat 


get 


gga 


gac 


aga 


age 


act 


gat 


tat 


245 


Asn 


Thr 


Arg 


Ala 


Thr 


Asn 


Tyr 


Asn 


Ala 


Gly 


Asp 


Arg 


Ser 


Thr 


Asp 


Tyr 






40 










45 










50 












ggg 


ata 


ttt 


cag 


ate 


aat 


age 


cgc 


tac 


tgg 


tgt 


aat 


gat 


ggc 


aaa 


acc 


293 


Gly 


He 


Phe 


Gin 


He 


Asn 


Ser 


Arg 


Tyr 


Trp 


Cys 


Asn 


Asp 


Gly 


Lys 


Thr 




55 










60 










65 










70 




cca 


gga 


gca 


gtt 


aat 


gec 


tgt 


cat 


tta 


tec 


tgc 


agt 


ggg 


tgg 


cat 


gga 


341 


Pro 


Gly 


Ala 


Val 


Asn 


Ala 


Cys 


His 


Leu 


Ser 


Cys 


Ser 


Gly 


Trp 


His 


Gly 












75 










80 










85 






gaa 


ate 


gtt 


gtc 


aaa 


aca 


gag 


atg 


tec 


gtc 


agt 


atg 


ttc 


aag 


gtt 


gtg 


389 


Glu 


He 


Val 


Val 


Lys 


Thr 


Glu 


Met 


Ser 


Val 


Ser 


Met 


Phe 


Lys 


Val 


Val 










90 










95 










100 








gag 


tgt 


aac 


tec 


aga 


att 


ttc 


ctt 


ctt 


cag 


etc 


att 


ttg 


tct 


etc 


tea 


437 


Glu 


Cys 


Asn 


Ser 


Arg 


He 


Phe 


Leu 


Leu 


Gin 


Leu 


lie 


Leu 


Ser 


Leu 


Ser 








105 










110 










115 











cat taaggggagt agg 453 
His 



<210> 363 
<211> 417 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 82. .384 

<220> 

<221> sig_peptide 

<222> 82. .168 

<223> Von Heijne matrix 

score 6.90000009536743 

seq VCMCVFSTQGALG/EM 

<220> 

<221> misc_feature 
<222> 404 

<223> n=a, g, c or t 
<400> 363 

gctacctcct ggtgtctgtt gcctggtgtg tgggaaaccc ccacacgctg gaccacagaa 60 
gtcttctgtg ttgactgttg a atg agg gaa gag aaa aag ccc ttt gag aga 111 

Met Arg Glu Glu Lys Lys Pro Phe Glu Arg 
-25 -20 
gag aga gag agt gtg tgt gtg tgt atg tgt gtg ttt tec act caa gga 159 
Glu Arg Glu Ser Val Cys Val Cys Met Cys Val Phe Ser Thr Gin Gly 

-15 -10 -5 

get ttg ggg gaa atg get gca cac ttc ata gat gaa aag ctg agg ccc 207 
Ala Leu Gly Glu Met Ala Ala His Phe He Asp Glu Lys Leu Arg Pro 
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206 

15 10 
agt gag ggg aat ggt cac aga ggg aca ctg gat age ctg agt teg gac 255 
Ser Glu Gly Asn Gly His Arg Gly Thr Leu Asp Ser Leu Ser Ser Asp 

15 20 25 

caa gag tec tac ate ccc tec ace gee gac ccc ace cag get ggc cct 303 
Gin Glu Ser Tyr lie Pro Ser Thr Ala Asp Pro Thr Gin Ala Gly Pro 
30 " 35 40 45 

gag ctg ctg cac aag aac ttg ccc gtg acc tec cga tec cag ccg ctg 351 
Glu Leu Leu His Lys Asn Leu Pro Val Thr Ser Arg Ser Gin Pro Leu 

50 55 60 

ccc tct gac etc gcg ate cca gee get gee etc tgacctcccg atcccagccn 404 
Pro Ser Asp Leu Ala lie Pro Ala Ala Ala Leu 

65 7 0 

gctgccctcg ctg 417 



<210> 364 
<211> 529 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 114 . . 497 



<220> 

<221> sig_peptide 

<222> 114 . . 191 

<223> Von Heijne matrix 

score 6.80000019073486 
seq VFLSLFLIQLLIS/FS 



<400> 364 

ctcagcaccc agggcggtgg taggtcacag tctctgggcg ggtctcagtg tccaacactg 60 
tagctggtgc ctgccaggtt cccagtggct ggggtcacca ggtctgaaga gag atg 116 

Met 



tgc 


tgg 


ctg 


egg 


gca 


tgg 


ggc 


cag 


ate 


etc 


ctg 


cca 


gtt 


ttc 


etc 


tec 


164 


Cys 


Trp 


Leu 


Arg 


Ala 


Trp 


Gly 


Gin 


He 


Leu 


Leu 


Pro 


Val 


Phe 


Leu 


Ser 




-25 










-20 










-15 










-10 




etc 


ttt 


etc 


ate 


caa 


ttg 


ctt 


ate 


age 


ttc 


tea 


gag 


aat 


ggt 


ttt 


ate 


212 


Leu 


Phe 


Leu 


He 


Gin 
-5 


Leu 


Leu 


He 


Ser 


Phe 
1 


Ser 


Glu 


Asn 


Gly 
5 


Phe 


He 




cac 


age 


ccc 


agg 


aac 


aat 


cag 


aaa 


cca 


aga 


gat 


ggg 


aat 


gaa 


gag 


gaa 


260 


His 


Ser 


Pro 


Arg 


Asn 


Asn 


Gin 


Lys 


Pro 


Arg 


Asp 


Gly 


Asn 


Glu 


Glu 


Glu 








10 










15 










20 










tgt 


get 


gta 


aag 


aag 


agt 


tgt 


caa 


ttg 


tgc 


aca 


gaa 


gat 


aag 


aaa 


tgt 


308 


Cys 


Ala 


Val 


Lys 


Lys 


Ser 


Cys 


Gin 


Leu 


Cys 


Thr 


Glu 


Asp 


Lys 


Lys 


Cys 






25 










30 










35 












gtt 


tgg 


tgt 


agt 


gaa 


gaa 


aaa 


gca 


tgc 


aaa 


aaa 


tac 


tgt 


ttt 


ccc 


tat 


356 


Val 


Trp 


Cys 


Ser 


Glu 


Glu 


Lys 


Ala 


Cys 


Lys 


Lys 


Tyr 


Cys 


Phe 


Pro 


Tyr 




40 










45 










50 










55 




ttc 


ggt 


tgt 


cga 


ttc 


agt 


tct 


ata 


tat 


tgg 


tta 


aac 


tgt 


aaa 


gtt 


gac 


404 


Phe 


Gly 


Cys 


Arg 


Phe 


Ser 


Ser 


He 


Tyr 


Trp 


Leu 


Asn 


Cys 


Lys 


Val 


Asp 












60 










65 










70 






atg 


ttt 


gga 


ate 


atg 


atg 


ctt 


eta 


etc 


att 


gca 


gta 


tta 


att 


aca 


gga 


452 


Met 


Phe 


Gly 


He 


Met 


Met 


Leu 


Leu 


Leu 


He 


Ala 


Val 


Leu 


He 


Thr 


Gly 










75 










80 










85 








ttc 


gtt 


tgg 


tac 


tgc 


tgc 


gee 


tat 


cac 


ttt 


tac 


ctg 


cag 


gat 


ata 




497 


Phe 


Val 


Trp 


Tyr 


Cys 


Cys 


Ala 


Tyr 


His 


Phe 


Tyr 


Leu 


Gin 


Asp 


He 







90 95 100 

tgatgaatag ataattgraa agagatcctc ca 529 



<210> 365 
<211> 505 



WO 02/083898 

<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 71. .496 

<220> 

<221> sig_peptide 
<222> 71. . 142 
<223> Von Heijne matrix 
score 6.5 

seq LSICLSAVATATG/AE 
<400> 365 

gacgaggacs cgcctgcgca gaggcggcag caccaccggg gttgactccg ggggcgcggc 60 
gaggagagac atg agg ctg age tgg ttc egg gtc ctg aca gta ctg tec 109 
Met Arg Leu Ser Trp Phe Arg Val Leu Thr Val Leu Ser 
-20 -15 



ate 


tgc 


ctg 


age 


gec 


gtg 


gee 


acg 


gee 


acg 


ggg 


gee 


gag 


ggc 


aaa 


agg 


157 


lie 


Cys 


Leu 


Ser 


Ala 


Val 


Ala 


Thr 


Ala 


Thr 


Gly 


Ala 


Glu 


Gly 


Lys 


Arg 






-10 










-5 










1 








5 




aag 


ctg 


cag 


ate 


ggg 


gtc 


aag 


aag 


egg 


gtg 


gac 


cac 


tgt 


ccc 


ate 


aaa 


205 


Lys 


Leu 


Gin 


He 


Gly 


Val 


Lys 


Lys 


Arg 


Val 


Asp 


His 


Cys 


Pro 


He 


Lys 












10 










15 










20 






teg 


cgc 


aaa 


ggg 


gat 


gtc 


ctg 


cac 


atg 


cac 


tac 


acg 


ggg 


aag 


ctg 


gaa 


253 


Ser 


Arg 


Lys 


Gly 


Asp 


Val 


Leu 


His 


Met 


His 


Tyr 


Thr 


Gly 


Lys 


Leu 


Glu 










25 










30 










35 








gat 


ggg 


aca 


gag 


ttt 


gac 


age 


age 


ctg 


ccc 


cag 


aac 


cag 


ccc 


ttt 


gtc 


301 


Asp 


Gly 


Thr 


Glu 


Phe 


Asp 


Ser 


Ser 


Leu 


Pro 


Gin 


Asn 


Gin 


Pro 


Phe 


Val 








40 










45 










50 










ttc 


tec 


ctt 


ggc 


aca 


ggc 


cag 


gtc 


ate 


aag 


ggc 


tgg 


gac 


cag 


ggg 


ctg 


349 


Phe 


Ser 


Leu 


Gly 


Thr 


Gly 


Gin 


Val 


He 


Lys 


Gly 


Trp 


Asp 


Gin 


Gly 


Leu 






55 










60 










65 












ctg 


ggg 


atg 


tgt 


gag 


ggg 


gaa 


aag 


cgc 


aas 


ykg 


gtg 


ate 


cca 


tec 


gag 


397 


Leu 


Gly 


Met 


Cys 


Glu 


Gly 


Glu 


Lys 


Arg 


Xaa 


Xaa 


Val 


He 


Pro 


Ser 


Glu 




70 










75 










80 










85 




eta 


ggg 


tat 


gga 


gag 


egg 


gga 


get 


ccc 


cca 


aag 


att 


cca 


ggc 


ggt 


gca 


445 


Leu 


Gly 


Tyr 


Gly 


Glu 


Arg 


Gly Ala 


Pro 


Pro 


Lys 


He 


Pro 


Gly 


Gly 


Ala 












90 










95 










100 






ace 


ctg 


gtg 


ttc 


gag 


gtg 


gag 


ctg 


etc 


aaa 


ata 


gag 


cga 


cga 


act 


gag 


493 


Thr 


Leu 


Val 


Phe 


Glu 


Val 


Glu 


Leu 


Leu 


Lys 


He 


Glu 


Arg 


Arg 


Thr 


Glu 





105 110 115 



ctg taaccagac 505 
Leu 

<210> 366 
<211> 539 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 65. .274 

<220> 

<221> sig_peptide 
<222> 65 . . 136 
<223> Von Heijne matrix 
score 6 . 5 

seq ILLTRPACLGSWA/EI 



PCT/IB01/00914 

207 



<400> 366 



WO 02/083898 PCT/IB01/00914 

208 

cactgattct ttggccatcg ctactacaat tttagaattg ctggcaattt tttctgcaca 60 
ctgg atg get tct tec ace agt gtc tea aca gga caa ate ttg ctg aca 109 
Met Ala Ser Ser Thr Ser Val Ser Thr Gly Gin lie Leu Leu Thr 
-20 -15 -10 

aga cct get tgc ttg ggg tec tgg get gag ate egg tea ccg gtg agg 157 
Arg Pro Ala Cys Leu Gly Ser Trp Ala Glu lie Arg Ser Pro Val Arg 

-5 15 
acc ate tec ate gee age gac ttc cca aca gca egg gtg agt etc tgg 205 
Thr lie Ser He Ala Ser Asp Phe Pro Thr Ala Arg Val Ser Leu Trp 

10 15 20 

gtg ccg ccc gca cct ggg atg gtt cct att aag ate tec ggc tgt gca 253 
Val Pro Pro Ala Pro Gly Met Val Pro He Lys He Ser Gly Cys Ala 

25 30 35 

aac tgg gec ttc tea ccg gca tagatgatat cacacatcat ggcaagctca 304 
Asn Trp Ala Phe Ser Pro Ala 
40 45 

cagcccccgc caaaggcata gecattgaca geagegatga ctggattctt gacctgggtg 364 
aggtggtccc agtgcttcaa gaacttgctg gagtaacagt cctggaaact caggttctgc 424 
atttccttga tatcagctct agtgaaaagg gcggtagttg gtggtggaac cagaaaegga 484 
cgccggtgct tkgagcggtt cttaaattat atttaaaaag taactttttg tataa 539 



<210> 367 
<211> 460 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 14 . . 178 



<220> 

<221> sig_peptide 

<222> 14 . .82 

<223> Von Heijne matrix 

score 6.40000009536743 
seq LGALSGAAALGFA/SY 



<400> 367 

ctgttctaca get atg gec ggg cca get gca. get 
Met Ala Gly Pro Ala Ala Ala 
-20 

gec ttg tec gga get gcg gee tta ggc ttc get 
Ala Leu Ser Gly Ala Ala Ala Leu Gly Phe Ala 



-10 



-5 



ggc gem aat tec cag atg cct acg gga agg age 
Gly Ala Asn Ser Gin Met Pro Thr Gly Arg Ser 



10 



15 



aca aac acc act tct tac aca gec tgg ccc tgt 
Thr Asn Thr Thr Ser Tyr Thr Ala Trp Pro Cys 



ttc cgc cgc ttg ggc 4 9 

Phe Arg Arg Leu Gly 
-15 

tec tac ggg gcg cac 97 
Ser Tyr Gly Ala His 
1 5 
tgt ttg aca agg cca 145 
Cys Leu Thr Arg Pro 
20 

taggggtgee ecattgeaga 198 



25 



30 



aagccactct gggctgggtt attgetaget teeggaaega 
tactaccagg ctctgagtgg agaccccagc atccagactt 
ctgctactct tgggctggct tgccttggct ctttgagctc 
ttttctgggc agtttttttt ttaaagagtt ggagtaagaa 
aaataaaaaa aaaaaaaaaa gc 



ccttattctg caccagcttt 258 
tggcccctgc gggagggacc 318 
ecttttgett aattactggg 378 
gaggattaaa aaggaaaggc 438 

460 



<210> 368 
<211> 482 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 



WO 02/083898 

<222> 14 . . 178 



209 



PCT/IB01/00914 



<220> 

<221> sig_peptide 

<222> 14 . .82 

<223> Von Heijne matrix 

score 6.40000009536743 
seq LGALSGAAALGFA/SY 



<400> 368 

ctgttctaca get atg gec ggg cca get gca get ttc cgc cgc 
Met Ala Gly Pro Ala Ala Ala Phe Arg Arg 
-20 -15 
gee ttg tec gga get gcg gee tta ggc ttc get tec tac ggg 
Ala Leu Ser Gly Ala Ala Ala Leu Gly Phe Ala Ser Tyr Gly 

-10 -5 1 

ggc gem aat tec cag atg cct acg gga agg age tgt ttg aca 
Gly Ala Asn Ser Gin Met Pro Thr Gly Arg Ser Cys Leu Thr 

10 15 
aca aac acc act tct tac aca gec tgg ccc tgt taggggtgee 
Thr Asn Thr Thr Ser Tyr Thr Ala Trp Pro Cys 

2 5 30 
aagccactct gggctgggtt attgetaget teeggaaega ccttattctg 
tactaccagg ctctgagtgg agaccccagc atccagactt tggcccctgc 
ctgctactct tgggctggct tgccttggct ctttgagctc ecttttgett 
ttttctgggc agtttttttt ttaaagagtt ggagtaagaa gaggattaaa 
aaataaactt tggagtcttt gttcatcaaa aaaaaaaaaa aaaa 



ttg ggc 
Leu Gly 

gcg cac 
Ala His 
5 

agg cca 
Arg Pro 
20 

ecattgeaga 



caccagcttt 
gggagggacc 
aattactggg 
aaggaaaggc 



49 



97 



145 



198 



258 
318 
378 
438 
482 



<210> 369 
<211> 497 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 161. .322 



<220> 

<221> sig_peptide 

<222> 161. .211 

<223> Von Heijne matrix 

score 6.40000009536743 
seq LTFAQLLFATVLG/IA 



<400> 369 

agatgggggt aggagctctc cagacagtgt cccctcactc acccccgcgg cgcctggcgt 60 
cttctcctgg gtgttgaagt acaagatttg aagagatttc ctgggtgtgg agtgtgactt 120 
tccaaaacca gcttttcctt gagctgtatt tgttgcagca atg ttt agg aga ttg 175 

Met Phe Arg Arg Leu 
-15 

act ttt gca caa ctg ctt ttt gec act gtc ctt gga att get gga gga 223 
Thr Phe Ala Gin Leu Leu Phe Ala Thr Val Leu Gly lie Ala Gly Gly 

-10 -5 1 

gta tat att ttt caa cca gta ttt gaa cag tat gee aaa gat cag aag 271 
Val Tyr lie Phe Gin Pro Val Phe Glu Gin Tyr Ala Lys Asp Gin Lys 
5 10 15 20 

gaa tta aaa gaa aag atg cag ttg gta caa gaa tea gaa gag aag aaa 319 
Glu Leu Lys Glu Lys Met Gin Leu Val Gin Glu Ser Glu Glu Lys Lys 

25 30 35 

agt taatactaca tggagttagg cctggcgcag tggctcacgc ctgtaatccc 372 
Ser 

agcactttgg gaggecgagg cgggtggatc aggtggtcag gagttcaaga ccagcctgac 432 
caacatggtg aaacctgtct ctactgaaaa tacaaaaatc agecaggett ggtggcatgc 492 



WO 02/083898 

gctgt 



210 



PCT/IB01/00914 

497 



<210> 370 
<211> 422 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 47. .295 

<220> 

<221> sig_peptide 

<222> 47 . . 91 

<223> Von Heijne matrix 

score 6.30000019073486 

seq LFLSLPVLVVVLS/IV 

<400> 370 

aaccaagccc tccagcaagg attcagagtg cccctccggc ctcgcc atg agg etc 55 

Met Arg Leu 
-15 



ttc 


ctg 


teg 


ctt 


ccg 


gtc 


ctg 


gtg 


gtg 


gtt 


ctg 


teg 


ate 


gtc 


ttg 


gaa 


103 


Phe 


Leu 


Ser 


Leu 


Pro 


Val 


Leu 


Val 


Val 


Val 


Leu 


Ser 


He 


Val 


Leu 


Glu 








-10 










-5 










1 










ggc 


cca 


gee 


cca 


gee 


cag 


ggg 


acc 


cca 


gac 


gtc 


tec 


agt 


gee 


ttg 


gat 


151 


Gly 


Pro 


Ala 


Pro 


Ala 


Gin 


Gly 


Thr 


Pro 


Asp 


Val 


Ser 


Ser 


Ala 


Leu 


Asp 




5 










10 










15 










20 




aag 


ctg 


aag 


gag 


ttt 


gga 


aac 


aca 


ctg 


gag 


gac 


aag 


get 


egg 


gaa 


etc 


199 


Lys 


Leu 


Lys 


Glu 


Phe 


Gly 


Asn 


Thr 


Leu 


Glu 


Asp 


Lys 


Ala 


Arg 


Glu 


Leu 












25 










30 










35 






ate 


age 


cgc 


ate 


aaa 


cag 


agt 


gaa 


ctt 


tct 


gee 


aag 


atg 


egg 


gag 


tgg 


247 


lie 


Ser 


Arg 


He 


Lys 


Gin 


Ser 


Glu 


Leu 


Ser 


Ala 


Lys 


Met 


Arg 


Glu 


Trp 










40 










45 










50 








ttt 


tea 


gag 


aca 


ttt 


cag 


aaa 


gtg 


aag 


gat 


aaa 


etc 


aag 


att 


gac 


tea 


295 


Phe 


Ser 


Glu 


Thr 


Phe 


Gin 


Lys 


Val 


Lys 


Asp 


Lys 


Leu 


Lys 


He 


Asp 


Ser 








55 










60 










65 











tgaggacctg aagggtgaca tcccaggagg ggcctctgaa atttcccaca ccccagcgcc 355 
tgtgctgagg actccctcca tgtggcccca ggtgccacca ataaaaatcc tacagaaaac 415 
taatata 422 



<210> 371 
<211> 495 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 66. . 359 

<220> 

<221> sig_peptide 
<222> 66. .194 
<223> Von Heijne matrix 
score 6 

seq FLIYVALLRVSEC/LP 
<400> 371 

gggegegagg cggccaccgt ggagagcaga gcgcggcggc tggaagctgc taagtcagag 60 
ccgcg atg ttc egg att gag ggc etc gcg ccg aag ctg gac ccg gag gag 110 
Met Phe Arg He Glu Gly Leu Ala Pro Lys Leu Asp Pro Glu Glu 
-40 -35 -30 

atg aaa egg aag atg cgc gag gat gtg ate tec tec ata egg aac ttt 158 
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211 



Met 


Lys 


Arg 


Lys 


Met 


Arg 


Glu 


Asp 


Val 


He 


Ser 


Ser 


He 


Arg 


Asn 


Phe 










-25 










-20 










-15 








etc 


ate 


tac 


gtg 


gee 


etc 


ctg 


cga 


gtc 


agt 


gag 


tgt 


etc 


ccg 


ggc 


tgt 


206 


Leu 


He 


Tyr 


Val 


Ala 


Leu 


Leu 


Arg 


Val 


Ser 


Glu 


Cys 


Leu 


Pro 


Gly 


Cys 








-10 










-5 










1 










gac 


tgt 


gat 


ace 


age 


ggg 


gag 


etc 


acc 


gac 


ggg 


cac 


ccc 


tta 


act 


eta 


254 


Asp 


Cys 


Asp 


Thr 


Ser 


Gly 


Glu 


Leu 


Thr 


Asp 


Gly 


His 


Pro 


Leu 


Thr 


Leu 




5 










10 










15 










20 




agg 


ggt 


cat 


egg 


ggc 


ctt 


cga 


act 


gag 


ctg 


aac 


ggt 


age 


ggg 


gag 


caa 


302 


Arg 


Gly 


His 


Arg 


Gly 


Leu 


Arg 


Thr 


Glu 


Leu 


Asn 


Gly 


Ser 


Gly 


Glu 


Gin 












25 










30 










35 






gga 


ggc 


tec 


att 


tat 


ctt 


aaa 


gaa 


att 


gga 


cag 


cat 


atg 


aag 


aca 


gga 


350 


Gly 


Gly 


Ser 


He 


Tyr 


Leu 


Lys 


Glu 


He 


Gly 


Gin 


His 


Met 


Lys 


Thr 


Gly 





40 45 50 



cat cac ata tgaatgcacg atatgaagag cctggttaca gtttcgactc 399 
His His He 
55 

ctctctgcaa gtgaataggc ccagaaaggt gtaagagact ctttgaatgg acataaaatt 459 
ctgcttgtta agaacaagtt tggctctggt aactga 4 95 



<210> 372 
<211> 515 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 115. .360 



<220> 

<221> sig_peptide 

<222> 115. .243 

<223> Von Heijne matrix 

score 5.59999990463257 
seq VVSFALIATLVYA/LF 



<400> 372 

ataaggggac gtctagtggg ttgcccggga gggtggcggg agggtcctgg aaataatctg 60 
tcctctgtcg ccgggaactg gcgaggtagt tccttcgcgg tggagagacc tgga atg 117 

Met 



gec 


aaa 


tat 


caa 


ggt 


gaa 


gtt 


caa 


agt 


ttg 


aaa 


ctg 


gat 


gat 


gat 


tea 


165 


Ala 


Lys 


Tyr 
-40 


Gin 


Gly 


Glu 


Val 


Gin 
-35 


Ser 


Leu 


Lys 


Leu 


Asp 
-30 


Asp 


Asp 


Ser 




gtt 


ata 


gaa 


gga 


gta 


age 


gac 


caa 


gta 


ctt 


gtg 


gca 


gtt 


gtg 


gtc 


agt 


213 


Val 


He 
-25 


Glu 


Gly 


Val 


Ser 


Asp 
-20 


Gin 


Val 


Leu 


Val 


Ala 
-15 


Val 


Val 


Val 


Ser 




ttc 


get 


ttg 


att 


get 


acc 


ctg 


gta 


tat 


gca 


ctt 


ttc 


aga 


aat 


gta 


cat 


261 


Phe 


Ala 


Leu 


He 


Ala 


Thr 


Leu 


Val 


Tyr 


Ala 


Leu 


Phe 


Arg 


Asn 


Val 


His 




-10 










-5 










1 








5 






caa 


aac 


att 


cac 


cca 


gaa 


aac 


cag 


gag 


eta 


gta 


agg 


gta 


ctt 


cga 


gaa 


309 


Gin 


Asn 


He 


His 
10 


Pro 


Glu 


Asn 


Gin 


Glu 
15 


Leu 


Val 


Arg 


Val 


Leu 
20 


Arg 


Glu 




cag 


ctt 


caa 


aca 


gaa 


cag 


gat 


gca 


cct 


get 


gac 


teg 


aca 


gca 


gtt 


eta 


357 


Gin 


Leu 


Gin 
25 


Thr 


Glu 


Gin 


Asp 


Ala 
30 


Pro 


Ala 


Asp 


Ser 


Thr 
35 


Ala 


Val 


Leu 





cac tgacatgtac tgtcccatct gcctgcacca agcctccttc ccggtggaga 410 
His 

ccaactgtgg acatcttttt tgtggtgcct geattattge ttactggcga tatggttcat 470 

ggcttggggc aatcagttgt ccaatctgta gacaaacgag acatg 515 



<210> 373 
<211> 513 
<212> DNA 



WO 02/083898 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 56. .352 

<220> 

<221> sig_peptide 
<222> 56. . 139 
<223> Von Heijne matrix 
score 5 . 5 

seq LGYLVLSEGAVLA/SS 



<400> 373 

ctgaagccgg aagctaccta tctggtaggg agctccccca gcaccgaaga ctgcg atg 58 

Met 



act 


tct 


gca 


ctg 


acc 


cag 


ggg 


ctg 


gag 


cga 


ate 


cca 


gac 


cag 


etc 


ggc 


106 


Thr 


Ser 


Ala 


Leu 


Thr 


Gin 


Gly 


Leu 


Glu 


Arg 


He 


Pro 


Asp 


Gin 


Leu 


Gly 








-25 










-20 










-15 










tac 


ctg 


gta 


ctg 


agt 


gaa 


ggt 


gca 


gtg 


ctg 


gcg 


tea 


tct 


ggg 


gac 


ctg 


154 


Tyr 


Leu 


Val 


Leu 


Ser 


Glu 


Gly 


Ala 


Val 


Leu 


Ala 


Ser 


Ser 


Gly 


Asp 


Leu 






-10 










-5 










1 








5 




gag 


aat 


gat 


gag 


cag 


gca 


gcc 


agt 


gcc 


ate 


tct 


gag 


ctg 


gtc 


age 


aca 


202 


Glu 


Asn 


Asp 


Glu 


Gin 


Ala 


Ala 


Ser 


Ala 


He 


Ser 


Glu 


Leu 


Val 


Ser 


Thr 












10 










15 










20 






gcc 


tgc 


ggt 


ttc 


egg 


ctg 


cac 


cgc 


ggc 


atg 


aat 


gtg 


ccc 


ttc 


aag 


cgc 


250 


Ala 


Cys 


Gly 


Phe 


Arg 


Leu 


His 


Arg 


Gly 


Met 


Asn 


Val 


Pro 


Phe 


Lys 


Arg 










25 










30 










35 








ctg 


tct 


gtg 


gtc 


ttt 


gga 


gaa 


cac 


aca 


ctg 


ctg 


gtg 


acg 


gtg 


tea 


gga 


298 


Leu 


Ser 


Val 


Val 


Phe 


Gly 


Glu 


His 


Thr 


Leu 


Leu 


Val 


Thr 


Val 


Ser 


Gly 








40 










45 










50 










cag 


agg 


gtg 


ttt 


gtg 


gtg 


aag 


agg 


cag 


aac 


cga 


ggt 


egg 


gag 


ccc 


att 


346 


Gin 


Arg 


Val 


Phe 


Val 


Val 


Lys 


Arg 


Gin 


Asn 


Arg 


Gly 


Arg 


Glu 


Pro 


He 





55 60 65 

gat gtc tgagcctgcc ggaggegagg gtcggagaag cggattgggt cctgggcctc 402 
Asp Val 
70 

tgtgatgagg caggcacacc tgteggtett ggcttgctgc tagaaactag ggccttctgc 4 62 
tcgcccacct cccaccccct acctggaegg gcccaggctt ggggactctg a 513 

<210> 374 
<211> 632 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 2. .559 

<220> 

<221> sig_peptide 
<222> 2. . 46 

<223> Von Heijne matrix 

score 5.30000019073486 
seq VMAASMARGGVSA/RV 

<400> 374 

g atg cat gtc atg gcc gcc tec atg gcc egg gga ggc gtg agt gcc agg 4 9 

Met His Val Met Ala Ala Ser Met Ala Arg Gly Gly Val Ser Ala Arg 
-15 -10 -5 1 

gtt eta ctg cag get gcc agg ggc acc tgg tgg aac aga cct ggg ggc 97 
Val Leu Leu Gin Ala Ala Arg Gly Thr Trp Trp Asn Arg Pro Gly Gly 
5 10 15 



PCT/IB01/00914 

212 
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213 



act 


tec 


ggg 


teg 


ggg 


gag 


ggg 


gtg 


gcg 


ctg 


ggg 


aca 


ace 


aga 


aag 


ttt 


145 


Thr 


Ser 


Gly 


Ser 


Gly 


Glu 


Gly 


Val 


Ala 


Leu 


Gly 


Thr 


Thr 


Arg 


Lys 


Phe 








20 










25 










30 










caa 


gcg 


aca 


ggc 


teg 


cgc 


ccg 


get 


gga 


gag 


gag 


gac 


gcg 


ggc 


ggc 


ccg 


193 


Gin 


Ala 


Thr 


Gly 


Ser 


Arg 


Pro 


Ala 


Gly 


Glu 


Glu 


Asp 


Ala 


Gly 


Gly 


Pro 






35 










40 










45 












gag 


egg 


ccc 


ggg 


gac 


gtg 


gtg 


aac 


gtg 


gtg 


ttc 


gta 


gac 


cgc 


tea 


ggc 


241 


Glu 


Arg 


Pro 


Gly 


Asp 


Val 


Val 


Asn 


Val 


Val 


Phe 


Val 


Asp 


Arg 


Ser 


Gly 




50 










55 










60 










65 




cag 


c gg 


ate 


cca 


gtg 


agt 


ggc 


aga 


gtc 


ggg 


gac 


aat 


gtt 


ctt 


cac 


ctg 


289 


Gin 


Arg 


He 


Pro 


Val 


Ser 


Gly 


Arg 


Val 


Gly 


Asp 


Asn 


Val 


Leu 


His 


Leu 












70 










75 










80 






gcc 


cag 


cgc 


cac 


ggg 


gtg 


gac 


ctg 


gaa 


ggg 


gcc 


tgt 


gaa 


gcc 


tec 


ctg 


337 


Ala 


Gin 


Arg 


His 


Gly 


Val 


Asp 


Leu 


Glu 


Gly 


Ala 


Cys 


Glu 


Ala 


Ser 


Leu 










85 










90 










95 








gcc 


tgc 


tec 


ace 


tgc 


cat 


gtg 


tat 


gtg 


agt 


gaa 


gac 


cac 


ctg 


gat 


etc 


385 


Ala 


Cys 


Ser 


Thr 


Cys 


His 


Val 


Tyr 


Val 


Ser 


Glu 


Asp 


His 


Leu 


Asp 


Leu 








100 










105 










110 










ctg 


cct 


cct 


ccc 


gag 


gag 


agg 


gaa 


gac 


gac 


atg 


eta 


gac 


atg 


gcc 


ccc 


433 


Leu 


Pro 


Pro 


Pro 


Glu 


Glu 


Arg 


Glu 


Asp 


Asp 


Met 


Leu 


Asp 


Met 


Ala 


Pro 






115 










120 










125 












etc 


etc 


cag 


gag 


aac 


teg 


egg 


ctg 


ggc 


tgc 


cag 


att 


gtg 


ctg 


aca 


ccg 


481 


Leu 


Leu 


Gin 


Glu 


Asn 


Ser 


Arg 


Leu 


Gly 


Cys 


Gin 


He 


Val 


Leu 


Thr 


Pro 




130 










135 










140 










145 




gag 


ctg 


saa 


gga 


gcg 


gaa 


ttc 


ace 


ctg 


ccc 


aag 


ate 


aec 


agg 


aac 


ttc 


529 


Glu 


Leu 


Xaa 


Gly 


Ala 


Glu 


Phe 


Thr 


Leu 


Pro 


Lys 


He 


Thr 


Arg 


Asn 


Phe 












150 










155 










160 






tac 


gtg 


gat 


ggc 


cat 


gtc 


ccc 


aag 


ccc 


cac 


tgacatgaac acctggacca 


579 


Tyr 


Val 


Asp 


Gly 


His 


Val 


Pro 


Lys 


Pro 


His 

















165 170 
ttccacattg ccatggcccc agggecagat tgagggaata gcaggtgcag cct 632 



<210> 375 
<211> 503 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 59. .427 

<220> 

<221> sig_peptide 

<222> 59. .202 

<223> Von Heijne matrix 

score 5.19999980926514 

seq LLMLLLFLSELQY/YL 

<400> 375 

aggceggaag agggagtctg taggggeggg ccgggcgtcc cctttccggc cggtcccc 58 



atg 


gag 


gcg 


ctg 


ggg 


aag 


ctg 


aag 


cag 


ttc 


gat 


gcc 


tac 


ccc 


aag 


act 


106 


Met 


Glu 


Ala 


Leu 


Gly 


Lys 


Leu 


Lys 


Gin 


Phe 


Asp 


Ala 


Tyr 


Pro 


Lys 


Thr 










-45 










-40 










-35 








ttg 


gag 


gac 


ttc 


egg 


gtc 


aag 


ace 


tgc 


ggg 


ggc 


gcc 


ace 


gtg 


ace 


att 


154 


Leu 


Glu 


Asp 


Phe 


Arg 


Val 


Lys 


Thr 


Cys 


Gly 


Gly 


Ala 


Thr 


Val 


Thr 


He 








-30 










-25 










-20 










gtc 


agt 


ggc 


ctt 


etc 


atg 


ctg 


eta 


ctg 


ttc 


ctg 


tec 


gag 


ctg 


cag 


tat 


202 


Val 


Ser 


Gly 


Leu 


Leu 


Met 


Leu 


Leu 


Leu 


Phe 


Leu 


Ser 


Glu 


Leu 


Gin 


Tyr 






-15 










-10 










-5 












tac 


etc 


ace 


acg 


gag 


gtg 


cat 


cct 


gag 


etc 


tac 


gtg 


gac 


aag 


teg 


egg 


250 


Tyr 


Leu 


Thr 


Thr 


Glu 


Val 


His 


Pro 


Glu 


Leu 


Tyr 


Val 


Asp 


Lys 


Ser 


Arg 




1 








5 










10 










15 






gga 


gat 


aaa 


ctg 


aag 


ate 


aac 


ate 


gat 


gta 


ctk 


ttt 


ccg 


cac 


atg 


cct 


298 
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214 



Gly 


Asp 


Lys 


Leu 


Lys 


He 


Asn 


He 


Asp 


Val 


Leu 


Phe 


Pro His Met 


Pro 










20 










25 








30 






tgt 


gee 


tat 


ctg 


agt 


att 


gat 


gec 


atg 


gat 


gtg 


gec 


gga gaa cag 


cag 


346 


Cys 


Ala 


Tyr 


Leu 


Ser 


He 


Asp 


Ala 


Met 


Asp 


Val 


Ala 


Gly Glu Gin 


Gin 








35 










40 










45 






ctg 


gat 


gtg 


gaa 


cac 


aac 


ctg 


ttc 


aag 


caa 


cga 


eta 


gat aaa gat 


ggc 


394 


Leu 


Asp 


Val 


Glu 


His 


Asn 


Leu 


Phe 


Lys 


Gin 


Arg 


Leu 


Asp Lys Asp 


Gly 






50 










55 










60 








ate 


ccc 


gtg 


age 


tea 


gag 


get 


gag 


egg 


cat 


gat 


taatgcagga caccagtgga 


447 


He 


Pro 


Val 


Ser 


Ser 


Glu 


Ala 


Glu 


Arg 


His 


Asp 










65 










70 










75 











accaacataa tttgetatag gactggatgt ggagtggagg agagcaaagt catggt 503 



<210> 376 
<211> 521 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 145. .375 



<220> 

<221> sig_peptide 

<222> 145. .231 

<223> Von Heijne matrix 

score 5.09999990463257 
seq LLLKLQRLPQAEP/VE 



<400> 376 

gacaaacttg ttctgeggge tgcggatggg tgcgagggtg gaatctcggt getgegaega 60 
gtgtggggcc arccgtggag gctccaggtg ttctctctgc cccagcagag cccggcagga 120 
gccccaacag gaagccagcg egge atg get gee ace gac ttc gtg cag gag 171 

Met Ala Ala Thr Asp Phe Val Gin Glu 

atg cgc gee gtg 
Met Arg Ala Val 
-20 

cag get gag ccc 
Gin Ala Glu Pro 

aca get act gtt 
Thr Ala Thr Val 
15 

act cac tgc tgc 
Thr His Cys Cys 

30 35 40 

ccg aca cca cca tgacggackg gcgatggctg aggagaagct ggagaggaga 415 
Pro Thr Pro Pro 
45 

tggccaatgc catgacacag gccatcagcc tggccctgca gcccttaccc ctcaagacsa 475 
ggctccctgg cccagctctg gcccagccag gtacctggac actgac 521 

















-25 












ggc 


gag 


agg 


ctg 


ctg 


etc 


aag 


ctg 


cag 


aga 


ctg 


ccc 


219 


Gly 


Glu 


Arg 


Leu 


Leu 


Leu 


Lys 


Leu 


Gin 


Arg 


Leu 


Pro 






-15 










-10 










-5 




gtg 


gag 


ate 


gtg 


gee 


ttc 


tea 


gtc 


ate 


ate 


ctt 


ttc 


267 


Val 


Glu 


He 


Val 


Ala 


Phe 


Ser 


Val 


He 


He 


Leu 


Phe 




1 








5 










10 








ctg 


ctg 


ttg 


ctg 


ctg 


ata 


gec 


tgc 


age 


tgc 


tgc 


tgc 


315 


Leu 


Leu 


Leu 


Leu 


Leu 


He 


Ala 


Cys 


Ser 


Cys 


Cys 


Cys 










20 










25 










tgc 


cct 


gag 


erg 


aga 


ggc 


agg 


aag 


gtc 


cag 


gtg 


cag 


363 


Cys 


Pro 


Glu 


Xaa 


Arg 


Gly 


Arg 


Lys 


Val 


Gin 


Val 


Gin 





<210> 377 
<211> 461 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 17 . . 337 



<220> 
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<221> sig_peptide 

<222> 11 . .91 

<223> Von Heijne matrix 

score 4.90000009536743 

seq VLSTLSLVQFSPS/GR 



<400> 377 

actttagctg ctgcag atg ctg ttt gaa gaa get ctg ccc etc age tgc tct 52 

Met Leu Phe Glu Glu Ala Leu Pro Leu Ser Cys Ser 
-25 -20 
gat cct gtg ctt age act ctt age ctg gtg cag ttc age ccc agt gga 100 
Asp Pro Val Leu Ser Thr Leu Ser Leu Val Gin Phe Ser Pro Ser Gly 
-15 -10 -5 1 

agg acc cag gac ctg etc tct cca ggg gtg gag aac ctg teg gtg ctg 148 
Arg Thr Gin Asp Leu Leu Ser Pro Gly Val Glu Asn Leu Ser Val Leu 

5 10 15 

gac gtg tec cct ctg ggc ttg gee tgc tgt ctg etc act etc acc atg 196 
Asp Val Ser Pro Leu Gly Leu Ala Cys Cys Leu Leu Thr Leu Thr Met 

20 25 30 

tec tgc cca ggg cct gac cct cct gag ggg ccc ggg acc cag cgt gtg 24 4 

Ser Cys Pro Gly Pro Asp Pro Pro Glu Gly Pro Gly Thr Gin Arg Val 

35 40 45 

tgg caa ggg get eta egg ate eta cag etc cca gga gee cca gat ggg 292 
Trp Gin Gly Ala Leu Arg lie Leu Gin Leu Pro Gly Ala Pro Asp Gly 
50 55 60 65 

gtt tea cca tat cag cca gtc tgg tct cga act cct gac etc aag 337 
Val Ser Pro Tyr Gin Pro Val Trp Ser Arg Thr Pro Asp Leu Lys 

70 75 80 

tgattcgeca gcctcgtcct cccaaagtgc tgggattaca gatgtgagcc accatgccca 397 
tggagaatgt attacttttg taatttaaaa aataatagta ataaaataaa gatctttaca 457 
atgt 461 



<210> 378 
<211> 495 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 148. .483 



<220> 

<221> sig_peptide 

<222> 148 . .264 

<223> Von Heijne matrix 

score 4.80000019073486 
seq IILFSAIVGFIYG/YV 

<220> 

<221> misc_feature 
<222> 43, 112, 122, 154 
<223> n=a, g, c or t 



<400> 378 

cctaatcgaa aattegctet cccgggctta gaaggecegg ctnctgacgc gcagtgccag 60 
accttacccc tcacggtcct taagtctegg tcgccctcgc ctcgcagcct gncacccgcg 120 
mncagctgcc cgcctcctca gccagcc atg ctg nag cat ctg age teg ctg ccc 174 

Met Leu Xaa His Leu Ser Ser Leu Pro 
-35 

acg cag atg gat tac aag ggc cag aag eta get gaa cag atg ttt cag 222 

Thr Gin Met Asp Tyr Lys Gly Gin Lys Leu Ala Glu Gin Met Phe Gin 

-30 -25 -20 -15 

gga att att ctt ttt tct gca ata gtt gga ttt ate tac ggg tac gtg 270 
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Gly 


He 


He 


Leu 


Phe 
-10 


Ser 


Ala 


He 


Val 


Gly 
-5 


Phe 


He 


Tyr 


Gly 


Tyr 
1 


Val 




get 


gaa 


cag 


ttc 


ggg 


tgg 


act 


gtc 


tat 


ata 


gtt 


atg 


gee 


gga 


ttt 


get 


318 


Ala 


Glu 


Gin 
5 


Phe 


Gly 


Trp 


Thr 


Val 
10 


Tyr 


He 


Val 


Met 


Ala 
15 


Gly 


Phe 


Ala 




ttt 


tea 


tgt 


ttg 


ctg 


aca 


ctt 


cct 


cca 


tgg 


ccc 


ate 


tat 


cgc 


egg 


cat 


366 


Phe 


Ser 
20 


Cys 


Leu 


Leu 


Thr 


Leu 
25 


Pro 


Pro 


Trp 


Pro 


He 
30 


Tyr 


Arg 


Arg 


His 




cct 


etc 


aag 


tgg 


tta 


cct 


gtt 


caa 


gca 


cag 


acg 


aca 


aga 


aac 


cag 


ggg 


414 


Pro 


Leu 


Lys 


Trp 


Leu 


Pro 


Val 


Gin 


Ala 


Gin 


Thr 


Thr 


Arg 


Asn 


Gin 


Gly 




35 










40 










45 










50 




aaa 


gaa 


aaa 


tta 


aga 


ggc 


atg 


eta 


aaa 


ata 


att 


gag 


gtt 


ttc 


atg 


att 


4 62 


Lys 


Glu 


Lys 


Leu 


Arg 
55 


Gly 


Met 


Leu 


Lys 


He 
60 


He 


Glu 


Val 


Phe 


Met 
65 


He 




cag 


cac 


ctg 


ctt 


ttg 


ttt 


ctg 


tgagatgagc ta 












495 


Gin 


His 


Leu 


Leu 
70 


Leu 


Phe 


Leu 























<210> 379 
<211> 541 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 60. . 368 



<220> 

<221> sig_peptide 

<222> 60. . 122 

<223> Von Heijne matrix 

score 4.69999980926514 
seq ALRSLNLAPPTVA/AP 



<400> 379 

acagtcaacg tcatctagga gcaccgagca gcttggctaa aagtaagggt gtcgtgctg 59 



atg 


gee 


ctg 


tgc 


gca 


ctg 


acc 


cgc 


get 


ctg 


cgc 


tct 


ctg 


aac 


ctg 


gcg 


107 


Met 


Ala 
-20 


Leu 


Cys 


Ala 


Leu 


Thr 
-15 


Arg 


Ala 


Leu 


Arg 


Ser 
-10 


Leu 


Asn 


Leu 


Ala 




ccc 


ccg 


ace 


gtc 


gee 


gee 


cct 


gee 


ccg 


agt 


ctg 


ttc 


ccc 


gee 


gee 


cag 


155 


Pro 


Pro 


Thr 


Val 


Ala 


Ala 


Pro 


Ala 


Pro 


Ser 


Leu 


Phe 


Pro 


Ala 


Ala 


Gin 




-5 










1 








5 










10 






atg 


atg 


aac 


aat 


ggc 


etc 


etc 


caa 


cag 


ccc 


tct 


gee 


ttg 


atg 


ttg 


etc 


203 


Met 


Met 


Asn 


Asn 
15 


Gly 


Leu 


Leu 


Gin 


Gin 
20 


Pro 


Ser 


Ala 


Leu 


Met 
25 


Leu 


Leu 




ccc 


tgc 


cgc 


cca 


gtt 


ctt 


act 


tct 


gtg 


gee 


ctt 


aat 


gee 


aac 


ttt 


gtg 


251 


Pro 


Cys 


Arg 
30 


Pro 


Val 


Leu 


Thr 


Ser 
35 


Val 


Ala 


Leu 


Asn 


Ala 
40 


Asn 


Phe 


Val 




tec 


tgg 


aag 


agt 


cgt 


acc 


aag 


tac 


acc 


att 


aca 


cca 


gtg 


aag 


atg 


agg 


299 


Ser 


Trp 
45 


Lys 


Ser 


Arg 


Thr 


Lys 
50 


Tyr 


Thr 


He 


Thr 


Pro 
55 


Val 


Lys 


Met 


Arg 




aag 


tct 


ggg 


ggc 


cga 


gac 


cac 


aca 


ggt 


get 


gga 


aac 


gtg 


cgt 


age 


aac 


347 


Lys 


Ser 


Gly 


Gly 


Arg 


Asp 


His 


Thr 


Gly 


Ala 


Gly 


Asn 


Val 


Arg 


Ser 


Asn 




60 










65 










70 










75 




agt 


agg 


ccg 


agt 


ate 


caa 


cgt 


tgatcataac aaaegggtea ttggcaaggc 


398 


Ser 


Arg 


Pro 


Ser 


He 
80 


Gin 


Arg 























aggtegcaac cgctggctgg geaagaggee taacagtggg cggtggcacc gcaagggggg 458 
ctgggctggc cgaaagattc ggccactacc ccccatgaag agttacgtga agctgccttc 518 
tgcttctgcc aaagctgata tec 541 



<210> 380 
<211> 545 
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<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 80. .355 



<220> 

<221> sig_peptide 

<222> 80. . 124 

<223> Von Heijne matrix 

score 4.59999990463257 
seq VLAGSLLGPTSRS/AA 



<400> 380 

cttttccttt gttccagggc tggagcggct ctgggctccg gaatcgcccg cagccggtac 60 
tgcgggaccc actgcggat atg get gtc ttg get gga tec ctg ttg ggc ccc 112 

Met Ala Val Leu Ala Gly Ser Leu Leu Gly Pro 
-15 -10 -5 

acg agt agg teg gca gcg ttg ctg ggt ggc agg tgg etc cag ccc egg 160 
Thr Ser Arg Ser Ala Ala Leu Leu Gly Gly Arg Trp Leu Gin Pro Arg 

1 5 10 

gee tgg ctg ggg ttc cca gac gee tgg ggc etc ccc ace ccg cag cag 208 
Ala Trp Leu Gly Phe Pro Asp Ala Trp Gly Leu Pro Thr Pro Gin Gin 

15 20 25 

gee egg ggc aag get cgc ggg aat gag tat cag ccg age aac ate aaa 256 
Ala Arg Gly Lys Ala Arg Gly Asn Glu Tyr Gin Pro Ser Asn lie Lys 

30 35 40 

cgc aag aac aag cac ggc tgg gtc egg cgc ctg age acg ccg gee ggc 304 
Arg Lys Asn Lys His Gly Trp Val Arg Arg Leu Ser Thr Pro Ala Gly 
4 5 ~ ~ 50 " 55 60 

gtg cag gtc ate ctt cgc cga atg etc aag ggc cgc aag teg ctg age 352 
Val Gin Val lie Leu Arg Arg Met Leu Lys Gly Arg Lys Ser Leu Ser 

65 70 75 

cat tgaggatege gaegcagteg gcgggaccct catggaagca tcgccctcgc 405 
His 

ctcggacctt gcctggcgct atttttgeag ggagctgggg agcaggaacg cctcggacct 4 65 
gagtgetetc catattgtgg ggttgaagtc tggatgggag ettgecaagt ccctttttag 525 
gctttttaat taggaagcat 545 



<210> 381 
<211> 498 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 137. . 397 



<220> 

<221> sig_peptide 

<222> 137. .253 

<223> Von Heijne matrix 

score 4.59999990463257 
seq LQGARVLLTGANA/GV 



<400> 381 

ggggtgcggt cggggacccg geaggaggeg gecgagaaga gaggacegtg ggggcgttcg 60 
cgtggctccc agcccgggac cccacccccg ctggacagtg ggggaaactg aggectgage 120 
gggcccacac aggacc atg aag gtg ctt etc etc aca ggg ctg ggg gee ctg 172 

Met Lys Val Leu Leu Leu Thr Gly Leu Gly Ala Leu 
-35 -30 
ttc ttc gee tat tat tgg gat gac aac ttc gac cca gee age etc cag 220 
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Phe 


Phe 


Ala 


Tyr 


Tyr 


Trp 


Asp 


Asp 


Asn 


Phe 


Asp 


Pro 


Ala 


Ser 


Leu 


Gin 








-25 










-20 










-15 










gga 


gcg 


cga 


gtg 


ctg 


ctg 


aca 


ggg 


gee 


aac 


get 


ggt 


gtt 


ggt 


gag 


gag 


268 


Gly 


Ala 


Arg 


Val 


Leu 


Leu 


Thr 


Gly 


Ala 


Asn 


Ala 


Gly 


Val 


Gly 


Glu 


Glu 






-10 










-5 










1 








5 




ctg 


gec 


tat 


cac 


tac 


gcg 


cgt 


ctg 


ggc 


tec 


cac 


ctg 


gtg 


etc 


act 


gee 


316 


Leu 


Ala 


Tyr 


His 


Tyr 


Ala 


Arg 


Leu 


Gly 


Ser 


His 


Leu 


Val 


Leu 


Thr 


Ala 












10 










15 










20 






cac 


act 


gag 


get 


etc 


ctg 


cag 


aag 


gca 


egg 


tgg 


etc 


acg 


ctt 


gta 


gtc 


364 


His 


Thr 


Glu 


Ala 


Leu 


Leu 


Gin 


Lys 


Ala 


Arg 


Trp 


Leu 


Thr 


Leu 


Val 


Val 










25 










30 










35 








age 


act 


ttg 


gga 


ggc 


cga 


gga 


aag 


tgg 


ate 


ace 


tgaggtcagg . 


agttcaagac 


417 


Ser 


Thr 


Leu 


Gly 


Gly 


Arg 


Gly 


Lys 


Trp 


He 


Thr 















40 45 



cagcctggcc aacatggega aacctggtct ctactaagaa gacaaaaatt aaggecagge 477 
atggtggctc atgcctgtaa t 498 



<210> 382 
<211> 459 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 43. .417 



<220> 

<221> sig_peptide 

<222> 43. .87 

<223> Von Heijne matrix 

score 4.59999990463257 

seq LLTHNLLSSHVRG/VG 

<400> 382 

acctttttcc ggttccggcc tggcgagagt ttgtgcggcg ac atg aaa ctg ctt 54 

Met Lys Leu Leu 



























-15 










ace 


cac 


aat 


ctg 


ctg 


age 


teg 


cat 


gtg 


egg 


ggg 


gtg 


ggg 


tec 


cgt 


ggc 


102 


Thr 


His 


Asn 


Leu 


Leu 


Ser 


Ser 


His 


Val 


Arg 


Gly 


Val 


Gly 


Ser 


Arg 


Gly 






-10 










-5 










1 








5 




ttc 


ccc 


ctg 


cgc 


etc 


cag 


gee 


ace 


gag 


gtc 


cgt 


ate 


tgc 


cct 


gtg 


gaa 


150 


Phe 


Pro 


Leu 


Arg 


Leu 


Gin 


Ala 


Thr 


Glu 


Val 


Arg 


He 


Cys 


Pro 


Val 


Glu 












10 










15 










20 






ttc 


aac 


ccc 


aac 


ttc 


gtg 


gcg 


cgt 


atg 


ata 


cct 


aaa 


gtg 


gag 


tgg 


teg 


198 


Phe 


Asn 


Pro 


Asn 


Phe 


Val 


Ala 


Arg 


Met 


He 


Pro 


Lys 


Val 


Glu 


Trp 


Ser 










25 










30 










35 








gcg 


ttc 


ctg 


gag 


gcg 


gee 


gat 


aac 


ttg 


cgt 


ctg 


ate 


cag 


gtg 


ccg 


aaa 


246 


Ala 


Phe 


Leu 


Glu 


Ala 


Ala 


Asp 


Asn 


Leu 


Arg 


Leu 


He 


Gin 


Val 


Pro 


Lys 








40 










45 










50 










ggg 


ccg 


gtt 


gag 


gga 


tat 


gag 


gag 


aat 


gag 


gag 


ttt 


ctg 


agg 


ace 


atg 


294 


Gly 


Pro 


Val 


Glu 


Gly 


Tyr 


Glu 


Glu 


Asn 


Glu 


Glu 


Phe 


Leu 


Arg 


Thr 


Met 






55 










60 










65 












cac 


cac 


ctg 


ctg 


ctg 


gag 


gtg 


gaa 


gtg 


ata 


gag 


ggc 


ace 


ctg 


cag 


tgc 


342 


His 


His 


Leu 


Leu 


Leu 


Glu 


Val 


Glu 


Val 


He 


Glu 


Gly 


Thr 


Leu 


Gin 


Cys 




70 










75 










80 










85 




ccg 


gaa 


tct 


gga 


cgt 


atg 


ttc 


ccc 


ate 


age 


cgc 


ggg 


ate 


ccm 


amc 


atg 


390 


Pro 


Glu 


Ser 


Gly 


Arg 


Met 


Phe 


Pro 


He 


Ser 


Arg 


Gly 


lie 


Pro 


Xaa 


Met 












90 










95 










100 






ctg 


ctg 


agt 


gaa 


gag 


gaa 


act 


gag 


agt 


tgattgtgsc aggegecagt 




437 


Leu 


Leu 


Ser 


Glu 


Glu 


Glu 


Thr 


Glu 


Ser 



















105 110 
ttttcttgty atgactgtgt at 459 
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<210> 383 
<211> 533 
<212> DNA 

<213> Homo sapiens 

<220> 
<22l> CDS 
<222> 82. .237 



<220> 

<221> sig_peptide 
<222> 82 . . 147 
<223> Von Heijne matrix 
score 4 . 5 

seq FFLLITIALGTKT/ES 



<400> 383 

aawtttctca acaattcctc accgcagagc ctctgaagct cccaccaggc cagctctcct 
cccacaacag cttcccacag c atg aag ate tec gtg get gee att ccc ttc 

Met Lys lie Ser Val Ala Ala lie Pro Phe 
-20 -15 
ttc etc etc ate acc ate gee eta ggg acc aag act gaa tec tec tea 
Phe Leu Leu lie Thr lie Ala Leu Gly Thr Lys Thr Glu Ser Ser Ser 

-10 -5 1 

egg gga cct tac cac ccc tea gag tgc tgc ttc acc tac act acc tac 
Arg Gly Pro Tyr His Pro Ser Glu Cys Cys Phe Thr Tyr Thr Thr Tyr 
5 ~ 10 15 " 20 

aag ate ccg cgt cag awt tat gga tta eta tgagaccaac agecagtget 
Lys lie Pro Arg Gin Xaa Tyr Gly Leu Leu 

25 30 
ccaagcccgg aattgtcttc atcaccaaaa ggggecatte cgtctgtacc aaccccagtg 
acaagtgggt ccaggactat atcaaggaca tgaaggagaa ctgagtgacc cagaaggggt 
ggegaaggea cagctcagag acataaagag aagatgccaa ggccccctcc tccacccacc 
gctaactctc agccccagtc accctcttgg agcttccctg ctttgaatta aagaccactc 
atgctctcca aaaaaaaaaa aaaaaggtca aaagct 



60 
111 



159 



207 



257 



317 
377 
437 
4 97 
533 



<210> 384 
<211> 472 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 92. .307 



<220> 

<221> sig_peptide 

<222> 92. .217 

<223> Von Heijne matrix 

score 4.40000009536743 
seq LRLEVAAAPGTPA/QP 



<400> 384 

aaaagaaatt cccgggcccg gccagcctgt gccgctgtta tgaggagaag caaagcccct 60 
tgeagaagea ettgeggtet gcagcctacg c atg aat agg ttg gca ggt gtg 112 

Met Asn Arg Leu Ala Gly Val 
-40 

ggc tgg egg gtg gac tac acc ctg age tec age ctg ctg caa tec gtg 160 
Gly Trp Arg Val Asp Tyr Thr Leu Ser Ser Ser Leu Leu Gin Ser Val 
-35 -30 -25 -20 

gaa gag ccc atg gtg cac ctg egg ctg gag gtg gca get gee cca ggg 208 
Glu Glu Pro Met Val His Leu Arg Leu Glu Val Ala Ala Ala Pro Gly 
-15 -10 -5 
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acc cca gcc cag cct gtt gcc atg tec etc tea gca gac aag ttc cag 256 
Thr Pro Ala Gin Pro Val Ala Met Ser Leu Ser Ala Asp Lys Phe Gin 

15 10 
gtc etc ctg gca gaa ctg aag cag gcc cag acc ctg atg age tec ctg 304 
Val Leu Leu Ala Glu Leu Lys Gin Ala Gin Thr Leu Met Ser Ser Leu 

15 20 25 

ggc tgaggagaag ggtgttccag gcctgtgtgg agccgccctg cccgtatgga 357 
Gly 
30 

gtcacgccct ctgaactget ettegggagg cagccctggt tetaggatge tgaggccctg 417 
gcccggactc tggcctccca gatccccagc tgcctcactt ctctcttgag aactt 472 



<210> 385 
<211> 514 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 113. .364 



<220> 

<221> sig_peptide 

<222> 113. . 172 

<223> Von Heijne matrix 

score 4.30000019073486 
seq SLLLSLPPHQGLT/FS 



<400> 385 

ttttttacat ggtgttccca cagctgggag gacacccaca tggtcggcgt gcaggatatt 60 
tcgctggacc ctagaaaagc caccacgacc tgtgggccat gatgctaccc ca atg get 118 

Met Ala 
-20 

166 



214 



get 


get 


get 


gtt 


cct 


tct 


ctt 


ctt 


ctt 


tct 


ctt 


cct 


cct 


cac 


cag 


ggg 


Ala 


Ala 


Ala 


Val 


Pro 


Ser 


Leu 


Leu 


Leu 


Ser 


Leu 


Pro 


Pro 


His 


Gin 


Gly 








-15 










-10 










-5 






etc 


act 


ttc 


tec 


aac 


aaa 


ata 


caa 


cct 


ttt 


gga 


get 


caa 


gga 


gtc 


ttg 


Leu 


Thr 


Phe 


Ser 


Asn 


Lys 


He 


Gin 


Pro 


Phe 


Gly 


Ala 


Gin 


Gly 


Val 


Leu 






1 








5 










10 










cat 


ccg 


gaa 


cca 


gga 


ctg 


cga 


gac 


tgg 


ctg 


ctg 


cca 


acg 


tgc 


tec 


aga 


His 


Pro 


Glu 


Pro 


Gly 


Leu 


Arg 


Asp 


Trp 


Leu 


Leu 


Pro 


Thr 


Cys 


Ser 


Arg 


15 










20 










25 










30 


caa 


ttg 


cga 


gtc 


gca 


ctg 


ccg 


gag 


aag 


ggg 


tec 


gag 


ggc 


agt 


ctg 


tgt 


Gin 


Leu 


Arg 


Val 


Ala 


Leu 


Pro 


Glu 


Lys 


Gly 


Ser 


Glu 


Gly 


Ser 


Leu 


Cys 










35 










40 










45 




caa 


acg 


cag 


ctg 


cca 


get 


act 


cca 


tgc 


ttc 


ctg 


cct 


teg 


aat 


acg 


gtg 


Gin 


Thr 


Gin 


Leu 


Pro 


Ala 


Thr 


Pro 


Cys 


Phe 


Leu 


Pro 


Ser 


Asn 


Thr 


Val 








50 










55 










60 







262 



310 



358 



aga acg tgaagtcatg agetgetget aaggcatgtg gcaaccttga agagaaggtc 414 
Arg Thr 

aagagctacc agccaccaaa agaatgecag cacttcctgt gtctttgett tggattcatg 474 

agaaatatac gttcctattt gcttcaaaaa aaaaaaaaaw 514 



<210> 386 
<211> 508 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 83. . 244 



<220> 



WO 02/083898 



PCT/IB01/00914 



221 

<221> sig_peptide 

<222> 83. . 139 

<223> Von Heijne matrix 

score 4.19999980926514 

seq LLILSKVLDDTFQ/SV 



<400> 386 

actgctatag 

gggagggacg 



tec aaa gtt 
Ser Lys Val 

tat ttt cag 
Tyr Phe Gin 
10 

get cat gec 
Ala His Ala 
25 

actttgggag 
catggtgaaa 
gcctgtagtc 
gagcttacag 
ctgttcaaaa 



cgatataact cagatgttct 
tgttaggaag ac atg aaa 
Met Lys 

eta gat gac act ttc 
Leu Asp Asp Thr Phe 
-5 

ttt act tta aga aat 
Phe Thr Leu Arg Asn 
15 

tgt aat ccc age act 
Cys Asn Pro Ser Thr 
30 

tccgaggtgg gcagatcatg 
ccccgtctct actaaaaata 
ccagctactc gggaggctga 
tgagecgaga tcgcgccact 
aaaa 



caggacagca 
ttc aat ttt 
Phe Asn Phe 
-15 

caa agt gtc 
Gin Ser Val 
1 

ggt tta atg 
Gly Leu Met 

ggc tea cgc 
Gly Ser Arg 

aggtcaggag 
caaaaaaatt 
ggcaggagaa 
gcactccagc 



agtttacatg tgaatagaga 
gta ctt ttg ata ctt 
Val Leu Leu lie Leu 
-10 

aag aaa tgg tta aat 
Lys Lys Trp Leu Asn 
5 

tgg cca ggt gcg gtg 
Trp Pro Gly Ala Val 
20 

ctg taatcccagc 
Leu 
35 

atcgagacca tcctggctaa 
agecaggcat ggtggcgggc 
tggcgtgaac ctgggaggcg 
ctgggcgaca gagegagact 



60 
112 



160 



208 



254 



314 
374 
434 
494 
508 



<210> 387 
<211> 505 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 55. .327 



<220> 

<221> sig_peptide 

<222> 55. . Ill 

<223> Von Heijne matrix 

score 4.19999980926514 
seq FVLLLSEIVSISA/LS 



<400> 387 

ttgtctttgg tagttttttt gcactaactt caggaaccag ctcatgatct cagg atg 57 

Met 

tat gga aaa ata ate ttt gta tta eta ttg tea gaa att gtg age ata 105 
Tyr Gly Lys lie lie Phe Val Leu Leu Leu Ser Glu lie Val Ser lie 

-15 -10 -5 

tea gca tta agt acc act gag gtg gca atg cac act tea acc tct tct 153 
Ser Ala Leu Ser Thr Thr Glu Val Ala Met His Thr Ser Thr Ser Ser 

15 10 
tea gtc aca aag agt tac ate tea tea cag aca aat gga gaa ayg gga 201 
Ser Val Thr Lys Ser Tyr lie Ser Ser Gin Thr Asn Gly Glu Xaa Gly 
15 20 25 30 

caa ctt gtc cat cgt ttc act gta cca get cct gta gtg ata ata etc 249 
Gin Leu Val His Arg Phe Thr Val Pro Ala Pro Val Val lie lie Leu 

35 40 45 

att att ttg tgt gtg atg get ggt att att gga acg ate etc tta att 297 
lie He Leu Cys Val Met Ala Gly He He Gly Thr lie Leu Leu He 

50 55 60 

tct tac agt att cgc cga ctg ata aag gca tgaggatgtg gcctgcatgc 347 
Ser Tyr Ser He Arg Arg Leu He Lys Ala 

65 70 
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tgcctgatct tgcctagaac cggctgcacc tgctgttctc ttgtttatgc aaactggctg 407 

cacctgctat tcctttgctt atgcccctac ccctggctat cctaattccc tgttctcctg 467 
cctcactatt actgtattct ctacttctaa ataaaaat 505 



<210> 388 
<211> 775 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 395. .736 



<220> 

<221> sig_peptide 

<222> 395. .490 

<223> Von Heijne matrix 

score 4.19999980926514 
seq LLGSGLFVFSLTA/FN 



<400> 388 

atgatgggta 

tggacaagct 

cgggcatttc 

tgccgcgcgg 

tcggcccacc 

tgactctccg 

ctcctccctc 



acaggaccgg 
ttcctcgtcc 
caccggacgg 
tgtggggagt 
gagcttccgg 
tcccttctcc 
ctgtccctgc 



tgggstcccc 
tctcccgaca 
gagggttcgg 
tggggcgtgt 
accggctgat 
catcccctcc 
tgctctttgc 



ctg gcc tec 
Leu Ala Ser 
-25 

ggt etc ttc 
Gly Leu Phe 



gtc 
Val 

tgc 
Cys 

gtc 

Val 

40 

aag 

Lys 



ttt ggc 
Phe Gly 
10 

etc ctg 
Leu Leu 
25 

acc acc 
Thr Thr 

ate tec 
lie Ser 



gcc aag gtc 
Ala Lys Val 

agttgattct 



acc gag tgg 
Thr Glu Trp 
-20 

gtg ttc teg 
Val Phe Ser 
-5 

aaa gga ttc 
Lys Gly Phe 

ttg get etc 
Leu Ala Leu 

tgc ttc ate 
Cys Phe lie 
45 

tec acc ctg 
Ser Thr Leu 
60 

aca ggc aag 
Thr Gly Lys 
75 

ttgtaaaaa 



etc acc 
Leu Thr 

etc act 
Leu Thr 

caa gca 
Gin Ala 
15 

ttt gca 
Phe Ala 
30 

ttc tec 
Phe Ser 

tac cag 
Tyr Gin 

age aag 
Ser Lys 



aggaagtcct agagggggtc 
gagctgacgt gtcctgggtt 
ggtgtccggg gctggggaat 
ggctgcagtc ccgggagttc 
ctgcccgtag ettgeeggag 
agtggtgggt acgggcacct 
tggg atg cag atg tac a 
Met Gin Met Tyr S 
-30 

ate cag ggc ggc ctg ctt 
lie Gin Gly Gly Leu Leu 
-15 

gcc ttc aat aat ctg gag 
Ala Phe Asn Asn Leu Glu 

1 5 
aag ate ttc cct gag att 
Lys lie Phe Pro Glu lie 
20 

tct ggc etc ate cac cga 
Ser Gly Leu lie His Arg 
35 

atg gtt ggt ctg tac tac 
Met Val Gly Leu Tyr Tyr 
50 

gca gca get cca gtc etc 
Ala Ala Ala Pro Val Leu 
65 

aag aga aac tgaccctgaa 
Lys Arg Asn 
8 0 



ggggtttggg 
ccaccgggag 
aegtaggggt 
ttggaggggg 
gagggeggas 
cgctggcgct 
gc cgt cag 
er Arg Gin 

ggt teg 
Gly Ser 
-10 
aat ctt 
Asn Leu 

etc ctg 
Leu Leu 

gtc tgt 
Val Cys 

ate aac 
lie Asn 

55 
aca cca 
Thr Pro 
70 

tgttcaataa 



60 
120 
180 
240 
300 
360 
415 



463 



511 



559 



607 



655 



703 



756 



775 



<210> 389 
<211> 466 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 166. .324 



<220> 



WO 02/083898 



PCT/IB01/00914 
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<221> sig_peptide 

<222> 166. .204 

<223> Von Heijne matrix 

score 4.09999990463257 

seq MAVAFVLSLGVAA/LY 



<400> 389 

attctgcgcc tgcgcgcggc tacagcacgg ttcgtttttc ctttagtcag gaaggacgtt 60 
ggtgttgagg ttagcatacg tatcaaggac agtaactacc atggctcccg aagttttgcc 120 
aaaacctcgg atgcgtggcc ttctggcagg cgtctgcgaa atcat atg get gta gca 177 

Met Ala Val Ala 
-10 

ttc gtg eta tec ctg ggg gtt gca get ttg tat aag ttt cgt gtg get 225 
Phe Val Leu Ser Leu Gly Val Ala Ala Leu Tyr Lys Phe Arg Val Ala 

-5 15 
gat caa aga aag aag gca tac gca gat ttc tac aga aac tac gat gtc 273 
Asp Gin Arg Lys Lys Ala Tyr Ala Asp Phe Tyr Arg Asn Tyr Asp Val 

10 15 20 

atg aaa gat ttt gag gag atg agg aag get ggt ate ttt cag agt gta 321 
Met Lys Asp Phe Glu Glu Met Arg Lys Ala Gly lie Phe Gin Ser Val 

25 30 35 

aag taatcttgga atataaagaa tttcttcagg ttgaattacc tagaagtttg 374 
Lys 
40 

tcactgactt gtgttcctga actatgacac atgaatatgt gggctaagaa atagttcctc 434 
ttgataaata aacaattaac aaatactttt ga 466 

<210> 390 
<211> 522 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 94 . .417 



<220> 

<221> sig_peptide 
<222> 94 . .213 
<223> Von Heijne matrix 
score 4 

seq VTAPSMVAPVTFA/SI 



<400> 390 

ctgtaatctc tgegaaaget tccaggttcg etcategtag gtcactggaa aaaattgtag 60 
cattgetaaa gagtatttca gagaaaattt tgc atg caa aaa ttt agg aag atg 114 

Met Gin Lys Phe Arg Lys Met 
-40 -35 
agt gaa act cat cat tct gta ate tct gtg aaa get tec agt ccc tgg 162 
Ser Glu Thr His His Ser Val lie Ser Val Lys Ala Ser Ser Pro Trp 

-30 -25 -20 

eta tct tct tea gtg act get cca tec atg gta gee cca gtc act ttt 210 
Leu Ser Ser Ser Val Thr Ala Pro Ser Met Val Ala Pro Val Thr Phe 

-15 -10 -5 

gca tct att gta gaa gaa gaa eta caa caa gaa gca get ctt att aga 258 
Ala Ser lie Val Glu Glu Glu Leu Gin Gin Glu Ala Ala Leu lie Arg 

15 10 15 

agt cga gaa aaa ccg ttg get ctg att cag att gag gag cat gee ata 306 
Ser Arg Glu Lys Pro Leu Ala Leu lie Gin lie Glu Glu His Ala lie 

20 25 30 

caa gat tta ttg gtt ttc tat gag gca ttt ggc aac cct gaa gag ttt 354 
Gin Asp Leu Leu Val Phe Tyr Glu Ala Phe Gly Asn Pro Glu Glu Phe 
35 40 45 
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gtc att gtt gaa agg aca ccg cag gga cca ctg gca gta cct atg tgg 402 
Val lie Val Glu Arg Thr Pro Gin Gly Pro Leu Ala Val Pro Met Trp 

50 55 60 

aat aag cat gga tgc tagttcactg tggagttgag atgcatttta cataattatg 457 
Asn Lys His Gly Cys 
65 

agtttgttca tataaagaaa gctgtggaaa agagtcttag agatttgaat atcgtctaaa 517 
tagat 522 

<210> 391 
<211> 409 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 9. . 389 

<220> 

<221> sig_peptide 

<222> 9. . 149 

<223> Von Heijne matrix 

score 3.90000009536743 

seq LSSDLQQVGGASA/RI 

<220> 

<221> misc_feature 

<222> 8,403 

<223> n=a, g, c or t 

<400> 391 

tgggagtn atg ttc acg cct mtg ama gtg aaa tarn gcg tac tac gac act 50 
Met Phe Thr Pro Xaa Xaa Val Lys Xaa Ala Tyr Tyr Asp Thr 
-45 -40 -35 

gaa cgc ate gga gtt gac mtg ate atg aag acc tgm ttt age ccc aac 98 
Glu Arg lie Gly Val Asp Xaa lie Met Lys Thr Xaa Phe Ser Pro Asn 

-30 -25 -20 

aga gtg att gga etc tea agt gac ttg cag caa gta gga ggg gca tea 14 6 

Arg Val lie Gly Leu Ser Ser Asp Leu Gin Gin Val Gly Gly Ala Ser 

-15 -10 -5 

get cgc ate mag gat gec ctg agt aca gtg ttg caa tat gca gag gat 194 
Ala Arg lie Xaa Asp Ala Leu Ser Thr Val Leu Gin Tyr Ala Glu Asp 

15 10 15 

gta ctg tct gga aag gtg tea get gac aat act gtg ggc cgc ttc ctg 242 
Val Leu Ser Gly Lys Val Ser Ala Asp Asn Thr Val Gly Arg Phe Leu 

20 25 30 

atg age ctg gtt aac caa gta ccg aaa ata gtt ccc gat gac ttt gag 290 
Met Ser Leu Val Asn Gin Val Pro Lys lie Val Pro Asp Asp Phe Glu 

35 40 45 

acc atg etc aac age aac ate aat gac mtt ttg atg gtg acc tac mtg 338 
Thr Met Leu Asn Ser Asn lie Asn Asp Xaa Leu Met Val Thr Tyr Xaa 

50 55 60 

gee aac etc aca cag tea cag att gca etc aat gaa aaa ctt gta aac 386 
Ala Asn Leu Thr Gin Ser Gin lie Ala Leu Asn Glu Lys Leu Val Asn 

65 70 75 

ctg tgaatggacc ccangcagta 409 
Leu 
80 

<210> 392 
<211> 431 
<212> DNA 
<213> Homo sapiens 



WO 02/083898 



PCT/IB01/00914 



225 

<220> 
<221> CDS 
<222> 60. .314 

<220> 

<221> sig_peptide 

<222> 60. .155 

<223> Von Heijne matrix 

score 3.79999995231628 

seq LAHIQALISGIEA/QL 

<220> 

<221> misc_f eature 
<222> 381 

<223> n=a, g, c or t 
<400> 392 

cataagaaca gggaggttag aagtagggtc ttgagattga gctgcagtca cagctgagc 59 



atg 


aaa 


get 


gcc 


ttg 


gaa 


gac 


aca 


ctg 


gca 


gaa 


acg 


gag 


gcg 


cgc 


ttt 


107 


Met 


Lys 


Ala 
-30 


Ala 


Leu 


Glu 


Asp 


Thr 
-25 


Leu 


Ala 


Glu 


Thr 


Glu 
-20 


Ala 


Arg 


Phe 




gga 


gcc 


cag 


ctg 


gcg 


cat 


ate 


cag 


gcg 


ctg 


ate 


age 


ggt 


att 


gaa 


gcc 


155 


Gly Ala 


Gin 


Leu 


Ala 


His 


He 


Gin 


Ala 


Leu 


He 


Ser 


Gly 


He 


Glu 


Ala 






-15 










-10 










-5 












cag 


ctg 


ggc 


gat 


gtg 


cga 


get 


gat 


agt 


gag 


egg 


cag 


aat 


cag 


gag 


tac 


203 


Gin 


Leu 


Gly 


Asp 


Val 


Arg 


Ala 


Asp 


Ser 


Glu 


Arg 


Gin 


Asn 


Gin 


Glu 


Tyr 




1 








5 










10 










15 






cag 


egg 


etc 


atg 


gac 


ate 


aag 


teg 


egg 


ctg 


gag 


cag 


gag 


att 


gcc 


ace 


251 


Gin 


Arg 


Leu 


Met 
20 


Asp 


He 


Lys 


Ser 


Arg 
25 


Leu 


Glu 


Gin 


Glu 


He 
30 


Ala 


Thr 




tac 


cgc 


age 


ctg 


etc 


gag 


gga 


cag 


gaa 


gat 


cac 


tac 


aac 


aat 


ttg 


tct 


299 


Tyr 


Arg 


Ser 
35 


Leu 


Leu 


Glu 


Gly 


Gin 
40 


Glu 


Asp 


His 


Tyr 


Asn 
45 


Asn 


Leu 


Ser 





gcc tec aag gtc etc tgaggcagca ggctctgggg ettctgetgt cctttggagg 354 
Ala Ser Lys Val Leu 
50 

gtgtcttctg ggtagaggga tgggaangaa gggaccctta cccccggctc ttctcctgac 414 
ctgccaataa aaattta 431 



<210> 393 
<211> 551 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 232. .534 

<220> 

<221> sig_peptide 

<222> 232. .306 

<223> Von Heijne matrix 

score 3.70000004768372 

seq AKTCLVLCSRVLS/VI 

<400> 393 

tatcactgtt acgaaccaag gatttacaga tcactggcaa aaattctgag aactttcaca 60 
ccagtatact gtccaagccc attaagtggc atcacacctc tcttttatgt agctcagaca 120 
agacagtcta atatcttcaa aatactactg caatatggaa tcttagaaag agaaaaaaac 180 
cctatcaaca ttgtcttaac aatagtactc tacccttcga gagtaagagt a atg gtt 237 

Met Val 
-25 
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gat 


cgt 


gaa 


ttg 


get 


gac 


ate 


cat 


gaa 


gat 


gee 


aaa 


aca 


tgt 


ttg 


gta 


285 


Asp 


Arg 


Glu 


Leu 


Ala 


Asp 


He 


His 


Glu 


Asp 


Ala 


Lys 


Thr 


Cys 


Leu 


Val 










-20 










-15 










-10 








eta 


tgt 


tec 


aga 


gtg 


ctt 


tct 


gtc 


att 


tea 


gtc 


aag 


gaa 


ata 


aag 


aca 


333 


Leu 


Cys 


Ser 
-5 


Arg 


Val 


Leu 


Ser 


Val 
1 


He 


Ser 


Val 


Lys 
5 


Glu 


He 


Lys 


Thr 




cag 


ctg 


agt 


tta 


gga 


aga 


cat 


cca 


att 


att 


tea 


aat 


tgg 


ttt 


gat 


tac 


381 


Gin 


Leu 


Ser 


Leu 


Gly 


Arg 


His 


Pro 


He 


He 


Ser 


Asn 


Trp 


Phe 


Asp 


Tyr 




10 










15 










20 










25 




att 


cct 


tea 


aca 


aga 


tac 


aaa 


gat 


cca 


tgt 


gaa 


eta 


tta 


cat 


ctt 


tgc 


429 


He 


Pro 


Ser 


Thr 


Arg 


Tyr 


Lys 


Asp 


Pro 


Cys 


Glu 


Leu 


Leu 


His 


Leu 


Cys 












30 










35 










40 






aga 


eta 


ace 


ate 


agg 


aat 


caa 


eta 


tta 


ace 


aac 


aat 


atg 


etc 


cca 


gat 


477 


Arg 


Leu 


Thr 


He 


Arg 


Asn 


Gin 


Leu 


Leu 


Thr 


Asn 


Asn 


Met 


Leu 


Pro 


Asp 










45 










50 










55 








gga 


ata 


ttt 


tea 


ctt 


eta 


att 


cct 


get 


cgt 


eta 


caa 


aac 


tat 


ctg 


aat 


525 


Gly 


He 


Phe 


Ser 


Leu 


Leu 


He 


Pro 


Ala 


Arg 


Leu 


Gin 


Asn 


Tyr 


Leu 


Asn 





60 65 70 



tta gaa ate taacatacgt cagtgtc 551 
Leu Glu He 
75 



<210> 394 
<211> 529 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 78. .281 



<220> 

<221> sig_peptide 

<222> 78. .218 

<223> Von Heijne matrix 

score 3.70000004768372 
seq PLLSLHSRGGSSS/ES 



<400> 394 

ttttttgega aeggegagea gcggcgcgcg eggagagacg ageggaggtt ttcctggttt 60 
cggaccccag eggcegg atg gtg aaa tec tec ctg cag egg ate etc aat 110 

Met Val Lys Ser Ser Leu Gin Arg He Leu Asn 
-45 -40 



age 


cac 


tgc 


ttc 


gee 


aga 


gag 


aag 


gaa 


ggg 


gat 


aaa 


ccc 


age 


gee 


ace 


158 


Ser 


His 
-35 


Cys 


Phe 


Ala 


Arg 


Glu 
-30 


Lys 


Glu 


Gly 


Asp 


Lys 
-25 


Pro 


Ser 


Ala 


Thr 




ate 


cac 


gec 


age 


cgc 


ace 


atg 


ccg 


etc 


eta 


age 


ctg 


cac 


age 


cgc 


ggc 


206 


He 


His 


Ala 


Ser 


Arg 


Thr 


Met 


Pro 


Leu 


Leu 


Ser 


Leu 


His 


Ser 


Arg 


Gly 




-20 










-15 










-10 










-5 




ggc 


age 


age 


agt 


gag 


agt 


tec 


agg 


gtc 


tec 


etc 


cac 


tgc 


tgt 


agt 


aac 


254 


Gly 


Ser 


Ser 


Ser 


Glu 

1 


Ser 


Ser 


Arg 


Val 
5 


Ser 


Leu 


His 


Cys 


Cys 
10 


Ser 


Asn 




ccg 


ggt 


ccg 


ggg 


cct 


egg 


tgg 


tgc 


tec 


tgatgcccct cacccacccc 




301 


Pro 


Gly 


Pro 
15 


Gly 


Pro 


Arg 


Trp 


Cys 
20 


Ser 



















tgaagatccc aggtgggcga gggaatagtc agagggatca caatctttca gctaacttat 361 

tctactccga tgatcggctg aatgtaacag aggaactaac gtccaacgac aagacgagga 421 

ttctcaacgt ccagtccagg ctcacagacg ccaaacgcat taactggcga acagtgctga 481 

gtggcggcas tcctacatcg agatcceggg cggcgcgctg ccgagggg 529 



<210> 395 
<211> 491 
<212> DNA 



WO 02/083898 

<213> Homo sapiens 



227 



PCT/IB01/00914 



<220> 
<221> CDS 
<222> 80. .283 



<220> 

<221> sig_peptide 

<222> 80. .220 

<223> Von Heijne matrix 

score 3.70000004768372 
seq PLLSLHSRGGSSS/ES 



<400> 395 

aktttgcgaa cggcgagcag cggcggcggc gcggagagac 
ttcggacccc agcggccgg atg gtg aaa tec tec ctg 

Met Val Lys Ser Ser Leu 
-45 

age cac tgc ttc gee aga gag aag gaa ggg gat 
Ser His Cys Phe Ala Arg Glu Lys Glu Gly Asp 



-35 



-30 



ate cac gee age cgc ace atg ccg etc eta age 
lie His Ala Ser Arg Thr Met Pro Leu Leu Ser 
-20 -15 -10 

ggc age age agt gag agt tec agg gtc tec etc 
Gly Ser Ser Ser Glu Ser Ser Arg Val Ser Leu 

1 ' 5 

ccg ggt ccg ggg cct egg tgg tgc tec tgatgece 
Pro Gly Pro Gly Pro Arg Trp Cys Ser 

15 20 
tgaagatccc aggtgggcga gggaatagtc agagggatca 
tctactccga tgatcggctg aatgtaacag aggaactaac 
ttctcaacgt tcagtccagg ctcacaggaa cgccaaacgc 
gagtggcg 



geageggagg ttttcctggt 60 
cag egg ate etc aat 112 
Gin Arg lie Leu Asn 
-40 

aaa ccc age gee ace 160 

Lys Pro Ser Ala Thr 

-25 

ctg cac age cgc ggc 208 
Leu His Ser Arg Gly 
-5 

cac tgs tgt agt aac 256 
His Xaa Cys Ser Asn 
10 

ct cacccacccc 303 



caatctttca gctaacttat 363 
gtscaacgac aagacgagga 423 
attaactgee gaacagtgct 483 

491 



<210> 396 
<211> 503 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 9. . 395 



<220> 

<221> sig_peptide 
<222> 9. . 56 

<223> Von Heijne matrix 

score 3.59999990463257 
seq LLLLLRALRRGPG/PG 



<400> 396 

aggecaac atg gee gtg ctg ctg ctg ctg etc cgt gee etc cgc egg ggt 50 
Met Ala Val Leu Leu Leu Leu Leu Arg Ala Leu Arg Arg Gly 
-15 -10 " -5 

cca ggc ccg ggt cct egg ccg ctg tgg ggc cca ggc ccg gee tgg agt 98 
Pro Gly Pro Gly Pro Arg Pro Leu Trp Gly Pro Gly Pro Ala Trp Ser 

1 5 10 

cca ggg ttc ccc gee agg ccc ggg agg ggg egg ccg tac atg gee age 14 6 

Pro Gly Phe Pro Ala Arg Pro Gly Arg Gly Arg Pro Tyr Met Ala Ser 
15 20 25 30 

agg cct ccg ggg gac etc gee gag get gga ggc cga get ctg cag age 194 
Arg Pro Pro Gly Asp Leu Ala Glu Ala Gly Gly Arg Ala Leu Gin Ser 
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35 










40 










45 






tta 


caa 


ttg 


aga 


ctg 


eta 


acc 


cct 


acc 


ttt 


gaa 


ggg 


ate 


aac 


gga 


ttg 


242 


Leu 


Gin 


Leu 


Arg 


Leu 


Leu 


Thr 


Pro 


Thr 


Phe 


Glu 


Gly 


He 


Asn 


Gly 


Leu 










50 










55 










60 








ttg 


ttg 


aaa 


caa 


cat 


tta 


gtt 


cag 


aat 


cca 


gtc 


aga 


etc 


tgg 


caa 


ctt 


290 


Leu 


Leu 


Lys 


Gin 


His 


Leu 


Val 


Gin 


Asn 


Pro 


Val 


Arg 


Leu 


Trp 


Gin 


Leu 








65 










70 










75 










tta 


ggt 


ggt 


act 


ttc 


tat 


ttt 


aac 


acc 


tea 


agg 


ttg 


aag 


cag 


aag 


aat 


338 


Leu 


Gly 


Gly 


Thr 


Phe 


Tyr 


Phe 


Asn 


Thr 


Ser 


Arg 


Leu 


Lys 


Gin 


Lys 


Asn 






80 










85 










90 












aag 


gag 


aag 


gat 


aag 


teg 


aag 


ggg 


aag 


gcg 


cct 


gaa 


gag 


gac 


gaa 


ggt 


386 


Lys 


Glu 


Lys 


Asp 


Lys 


Ser 


Lys 


Gly 


Lys 


Ala 


Pro 


Glu 


Glu 


Asp 


Glu 


Gly 




95 










100 










105 










110 





ata ttc ate tgatgttctt cagtcagtag ctgcctctgg atgtctttac 435 
He Phe He 

atttctgttt weettttage aaggtgaaac cagtctggac aatggggaga tgggccgggt 4 95 

gcagtggc 503 



<210> 397 
<211> 517 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 93. .419 



<220> 

<221> sig_peptide 

<222> 93. . 191 

<223> Von Heijne matrix 

score 3.59999990463257 
seq LAVFQMLKSMCAG/QR 



<400> 397 

gaaaaaaagc gaaggcegge egggegggga agggaaatgg cgaggcagga gtgcggggga 60 



gggastggtc cttagctgaa tgcgcctgcg tt 


atg 


gcg 


gee 


tec 


ggc 


gec 


cca 


113 




















Met 


Ala 


Ala 


Ser 
-30 


Gly 


Ala 


Pro 




agg 


ate 


ctg 


gtg 


gac 


ctg 


ctg 


aag 


ctg 


aac 


gtg 


gee 


ccc 


etc 


gee 


gtc 


161 


Arg 


He 


Leu 


Val 


Asp 


Leu 


Leu 


Lys 


Leu 


Asn 


Val 


Ala 


Pro 


Leu 


Ala 


Val 






-25 










-20 










-15 












ttc 


cag 


atg 


etc 


aag 


tec 


atg 


tgt 


gee 


ggg 


cag 


agg 


eta 


gcg 


age 


gag 


209 


Phe 


Gin 


Met 


Leu 


Lys 


Ser 


Met 


Cys 


Ala 


Gly 


Gin 


Arg 


Leu 


Ala 


Ser 


Glu 




-10 










-5 










1 








5 






ccc 


cag 


gac 


cct 


gcg 


gee 


gtg 


tct 


ctg 


ccc 


acg 


teg 


age 


gtg 


ccc 


grg 


257 


Pro 


Gin 


Asp 


Pro 


Ala 


Ala 


Val 


Ser 


Leu 


Pro 


Thr 


Ser 


Ser 


Val 


Pro 


Xaa 










10 










15 










20 








acc 


cga 


ggg 


aga 


aac 


aaa 


ggc 


age 


get 


gee 


etc 


ggg 


gga 


gca 


ttg 


gee 


305 


Thr 


Arg 


Gly 


Arg 


Asn 


Lys 


Gly 


Ser 


Ala 


Ala 


Leu 


Gly 


Gly 


Ala 


Leu 


Ala 








25 










30 










35 










ctg 


gcg 


gaa 


csm 


rrc 


age 


cgc 


gaa 


gga 


tec 


age 


cag 


agg 


atg 


cca 


cgc 


353 


Leu 


Ala 


Glu 


Xaa 


Xaa 


Ser 


Arg 


Glu 


Gly 


Ser 


Ser 


Gin 


Arg 


Met 


Pro 


Arg 






40 










45 










50 












cag 


ccc 


age 


get 


acc 


agg 


ctg 


ccc 


aag 


ggg 


ggc 


ggg 


cct 


ggg 


aag 


age 


401 


Gin 


Pro 


Ser 


Ala 


Thr 


Arg 


Leu 


Pro 


Lys 


Gly 


Gly 


Gly 


Pro 


Gly 


Lys 


Ser 




55 










60 










65 










70 





cct aca egg ggc age acc taggatgggg cagagacttg ttgcatcttt 44 9 

Pro Thr Arg Gly Ser Thr 
75 

gtccccagca aaggctacat gttacctcct tcaattgata ataaaccttt ctgagatgag 509 
agggtcca 517 
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<210> 398 
<211> 507 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 78. .386 



<220> 

<221> sig_peptide 

<222> 78 . . 185 

<223> Von Heijne matrix 

score 3.59999990463257 
seq RALSTFLFGSIRG/AA 



<400> 398 

aagtgcgcag acgcaggggt cggcgccggg tgagagcgtg cggccgggta agggcgtgtg 60 
gccggattca ccacaac atg gca aat ctt ttt ata agg aaa atg gtg aac 110 

Met Ala Asn Leu Phe lie Arg Lys Met Val Asn 

158 



206 















-35 










-30 










cct 


ctg 


etc 


tat 


etc 


agt 


cgt 


cac 


acg 


gtg 


aag 


cct 


cga 


gec 


etc 


tec 


Pro 


Leu 


Leu 


Tyr 


Leu 


Ser 


Arg 


His 


Thr 


Val 


Lys 


Pro 


Arg 


Ala 


Leu 


Ser 


-25 










-20 










-15 










-10 


aca 


ttt 


eta 


ttt 


gga 


tec 


att 


cga 


ggt 


gca 


gec 


ccc 


gtg 


get 


gtg 


gaa 


Thr 


Phe 


Leu 


Phe 


Gly 
-5 


Ser 


He 


Arg 


Gly 


Ala 
1 


Ala 


Pro 


Val 


Ala 
5 


Val 


Glu 


ccc 


ggg 


gca 


gca 


gtg 


cgc 


tea 


ctt 


etc 


tea 


ccc 


ggc 


etc 


ctg 


ccc 


cat 


Pro 


Gly 


Ala 


Ala 


Val 


Arg 


Ser 


Leu 


Leu 


Ser 


Pro 


Gly 


Leu 


Leu 


Pro 


His 






10 










15 










20 








ctg 


ctg 


cct 


gcg 


ctg 


ggg 


ttc 


aaa 


aac 


aag 


act 


gtc 


ctt 


aag 


aag 


cgc 


Leu 


Leu 


Pro 


Ala 


Leu 


Gly 


Phe 


Lys 


Asn 


Lys 


Thr 


Val 


Leu 


Lys 


Lys 


Arg 




25 










30 










35 










tgc 


aag 


gac 


tgt 


tac 


ctg 


gtg 


aag 


agg 


egg 


ggt 


egg 


tgg 


tac 


gtc 


tac 


Cys 


Lys 


Asp 


Cys 


Tyr 


Leu 


Val 


L Y S 


Arg 


Arg 


Gly 


Arg 


Trp 


Tyr 


Val 


Tyr 


40 










45 










50 










55 


tgt 


aaa 


acc 


cat 


ccg 


agg 


cac 


aag 


cag 


aga 


cag 


atg 


tagacccttt 




Cys 


Lys 


Thr 


His 


Pro 


Arg 


His 


Lys 


Gin 


Arg 


Gin 


Met 











254 



302 



350 



396 



60 65 
ccctccagag tcacgcacat actcgtcatc gcatcacttg ggagaatggt tgtatcttat 456 
ggaaggaatt atcacatcaa ggagtcaggg gamagtgact gggaagcaaa c 507 



<210> 399 
<211> 493 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 58. .315 



<220> 

<221> sig_peptide 

<222> 58 . .165 

<223> Von Heijne matrix 

score 3.59999990463257 
seq SVSLSGFTVGTLS/ET 



<400> 399 

ctgtgggaga ggccaggatg ctgeattagg cacaggataa cctgggaacc caggcac 57 
atg ggt cct get etc cga agt ctg caa gtc aag aag gga aca gag cac 105 
Met Gly Pro Ala Leu Arg Ser Leu Gin Val Lys Lys Gly Thr Glu His 
-35 -30 -25 
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gcc gac cct etc cct ttc ccc tct gtc tct ctt agt ggc ttt aca gtg 153 

Ala Asp Pro Leu Pro Phe Pro Ser Val Ser Leu Ser Gly Phe Thr Val 

-20 -15 -10 -5 

ggt acc ctg tea gaa ace age act ggg ggc cct gcc acc ccc aca tgg 201 

Gly Thr Leu Ser Glu Thr Ser Thr Gly Gly Pro Ala Thr Pro Thr Trp 

15 10 
aag gag tgt cct ate tgt aag gag cgc ttt cct get gag agt gac aag 24 9 

Lys Glu Cys Pro lie Cys Lys Glu Arg Phe Pro Ala Glu Ser Asp Lys 

15 20 25 

gat gcc ctg gag gac cac atg gat gga cac ttc ttt ttc age acc cag 297 
Asp Ala Leu Glu Asp His Met Asp Gly His Phe Phe Phe Ser Thr Gin 

30 35 40 

gac ccc ttc acc ttt gag tgatcttact ccctcgtaca tgcacaaata 345 
Asp Pro Phe Thr Phe Glu 
45 50 

cacactcatg cacacacaca ctcacacaca tgcatacact taggtttcat gcccattttc 405 
tatcacactg ggctccatga tattctgttc cctaagaact gcttctgtgt gccctgtttt 465 
catcccaaga tttctcactt catcctct 493 



<210> 400 
<211> 769 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 506. . 661 



<220> 

<221> sig_peptide 
<222> 506. .595 
<223> Von Heijne matrix 
score 3.5 

seq VKLLLKMVS SATT / ER 



<400> 400 

cttggctctt aaagccaggt tgagggttat gtgtttttat 
cttaggcttg gcaactagtt cattgaaaca gagecaacaa 
gatatgaagc aagctgtggt catttatcta ctaccttcca 
taagtagtgt ttgtagtgta gttgtcagta acttttctca 
aattcattta caaatgtgtg attattacga tagatgttat 
ttaatacagc atggacatgt ttctaaatca ataaatatag 
gggatgtaaa gtatgtatta caacaggtaa atccatttct 
aatgttttgt ttgttttgtt getgeatata atgttgcact 
ttggtctcct aatyctacag gataa atg tct age tgt 

Met Ser Ser Cys 

-30 

ttt cag tta gat get gtg aaa ttg 
Phe Gin Leu Asp Ala Val Lys Leu 
-15 

acc aca gaa egg tgt tgt aat gga 
Thr Thr Glu Arg Cys Cys Asn Gly 

1 5 
tgt gca aca ggt att aag aat ttt 
Cys Ala Thr Gly lie Lys Asn Phe 
15 20 
atcccagcac tttgggaggc ctaggcaggt 
gcctgaccaa catggtga 



tea gtt teg 
Ser Val Ser 

-20 
tec tct gcc 
Ser Ser Ala 
-5 

aaa aac ttg 
Lys Asn Leu 

cacgcctgta 
ttccagacca 



tcttttawat 

gaatgagtga 

tacctacttc 

gcttacataa 

tttataaatt 

cttamccaca 

ctgttttggg 

atatatatat 

ggc att gtt 

Gly lie Val 

-25 
etc ttg aaa 
Leu Leu Lys 
-10 

agt gcc aat 
Ser Ala Asn 

tgaggtcggg 



ctccttttca 
gtcactcagt 
ctaggtttga 
gtttatgtgg 
tcaaggttct 
gactttttat 
aaagcaggtt 
ketaatgeag 
gga tea 
Gly Ser 

atg gtg 
Met Val 

ttt cat 
Phe His 
10 

egcagtaget 



ggatcacctg aggtcaggag 



60 
120 
180 
240 
300 
360 
420 
480 
532 



580 



628 



681 



741 
769 



<210> 401 
<211> 646 
<212> DNA 
<213> Homo sapiens 
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<220> 
<221> CDS 
<222> 76. . 621 



<400> 401 

attacgagat tggcttggat tctgtcggat ggacttgggg ctagctgcgg cggggctgga 60 
ggaggccaga taacc atg tea gec aca gtt gta gat gca gtt aat get gca 111 
Met Ser Ala Thr Val Val Asp Ala Val Asn Ala Ala 
15 10 



ccc 


eta 


teg 


ggg 


tec 


aaa 


gaa 


atg 


agt 


ttg 


gaa 


gaa 


cca 


aag 


aag 


atg 


159 


Pro 


Leu 


Ser 


Gly 


Ser 


Lys 


Glu 


Met 


Ser 


Leu 


Glu 


Glu 


Pro 


Lys 


Lys 


Met 








15 










20 










25 










acc 


aga 


gag 


gac 


tgg 


aga 


aag 


aag 


aag 


gag 


eta 


gaa 


gaa 


cag 


cga 


aaa 


207 


Thr 


Arg 


Glu 


Asp 


Trp 


Arg 


Lys 


Lys 


Lys 


Glu 


Leu 


Glu 


Glu 


Gin 


Arg 


Lys 






30 










35 










40 












ttg 


ggc 


aat 


get 


cct 


gca 


gaa 


gtt 


gat 


gaa 


gaa 


gga 


aaa 


gac 


ate 


aac 


255 


Leu 


Gly 


Asn 


Ala 


Pro 


Ala 


Glu 


Val 


Asp 


Glu 


Glu 


Gly 


Lys 


Asp 


He 


Asn 




45 










50 










55 










60 




ccc 


cat 


att 


cct 


cag 


tat 


att 


tct 


tea 


gtg 


cca 


tgg 


tat 


att 


gat 


cct 


303 


Pro 


His 


He 


Pro 


Gin 


Tyr 


He 


Ser 


Ser 


Val 


Pro 


Trp 


Tyr 


He 


Asp 


Pro 












65 










70 










75 






tea 


aaa 


aga 


cct 


act 


tta 


aaa 


cac 


cag 


aga 


cca 


caa 


cca 


gaa 


aaa 


caa 


351 


Ser 


Lys 


Arg 


Pro 


Thr 


Leu 


Lys 


His 


Gin 


Arg 


Pro 


Gin 


Pro 


Glu 


Lys 


Gin 










80 










85 










90 








aag 


cag 


ttc 


age 


tea 


tct 


gga 


gaa 


tgg 


tac 


aag 


agg 


ggt 


gta 


aaa 


gag 


399 


L ys 


Gin 


Phe 


Ser 


Ser 


Ser 


Gly 


Glu 


Trp 


Tyr 


Lys 


Arg 


Gly 


Val 


Lys 


Glu 








95 










100 










105 










aat 


tec 


ata 


att 


act 


aag 


tac 


cgc 


aaa 


gga 


gca 


tgt 


gaa 


aat 


tgt 


ggg 


447 


Asn 


Ser 


He 


He 


Thr 


Lys 


Tyr 


Arg 


Lys 


Gly 


Ala 


Cys 


Glu 


Asn 


Cys 


Gly 






110 










115 










120 












gee 


atg 


aca 


cac 


aaa 


aag 


aaa 


gac 


tgc 


ttt 


gag 


aga 


cct 


agg 


cga 


gtt 


495 


Ala 


Met 


Thr 


His 


Lys 


Lys 


Lys 


Asp 


Cys 


Phe 


Glu 


Arg 


Pro 


Arg 


Arg 


Val 




125 










130 










135 










140 




gga 


gee 


aaa 


ttt 


aca 


ggt 


act 


aat 


ata 


get 


cca 


gat 


gaa 


cat 


gtc 


cag 


543 


Gly 


Ala 


Lys 


Phe 


Thr 


Gly 


Thr 


Asn 


He 


Ala 


Pro 


Asp 


Glu 


His 


Val 


Gin 












145 










150 










155 






cct 


caa 


ctg 


atg 


ttt 


gac 


tat 


gat 


ggg 


aag 


agg 


gat 


egg 


tgg 


aat 


ggc 


591 


Pro 


Gin 


Leu 


Met 


Phe 


Asp 


Tyr 


Asp 


Gly 


Lys 


Arg 


Asp 


Arg 


Trp 


Asn 


Gly 










160 










165 










170 








tac 


aat 


cca 


gaa 


gaa 


cac 


atg 


aaa 


att 


gtt 


tgaagagtat gcaaagtgat 


641 


Tyr 


Asn 


Pro 


Glu 


Glu 


His 


Met 


Lys 


He 


Val 

















175 180 
ttggc 646 



<210> 402 
<211> 484 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 29. . 469 



<400> 402 

acttggegag tgagacgctg atgggagg atg gac gta ctg gtg tct gag tgc 52 

Met Asp Val Leu Val Ser Glu Cys 
1 5 

tec gcg egg ctg ctg cag cag gaa gaa gag att aaa tct ctg act get 100 
Ser Ala Arg Leu Leu Gin Gin Glu Glu Glu He Lys Ser Leu Thr Ala 

10 15 20 

gaa att gac egg ttg aaa aac tgt ggc tgt tta gga get tct cca aat 148 
Glu He Asp Arg Leu Lys Asn Cys Gly Cys Leu Gly Ala Ser Pro Asn 



WO 02/083898 PCT/IB01/00914 

232 



25 










30 










35 










40 




ttg 


gag 


cag 


tta 


caa 


gaa 


gaa 


aat 


tta 


aaa 


tta 


aag 


tat 


cga 


ctg 


aat 


196 


Leu 


Glu 


Gin 


Leu 


Gin 
45 


Glu 


Glu 


Asn 


Leu 


Lys 
50 


Leu 


Lys 


Tyr 


Arg 


Leu 
55 


Asn 




att 


ctt 


cga 


aag 


agt 


ctt 


cag 


gca 


gaa 


agg 


aac 


aaa 


cca 


act 


aaa 


aat 


244 


lie 


Leu 


Arg 


Lys 
60 


Ser 


Leu 


Gin 


Ala 


Glu 
65 


Arg 


Asn 


Lys 


Pro 


Thr 
70 


Lys 


Asn 




atg 


att 


aac 


att 


att 


age 


cgc 


eta 


caa 


gag 


gtc 


ttt 


ggt 


cat 


gca 


att 


292 


Met 


He 


Asn 
75 


He 


He 


Ser 


Arg 


Leu 
80 


Gin 


Glu 


Val 


Phe 


Gly 
85 


His 


Ala 


He 




aag 


get 


gca 


tat 


cca 


gat 


ttg 


gaa 


aat 


cct 


cct 


ctg 


eta 


gtg 


aca 


cca 


340 


Lys 


Ala 
90 


Ala 


Tyr 


Pro 


Asp 


Leu 
95 


Glu 


Asn 


Pro 


Pro 


Leu 
100 


Leu 


Val 


Thr 


Pro 




agt 


cag 


cag 


gec 


aag 


ttt 


ggg 


gac 


tat 


cag 


tgt 


aat 


agt 


get 


atg 


ggt 


388 


Ser 


Gin 


Gin 


Ala 


Lys 


Phe 


Gly Asp 


Tyr 


Gin 


Cys 


Asn 


Ser 


Ala 


Met 


Gly 




105 










110 










115 










120 




att 


tct 


cag 


gtg 


atg 


tat 


tgt 


cat 


gac 


tct 


tgg 


ctg 


ttt 


gat 


ttt 


ttt 


436 


He 


Ser 


Gin 


Val 


Met 
125 


Tyr 


Cys 


His 


Asp 


Ser 
130 


Trp 


Leu 


Phe 


Asp 


Phe 
135 


Phe 




aag 


tat 


tat 


tat 


cat 


cat 


tgc 


cat 


tta 


cag 


aaa 


taatactatt acaag 


484 


Lys 


Tyr 


Tyr 


Tyr 
140 


His 


His 


Cys 


His 


Leu 
145 


Gin 


Lys 















<210> 403 
<211> 345 
<212> DNA 
<213> Homo sapiens 



<400> 403 

ctttctccag 

gagcgtggat 

gcgaatgctg 

ctgagggctc 

ttccccccgt 

ctactaataa 



ctcaagcgtc 
ccagttcgcg 
cttctcctcc 
cgacgagccg 
cgccgctgcc 
atgetgaect 



tggtatttgg 
gcggggttgt 
tccttcagcg 
cccgtcccag 
gccttctcct 
gatcaaatgg 



tgtagggaca 
ttgggtcaag 
aacagectgg 
tcgcggcccc 
gcaggaacga 
ageccaaaar 



gegcttttte 
ttgctgctgc 
gatcgggtgg 
gatccctgcc 
gttgcacatg 
aaaaa 



cgggtggagc 
tgetgegget 
tegctgeaag 
ccaggcccgg 
ttactgette 



60 
120 
180 
240 
300 
345 



<210> 404 
<211> 432 
<212> DNA 
<213> Homo sapiens 



<400> 404 

accaggactc 

aactggggga 

agcgtttcaa 

catctegggg 

caagcatctc 

ccccacccca 

atcctttcat 

aaaaaaaaaa 



caaaatggcg 
gctgccaagc 
agaggttact 
attaccatgg 
aagcacgagc 
cgaccttggc 
atcctaatag 
ag 



tcagttgtac 
tggatcttga 
aceggtacta 
tgctggcatg 
ggctccgcaa 
ccgagcccct 
gaattaacct 



cagtgaagga 
tgegggaett 
caacaagtac 
ctacgtgctc 
ataccactga 
ccgtgagaac 
ccaaataaaa 



caagaaactt 
cagtctagtg 
atcaatgtga 
tttagctact 
agaggacaca 
acaatctcaa 
catgactggt 



ctggaggtca 
geattttegg 
agaaggggag 
ccttttccta 
ctctgcaccc 
tcgttgctga 
acgyraaaaa 



60 
120 
180 
240 
300 
360 
420 
432 



<210> 405 
<211> 428 
<212> DNA 
<213> Homo sapiens 



<400> 405 

actccgcctt ccacgtgcac ccactgcctc ttcccttctc gcttgggaac tctagtctcg 60 

cctcgggttg caatggaccc caactgctcc tgtgccgctg caggtgtctc ctgcacctgc 120 

gccagctcct gcaagtgcaa agagtgcaaa tgcacctcct gcaagaagag ctgctgctcc 180 

tgctgccctg tgggctgtgc caagtgtgcc cagggctgea tetgeaaagg ggcateggag 240 

aagtgcagct gctgcgcctg atgtcgggac agccctgctc ccaagtacaa atagagtgac 300 

cegtaaaate caggattttt tgttttttgc tacaatcttg acccctttgc tacattcctt 360 
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tttttctgtg aaatatgtga ataataatta aacacttaga caaaaaaaaa aaaaaaaggt 420 
caaaagct 428 

<210> 406 
<211> 391 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 196. .390 



<220> 

<221> sig_peptide 

<222> 196. .279 

<223> Von Heijne matrix 

score 5.90000009536743 
seq IILLLLKIWKSRS/CA 



<400> 406 

ctcttcccgg ctccagctcc gccgccagct ccagcctttg ctccccctcc caaagtcccc 60 
tccccggagc ggagcgcacc tagggtccct cttccgtccc cccagcccag ctacccgttc 120 
agaccagcag cctcgggggg cacccccccg ccagcctgcc tccctcccgc tcagccctgc 180 
cagggttccc cagcc atg aat etc ttc cga ttc ctg gga gac etc tec cac 231 
Met Asn Leu Phe Arg Phe Leu Gly Asp Leu Ser His 
-25 ~ -20 



etc 


etc 


gec 


ate 


ate 


ttg 


eta 


ctg 


etc 


aaa 


ate 


tgg 


aag 


tec 


cgc 


teg 


279 


Leu 


Leu 
-15 


Ala 


lie 


He 


Leu 


Leu 
-10 


Leu 


Leu 


Lys 


He 


Trp 
-5 


Lys 


Ser 


Arg 


Ser 




tgc 


gee 


gec 


cac 


ccc 


cag 


ctt 


cct 


etc 


tec 


ttc 


tgt 


ctg 


tct 


gtc 


tgt 


327 


Cys 


Ala 


Ala 


His 


Pro 


Gin 


Leu 


Pro 


Leu 


Ser 


Phe 


Cys 


Leu 


Ser 


Val 


Cys 




1 








5 










10 










15 






ctg 


tct 


gtc 


tct 


etc 


tct 


etc 


tct 


gtc 


tct 


etc 


tct 


etc 


tct 


ttc 


tct 


375 


Leu 


Ser 


Val 


Ser 
20 


Leu 


Ser 


Leu 


Ser 


Val 
25 


Ser 


Leu 


Ser 


Leu 


Ser 
30 


Phe 


Ser 




gtc 


tea 


aaa 


aaa 


aaa 


t 






















3 91 


Val 


Ser 


Lys 
35 


Lys 


Lys 



























<210> 407 
<211> 500 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 81 . . 500 



<220> 

<221> sig_peptide 

<222> 81. .200 

<223> Von Heijne matrix 

score 5.19999980926514 
seq LLMRALHLLKSGC/SP 



<400> 407 

aaccccaccg cctaactctc ccaactctgc agtccgtgac cttcaggtgc ccttcaaggt 60 
cagaaaaact eggcaagaat atg gtt tct agt ttt agg gtt tct gaa eta caa 113 

Met Val Ser Ser Phe Arg Val Ser Glu Leu Gin 
-40 -35 -30 

gta tta eta ggc ttt get gga egg aat aaa agt gga cgc aag cat gac 161 
Val Leu Leu Gly Phe Ala Gly Arg Asn Lys Ser Gly Arg Lys His Asp 
-25 -20 -15 
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etc ctg atg agg gcg ctg cat tta ttg aag age ggc tgc age cct gcg 209 
Leu Leu Met Arg Ala Leu His Leu Leu Lys Ser Gly Cys Ser Pro Ala 

-10 -5 1 

gtt cag att aaa ate cga gaa ttg tat aga cgc cga tat cca cga act 257 
Val Gin lie Lys lie Arg Glu Leu Tyr Arg Arg Arg Tyr Pro Arg Thr 

5 10 15 

ctt gaa gga ctt tct gat tta tec aca ate aaa tea teg gtt ttc agt 305 
Leu Glu Gly Leu Ser Asp Leu Ser Thr lie Lys Ser Ser Val Phe Ser 
20 25 30 35 

ttg gat ggt ggc tea tea cct gta gaa cct gac ttg gee gtg get gga 353 
Leu Asp Gly Gly Ser Ser Pro Val Glu Pro Asp Leu Ala Val Ala Gly 

40 45 50 

ate cac teg ttg cct tec act tea gtt aca cct cac tea cca tec tct 401 
lie His Ser Leu Pro Ser Thr Ser Val Thr Pro His Ser Pro Ser Ser 

55 60 65 

cct gtt ggt tct gtg ctg ctt caa gat act aag ccc aca ttt gag atg 44 9 

Pro Val Gly Ser Val Leu Leu Gin Asp Thr Lys Pro Thr Phe Glu Met 

70 75 80 

cag cag cca tct ccc cca att cct cct gtc cat cct gat gtg cag tta 4 97 

Gin Gin Pro Ser Pro Pro lie Pro Pro Val His Pro Asp Val Gin Leu 

85 90 95 

aaa 500 
Lys 
100 

<210> 408 
<211> 497 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 35. .496 

<400> 408 

tcctaatcga aawctcttct gaggatcegg caag atg gca gaa gta gag cag aag 55 

Met Ala Glu Val Glu Gin Lys 
1 5 

aag aag egg acc ttc cgc aag ttc ace tac cgc ggc gtg gac etc gac 103 

Lys Lys Arg Thr Phe Arg Lys Phe Thr Tyr Arg Gly Val Asp Leu Asp 

10 " 15 20 

cag ctg ctg gac atg tec tac gag cag ctg atg cag ctg tac agt gcg 151 

Gin Leu Leu Asp Met Ser Tyr Glu Gin Leu Met Gin Leu Tyr Ser Ala 

25 30 35 

cgc cag gcg gcg get gaa ccg ggg cct gcg gcg gaa gca gca etc cct 199 

Arg Gin Ala Ala Ala Glu Pro Gly Pro Ala Ala Glu Ala Ala Leu Pro 
40 45 50 55 

get gaa gcg cct gcg caa ggc caa gaa gga ggc gee gee cat gga gaa 247 

Ala Glu Ala Pro Ala Gin Gly Gin Glu Gly Gly Ala Ala His Gly Glu 

60 65 70 

gee gga agt ggt gaa gac gca cct gcg gga cat gat cat cct acc cga 295 

Ala Gly Ser Gly Glu Asp Ala Pro Ala Gly His Asp His Pro Thr Arg 

7 5 8 0 8 5 

gat ggt ggg cag cat ggt ggg cgt eta caa egg caa gac ctt caa cca 34 3 

Asp Gly Gly Gin His Gly Gly Arg Leu Gin Arg Gin Asp Leu Gin Pro 

90 95 100 

ggt gga gat caa gee cga gat gat egg cca eta cct ggg cga gtt etc 391 

Gly Gly Asp Gin Ala Arg Asp Asp Arg Pro Leu Pro Gly Arg Val Leu 

105 110 115 

cat cac eta caa gee cgt aaa gca tgg ccg gee egg cat egg ggc mmc 439 

His His Leu Gin Ala Arg Lys Ala Trp Pro Ala Arg His Arg Gly Xaa 
120 125 130 135 

cma mtc etc csg ctt cat ccm tct caa gta atg get cag eta ata aag 487 
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Xaa Xaa Leu Xaa Leu His Pro Ser Gin Val Met Ala Gin Leu lie Lys 

140 145 150 

gcg cac atg a 497 
Ala His Met 



<210> 409 
<211> 304 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 110. .304 



<220> 

<221> misc_f eature 
<222> 296. .297 
<223> n=a, g, c or t 



<400> 409 

acgtacggca gggcggggtc ggcccgcagt ggccagctgg gagctacgag cgcggasttg 60 
cgcagaagac ccccatcagg gtgcggggtg cagttgcggc tccagggcc atg gcg gag 118 

Met Ala Glu 
1 

gag cag ggc egg gaa egg gac teg gtt ccc aag ccg teg gtg ctg ttc 166 
Glu Gin Gly Arg Glu Arg Asp Ser Val Pro Lys Pro Ser Val Leu Phe 

5 ~ " " 10 15 

etc cac cca gac ctg ggc gtg ggc ggc get gag egg ctg gtg etc ttg 214 
Leu His Pro Asp Leu Gly Val Gly Gly Ala Glu Arg Leu Val Leu Leu 
20 25 30 35 

cct ttt ccc act gag aga agg ctg etc ttt tgt act gee ccc cgc tea 262 
Pro Phe Pro Thr Glu Arg Arg Leu Leu Phe Cys Thr Ala Pro Arg Ser 

40 45 50 

tta aac age etc ccc eta aaa aaa aaa aaa att nna aaa age 304 
Leu Asn Ser Leu Pro Leu Lys Lys Lys Lys lie Xaa Lys Ser 
55 ~ 60 ~ 65 



<210> 410 
<211> 553 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 155. .553 



<40.0> 410 

aatagtgagc ggcagacaga ggggtggcca gtttgttgtt ttacctaaaa ggtgttttcc 60 
acatggagtg ggactgagag gcacaggtgg ctgaggggac ccgcctggga cgtgaggcgc 120 
aggtcctegg ccccttcatc tgtcagctgc agca atg gaa tac gtg ctg gaa gtg 175 

Met Glu Tyr Val Leu Glu Val 



aag 


aac 


tct 


ccg 


egg 


cac 


etc 


ctg 


aag 


1 

caa 


ttc 


aca 


gtg 


5 

tgt 


gac 


gtt 


223 


Lys 


Asn 


Ser 


Pro 


Arg 


His 


Leu 


Leu 


Lys 


Gin 


Phe 


Thr 


Val 


Cys 


Asp 


Val 








10 










15 










20 










cct 


ctg 


tat 


gac 


att 


tgt 


gac 


tac 


aac 


gtc 


tec 


aga 


gac 


cga 


tgc 


cag 


271 


Pro 


Leu 


Tyr 


Asp 


He 


Cys 


Asp 


Tyr 


Asn 


Val 


Ser 


Arg 


Asp 


Arg 


Cys 


Gin 






25 










30 










35 












gag 


etc 


ggg 


tgc 


tgc 


ttc 


tac 


gaa 


ggc 


gtc 


tgc 


tac 


aag 


aaa 


gcg 


gtt 


319 


Glu 


Leu 


Gly 


Cys 


Cys 


Phe 


Tyr 


Glu 


Gly 


Val 


Cys 


Tyr 


Lys 


Lys 


Ala 


Val 




40 










45 










50 










55 




ccc 


att 


tac 


ate 


cac 


gtg 


ttc 


tct 


gee 


ttg 


att 


gtg 


ate 


ata 


get 


ggg 


367 


Pro 


He 


Tyr 


He 


His 


Val 


Phe 


Ser 


Ala 


Leu 


He 


Val 


He 


He 


Ala 


Gly 
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60 










65 










70 






gcc 


ttc 


gtc 


ate 


acc 


ate 


ate 


tac 


aga 


gtc 


att 


cag 


gag 


age 


agg 


aaa 


415 


Ala 


Phe 


Val 


He 


Thr 


He 


He 


Tyr 


Arg 


Val 


He 


Gin 


Glu 


Ser 


Arg 


Lys 










75 










80 










85 








gaa 


aag 


gcc 


ate 


cct 


gtg 


tat 


gtc 


gcg 


ctg 


cca 


cag 


aag 


tec 


age 


gaa 


463 


Glu 


Lys 


Ala 


He 


Pro 


Val 


Tyr 


Val 


Ala 


Leu 


Pro 


Gin 


Lys 


Ser 


Ser 


Glu 








90 










95 










100 










aag 


gcg 


gag 


ttg 


gcc 


tea 


tec 


age 


age 


aag 


tta 


ggg 


ctg 


aag 


ctg 


cga 


511 


Lys 


Ala 


Glu 


Leu 


Ala 


Ser 


Ser 


Ser 


Ser 


Lys 


Leu 


Gly 


Leu 


Lys 


Leu 


Arg 






105 










110 










115 












gtc 


ctg 


ggc 


etc 


aaa 


gtg 


ctg 


ggc 


etc 


gat 


gag 


agt 


gac 


gag 






553 


Val 


Leu 


Gly 


Leu 


Lys 


Val 


Leu 


Gly 


Leu 


Asp 


Glu 


Ser 


Asp 


Glu 









120 125 130 

<210> 411 
<211> 470 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 23. . 469 

<400> 411 

gggttcccgt tccccgcgga sc atg egg tac aac gag aag gag ctg cag get 52 

Met Arg Tyr Asn Glu Lys Glu Leu Gin Ala 
15 10 



ctg 


tec 


egg 


cag 


ccg 


gcc 


gag 


atg 


gcg 


gcc 


gag 


ctg 


ggc 


atg 


agg 


ggc 


100 


Leu 


Ser 


Arg 


Gin 


Pro 


Ala 


Glu 


Met 


Ala 


Ala 


Glu 


Leu 


Gly 


Met 


Arg 


Gly 












15 










20 










25 






ccc 


aag 


aag 


ggc 


age 


gtg 


ctg 


aag 


egg 


egg 


ctg 


gtg 


aag 


ctg 


gtg 


gtg 


148 


Pro 


Lys 


Lys 


Gly 


Ser 


Val 


Leu 


Lys 


Arg 


Arg 


Leu 


Val 


Lys 


Leu 


Val 


Val 










30 










35 










40 








aat 


ttc 


etc 


ttc 


tac 


ttt 


egg 


aca 


gac 


gag 


gcc 


gag 


ccc 


gtc 


gga 


gcc 


196 


Asn 


Phe 


Leu 


Phe 


Tyr 


Phe 


Arg 


Thr 


Asp 


Glu 


Ala 


Glu 


Pro 


Val 


Gly 


Ala 








45 










50 










55 










ctg 


ctg 


ctg 


gag 


cgc 


tgc 


aga 


gtc 


gtc 


egg 


gaa 


gag 


ccc 


ggc 


acc 


ttc 


244 


Leu 


Leu 


Leu 


Glu 


Arg 


Cys 


Arg 


Val 


Val 


Arg 


Glu 


Glu 


Pro 


Gly 


Thr 


Phe 






60 










65 










70 












tec 


ate 


age 


ttc 


att 


gag 


gac 


cct 


gag 


agg 


aag 


tat 


cac 


ttt 


gag 


tgc 


292 


Ser 


He 


Ser 


Phe 


He 


Glu 


Asp 


Pro 


Glu 


Arg 


Lys 


Tyr 


His 


Phe 


Glu 


Cys 




75 










80 










85 










90 




age 


age 


gag 


gag 


cag 


tgt 


cag 


gag 


tgg 


atg 


gag 


get 


ctg 


cgt 


egg 


gcc 


340 


Ser 


Ser 


Glu 


Glu 


Gin 


Cys 


Gin 


Glu 


Trp 


Met 


Glu 


Ala 


Leu 


Arg 


Arg 


Ala 












95 










100 










105 






age 


tac 


gag 


ttc 


atg 


egg 


aga 


age 


etc 


ate 


ttc 


tac 


agg 


aac 


gaa 


ate 


388 


Ser 


Tyr 


Glu 


Phe 


Met 


Arg 


Arg 


Ser 


Leu 


He 


Phe 


Tyr 


Arg 


Asn 


Glu 


He 










110 










115 










120 








egg 


aag 


gtg 


acg 


ggc 


aag 


gac 


ccc 


ctg 


gaa 


cag 


ttc 


ggc 


ata 


tec 


gag 


436 


Arg 


Lys 


Val 


Thr 


Gly 


Lys 


Asp 


Pro 


Leu 


Glu 


Gin 


Phe 


Gly 


He 


Ser 


Glu 








125 










130 










135 










gag 


gcc 


agg 


ttc 


cag 


ctg 


agt 


ggc 


ttg 


cag 


gcg 


t 










470 


Glu 


Ala 


Arg 


Phe 


Gin 


Leu 


Ser 


Gly 


Leu 


Gin 


Ala 















140 145 



<210> 412 
<211> 486 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 211. .486 
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<400> 412 

agattcttaa ctacgtgtct tacagcaaag ttgcccatgg gaacaaaggc tgtagggtgt 60 
cattgatctt gtaatacaac acatatttga tgggtgaatc ttattagcac tcctttgaaa 120 
atgttccagt atggagtcag ttgctgataa agaggaagtt cacagctagg aatccgaagt 180 
taagatttct caattgccgc ctgggcactc atg cct gta gtc cca gca ctt ggg 234 

Met Pro Val Val Pro Ala Leu Gly 

1 5 



agg 


cca 


agg 


tgg 


gca 


gat 


cac 


ctg 


agg 


teg 


gga 


gtt 


egg 


gac 


cag 


cct 


282 


Arg 


Pro 


Arg 


Trp 


Ala 


Asp 


His 


Leu 


Arg 


Ser 


Gly 


Val 


Arg 


Asp 


Gin 


Pro 






10 










15 










20 












ggc 


caa 


cct 


ggt 


gaa 


gcc 


ccc 


cca 


tct 


eta 


eta 


aaa 


ata 


caa 


aaa 


ttg 


330 


Gly 


Gin 


Pro 


Gly 


Glu 


Ala 


Pro 


Pro 


Ser 


Leu 


Leu 


Lys 


He 


Gin 


Lys 


Leu 




25 










30 










35 










40 




gcc 


ggg 


tat 


ggt 


ggt 


ggc 


tgc 


ctg 


tgg 


tec 


cag 


eta 


etc 


ggg 


agg 


ctg 


378 


Ala 


Gly 


Tyr 


Gly 


Gly 


Gly 


Cys 


Leu 


Trp 


Ser 


Gin 


Leu 


Leu 


Gly 


Arg 


Leu 












45 










50 










55 






agg 


egg 


gag 


aat 


cac 


ttg 


age 


ccg 


gga 


ggc 


gga 


ggt 


tgc 


agt 


gag 


ccg 


426 


Arg 


Arg 


Glu 


Asn 


His 


Leu 


Ser 


Pro 


Gly 


Gly Gly Gly 


Cys 


Ser 


Glu 


Pro 










60 










65 










70 








aga 


ttg 


tgc 


cac 


tgc 


act 


cca 


gcc 


tgg 


gtg 


aca 


gag 


caa 


gac 


tec 


ate 


474 


Arg 


Leu 


Cys 


His 


Cys 


Thr 


Pro 


Ala 


Trp 


Val 


Thr 


Glu 


Gin 


Asp 


Ser 


He 





75 80 85 



tea aaa ata gaa 486 
Ser Lys He Glu 
90 



<210> 413 
<211> 335 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 73. .252 



<220> 

<221> sig_peptide 

<222> 73. .135 

<223> Von Heijne matrix 

score 4.30000019073486 
seq AGVSCTCAXPASA/KS 



<400> 413 

actccgcctt ccacgtgcac ccactgcctc ttcccttctc gcttgggaac tctagtctcg 60 
cctcgggttg ca atg gac ccc aac tgc tec tgt gcc get ggt gtc tec tgc 111 
Met Asp Pro Asn Cys Ser Cys Ala Ala Gly Val Ser Cys 
-20 -15 -10 

acc tgc gcc ast cct gca agt gca aag agt gca aat gca cct cct gca 159 
Thr Cys Ala Xaa Pro Ala Ser Ala Lys Ser Ala Asn Ala Pro Pro Ala 

-5 15 
aga aga get get get cct ggt gca ctg gcc cca ccc teg tgg aca cct 207 
Arg Arg Ala Ala Ala Pro Gly Ala Leu Ala Pro Pro Ser Trp Thr Pro 

10 15 20 

gcc ctg ccc tgc cac ctg tct gtc tgt ccc aaa gaa gtt ctg gta 252 
Ala Leu Pro Cys His Leu Ser Val Cys Pro Lys Glu Val Leu Val 
25 30 35 

tgaacttgag gacacatgtc cagtgggagg tgagaccacc tctcaatatt caataaagct 312 
gctgagaatc tagectcgaa aaa 335 



<210> 414 
<211> 494 
<212> DNA 



WO 02/083898 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 63. . 422 

<220> 

<221> sig_peptide 
<222> 63. .167 
<223> Von Heijne matrix 

score 4.09999990463257 

seq LSQSLAMSPQTHS/QT 

<400> 414 

cacggatctc acaggtggag gtggggcgag cttctttcta gtgtggacca gaggtggggg 60 
ca atg agt aaa cat ctt cac tgt aag gtt ttg gga ctg ccc etc agg 107 
Met Ser Lys His Leu His Cys Lys Val Leu Gly Leu Pro Leu Arg 





■35 










-30 










-25 












ctt 


gag 


gtc 


teg 


gca 


teg 


tgt 


etc 


agt 


cag 


tec 


ttg 


gcg 


atg 


tec 


cct 


155 


Leu 


Glu 


Val 


Ser 


Ala 


Ser 


Cys 


Leu 


Ser 


Gin 


Ser 


Leu 


Ala 


Met 


Ser 


Pro 




-20 










-15 










-10 










-5 




cag 


acg 


cac 


tea 


cag 


act 


tgc 


ata 


cgt 


aat 


tta 


gta 


aca 


tgc 


ate 


aac 


203 


Gin 


Thr 


His 


Ser 


Gin 
1 


Thr 


Cys 


He 


Arg 
5 


Asn 


Leu 


Val 


Thr 


Cys 
10 


He 


Asn 




tat 


cct 


aga 


aca 


tct 


aca 


ggg 


tgc 


aaa 


gga 


acc 


acc 


act 


cag 


aga 


ate 


251 


Tyr 


Pro 


Arg 
15 


Thr 


Ser 


Thr 


Gly 


Cys 
20 


Lys 


Gly 


Thr 


Thr 


Thr 
25 


Gin 


Arg 


He 




atg 


gag 


cca 


gtg 


gag 


tta 


gaa 


gtg 


gaa 


ggg 


aca 


gaa 


caa 


gac 


aat 


get 


299 


Met 


Glu 
30 


Pro 


Val 


Glu 


Leu 


Glu 

35 


Val 


Glu 


Gly 


Thr 


Glu 
40 


Gin 


Asp 


Asn 


Ala 




aaa 


acc 


tgt 


ggt 


tea 


eta 


gga 


agg 


ggg 


aat 


gaa 


aac 


acc 


atg 


etc 


cga 


347 


Lys 


Thr 


Cys 


Gly 


Ser 


Leu 


Gly 


Arg 


Gly 


Asn 


Glu 


Asn 


Thr 


Met 


Leu 


Arg 




45 










50 










55 










60 




ggt 


gga 


ttc 


age 


atg 


aac 


aca 


act 


gtg 


ggg 


caa 


gga 


att 


tec 


aag 


caa 


395 


Gly 


Gly 


Phe 


Ser 


Met 
65 


Asn 


Thr 


Thr 


Val 


Gly 
70 


Gin 


Gly 


He 


Ser 


Lys 
75 


Gin 




aca 


cac 


cac 


act 


agt 


acc 


act 


tct 


tec 


taaacagaaa agggaaagag 




442 


Thr 


His 


His 


Thr 
80 


Ser 


Thr 


Thr 


Ser 


Ser 
85 



















catccttcga cttccccctc cctccatcag caccgggttt tcctccaaaa aa 494 



<210> 415 
<211> 366 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 112. .306 

<220> 

<221> sig_peptide 

<222> 112. . 153 

<223> Von Heijne matrix 

score 3.70000004768372 

seq LCWLKLVSITTVA/II 

<400> 415 

cctttctcgt tccccggcca tettategge tgctgttggt tgggggccgt cccgctccta 60 
aggcaggaag atggtggccg caaagaagac ggaaatctga aatagagtac t atg eta 117 

Met Leu 

tgt tgg eta aaa ctg gtg tec ate act aca gtg gca ata ata ttg aac 165 
Cys Trp Leu Lys Leu Val Ser He Thr Thr Val Ala He He Leu Asn 
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-10 -5 1 

tgg gca cag cat gcg gaa aat act aca gag tgt gca cac tgg eta tea 213 

Trp Ala Gin His Ala Glu Asn Thr Thr Glu Cys Ala His Trp Leu Ser 

5 10 15 20 

ttg ate cag gtg act ctg aca tea tta gaa gca tgc cag aac aga ctg 261 

Leu lie Gin Val Thr Leu Thr Ser Leu Glu Ala Cys Gin Asn Arg Leu 

25 30 35 

gtg aaa agt aaa cct ttt cac eta caa aat ttc ace tgc aaa cct 306 

Val Lys Ser Lys Pro Phe His Leu Gin Asn Phe Thr Cys Lys Pro 

40 45 50 

taaacctgea aaattttcct ttaataaaat ttgcttgttc taaaaaaaaa aaaaaaaaag 366 



<210> 416 
<211> 472 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 115. .411 



<400> 416 

aagaaagggg tgaggcctaa gggacaatca ggatgttttt cagagagaag tgtggatgct 60 
ggacaggaag aaccacagat accagatacg ggtactgttg taactctgtt ctcc atg 117 

Met 
1 

aaa aaa aag gaa gaa aca aca ctt tea gag atg gag cct gtt gag cca 165 
Lys Lys Lys Glu Glu Thr Thr Leu Ser Glu Met Glu Pro Val Glu Pro 

5 10 15 

cag tac caa eta gtc aat get gaa teg act tct ccc ttt eta cat tgc 213 
Gin Tyr Gin Leu Val Asn Ala Glu Ser Thr Ser Pro Phe Leu His Cys 

20 25 30 

ctg aga gaa gtc att ggg gaa tac tct gta cac gaa ttt tea ctg ttg 261 
Leu Arg Glu Val lie Gly Glu Tyr Ser Val His Glu Phe Ser Leu Leu 

35 40 45 

ggg aaa aca gag agt caa ggg att gga ttg tgg att gca ttg gtg gtt 309 
Gly Lys Thr Glu Ser Gin Gly lie Gly Leu Trp lie Ala Leu Val Val 
50 55 60 65 

ttc etc agt ttc etc ate ttc tec aca agt ttc tac ata teg aat gca 357 
Phe Leu Ser Phe Leu lie Phe Ser Thr Ser Phe Tyr lie Ser Asn Ala 

70 75 80 

gag cag ccc ttc ttc aaa gaa cct cct acg gaa get get aag gaa etc 405 
Glu Gin Pro Phe Phe Lys Glu Pro Pro Thr Glu Ala Ala Lys Glu Leu 

85 90 95 

agt ctg tagctctgeg tggagccatg tgtaaacact gaactgagac ctgccacctc 461 
Ser Leu 

ctactaccta a 472 



<210> 417 
<211> 532 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 125. .508 



<400> 417 

agctrcctgg gtaaggecca agatggctgt cttcgcctta gtactcgtgt gaagttggcg 60 
gggacggttc ctgtcatctt cttgggctta tttggtgtgc tgttgaaggg gggagactag 120 
agaa atg gca ggg aac etc tta tec ggg gca ggt agg cgc ctg tgg gac 169 
Met Ala Gly Asn Leu Leu Ser Gly Ala Gly Arg Arg Leu Trp Asp 
15 10 15 
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tgg 


gtg 


cct 


ctg 


gcg 


tgc 


aga 


age 


ttc 


tct 


ctt 


ggt 


gtg 


cct 


aga 


ttg 


217 


Trp 


Val 


Pro 


Leu 


Ala 
20 


Cys 


Arg 


Ser 


Phe 


Ser 
25 


Leu 


Gly 


Val 


Pro 


Arg 
30 


Leu 




ate 


ggt 


ata 


agg 


etc 


act 


etc 


ccg 


ccc 


ccc 


aaa 


gtg 


gtt 


gat 


cgt 


tgg 


265 


He 


Gly 


He 


Arg 
35 


Leu 


Thr 


Leu 


Pro 


Pro 
40 


Pro 


Lys 


Val 


Val 


Asp 
45 


Arg 


Trp 




aac 


gag 


aaa 


agg 


gee 


atg 


ttc 


gga 


gtg 


tat 


gac 


aac 


ate 


ggg 


ate 


ctg 


313 


Asn 


Glu 


Lys 
50 


Arg 


Ala 


Met 


Phe 


Gly 
55 


Val 


Tyr 


Asp 


Asn 


He 
60 


Gly 


He 


Leu 




gga 


aac 


ttt 


gaa 


aag 


cac 


mcc 


aaa 


gaa 


ctg 


ate 


agg 


ggg 


ccc 


ata 


tgg 


361 


Gly Asn 


Phe 


Glu 


Lys 


His 


Xaa 


Lys 


Glu 


Leu 


He 


Arg 


Gly 


Pro 


He 


Trp 






65 










70 










75 












ctt 


cga 


ggt 


tgg 


aaa 


ggg 


aat 


gaa 


ttg 


caa 


cgt 


tgt 


ate 


cga 


aag 


agg 


409 


Leu 


Arg 


Gly 


Trp 


Lys 


Gly 


Asn 


Glu 


Leu 


Gin 


Arg 


Cys 


He 


Arg 


Lys 


Arg 




80 










85 










90 










95 




aaa 


atg 


gtt 


gga 


agt 


aga 


atg 


ttc 


get 


gat 


gac 


ctg 


cac 


aac 


ctt 


aat 


457 


Lys 


Met 


Val 


Gly 


Ser 
100 


Arg 


Met 


Phe 


Ala 


Asp 
105 


Asp 


Leu 


His 


Asn 


Leu 
110 


Asn 




aaa 


cgc 


ate 


cgc 


tat 


etc 


tac 


aaa 


cac 


ttt 


aac 


cga 


cat 


ggg 


aag 


ttt 


505 


Lys 


Arg 


He 


Arg 
115 


Tyr 


Leu 


Tyr 


Lys 


His 
120 


Phe 


Asn 


Arg 


His 


Gly 
125 


Lys 


Phe 





cga tagaagagaa agctgagaac ttcg 532 
Arg 



<210> 418 
<211> 470 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 50. .205 



<400> 418 

actctttcct tggctgetea ggtcataggt gctgtgatct agagacaaa atg ttc ctt 58 

Met Phe Leu 
1 

aca ctg gca gat act tgc aaa eta agg gga atg age ttc ctt ctg aat 106 
Thr Leu Ala Asp Thr Cys Lys Leu Arg Gly Met Ser Phe Leu Leu Asn 

5 * 10 15 

gtt tat gaa gga gag gee act gtg tea tct gtc tta gag eta ttg gaa 154 
Val Tyr Glu Gly Glu Ala Thr Val Ser Ser Val Leu Glu Leu Leu Glu 
20 25 30 35 

tec tgg ate att gtg gga aat gaa aga tac ttc gat gga ate age age 202 
Ser Trp He He Val Gly Asn Glu Arg Tyr Phe Asp Gly He Ser Ser 

4 0 4 5 50 

cat tgatccaatg ccaactccaa gactggaacg tegcaatgat agttccaagg 255 
His 

eggaaatttg aegtaattet tttcgacaca gttttacagg tetggatgee cattttaatt 315 
cttctgaaag catgcctyct ccttctggct tcaggactcc atctscagcc tcttgatcta 375 
aaaataatcc ccaaaccaaa aaaaaattag atactatttc ctcaaaatta ggtattttaa 435 
tcaaaaacat cttaacataa atacattatt atcac 470 



<210> 419 
<211> 559 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 175. . 426 



<400> 419 



WO 02/083898 

actcggcggg cgctgttgag 
cgcggacgcc ggcgctgcca 



ggagtcgggc 
acggaagggc 



ggccaggccc atttgagatc tttgaagata 



241 

cgcgactgtg 
gggtagggcg 
tcctcaacgt 
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gtcgttttta taccttcccg 60 

agacggagtt tcgtcatgtt 120 

gaggctctgc tgcc atg 177 
Met 



aag 


gtg 


aag 


att 


aag 


tgc 


tgg 


aac 


ggc 


gtg 


gcc 


act 


tgg 


etc 


tgg 


gtg 


225 


Lys 


Val 


Lys 


He 
5 


Lys 


Cys 


Trp 


Asn 


Gly 
10 


Val 


Ala 


Thr 


Trp 


Leu 
15 


Trp 


Val 




gcc 


aac 


gat 


gag 


aac 


tgt 


ggc 


ate 


tgc 


agg 


atg 


gca 


ttt 


aac 


gga 


tgc 


273 


Ala 


Asn 


Asp 
20 


Glu 


Asn 


Cys 


Gly 


He 
25 


Cys 


Arg 


Met 


Ala 


Phe 
30 


Asn 


Gly 


Cys 




tgc 


cct 


gac 


tgc 


aag 


gtg 


ccc 


ggc 


gac 


gac 


tgc 


ccg 


ctg 


gtg 


tgg 


ggc 


321 


Cys 


Pro 
35 


Asp 


Cys 


Lys 


Val 


Pro 
40 


Gly 


Asp 


Asp 


Cys 


Pro 
45 


Leu 


Val 


Trp 


Gly 




cag 


tgc 


tec 


cac 


tgc 


ttc 


cac 


atg 


cat 


tgc 


ate 


etc 


aag 


tgg 


ctg 


cac 


369 


Gin 


Cys 


Ser 


His 


Cys 


Phe 


His 


Met 


His 


Cys 


He 


Leu 


Lys 


Trp 


Leu 


His 




50 










55 










60 










65 




gca 


cag 


cag 


gtg 


cag 


cag 


cac 


tgc 


ccc 


atg 


tgc 


cgc 


cag 


gaa 


tgg 


aag 


417 


Ala 


Gin 


Gin 


Val 


Gin 
70 


Gin 


His 


Cys 


Pro 


Met 
75 


Cys 


Arg 


Gin 


Glu 


Trp 

80 


Lys 





ttc aag gag tgaggecega cctggctctc gctggagggg catcctgaga 4 66 
Phe Lys Glu 

ctccttcctc atgctggcgc cgatggctgc tggggacagc gcccctgagc tgcaacaagg 526 

tggawacaag ggctggagct gcgtttgttt tgc 559 



<210> 420 
<211> 462 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 29. .268 



<400> 420 

atccagcagc cccctgctcc ggcccagc atg gcg ace ccg ace cag acc ccc 52 

Met Ala Thr Pro Thr Gin Thr Pro 
1 5 



aca 


aag 


get 


cct 


gag 


gaa 


cct 


gac 


cca 


ttt 


tac 


tat 


gac 


tac 


aac 


acg 


100 


Thr 


Lys 
10 


Ala 


Pro 


Glu 


Glu 


Pro 
15 


Asp 


Pro 


Phe 


Tyr 


Tyr 
20 


Asp 


Tyr 


Asn 


Thr 




gtg 


cag 


act 


gtg 


ggc 


atg 


act 


ctg 


gca 


acc 


ate 


ttg 


ttc 


ctg 


ctg 


ggt 


148 


Val 


Gin 


Thr 


Val 


Gly 


Met 


Thr 


Leu 


Ala 


Thr 


He 


Leu 


Phe 


Leu 


Leu 


Gly 




25 










30 










35 










40 




ate 


etc 


ate 


gtc 


ate 


age 


aag 


aag 


gtg 


aag 


tgc 


agg 


aag 


gcg 


gac 


tec 


196 


He 


Leu 


He 


Val 


He 
45 


Ser 


Lys 


Lys 


Val 


Lys 
50 


Cys 


Arg 


Lys 


Ala 


Asp 
55 


Ser 




agg 


tct 


gag 


age 


cca 


acc 


tgc 


aaa 


tec 


tgt 


aag 


tct 


gag 


ctt 


ccc 


tct 


244 


Arg 


Ser 


Glu 


Ser 
60 


Pro 


Thr 


Cys 


Lys 


Ser 
65 


Cys 


Lys 


Ser 


Glu 


Leu 
70 


Pro 


Ser 




tea 


gcc 


cct 


ggt 


ggc 


ggc 


ggc 


gtg 


taacaccttc < 


ccgaggaaac tccgctgccg 


298 


Ser 


Ala 


Pro 
75 


Gly 


Gly 


Gly 


Gly 


Val 
80 





















accctgcctg agegegggag cctgaggacc gggtggaggc ggtggggacc cagccgcgcg 358 
ccgggagcgc tccccggaat gagcgcccca cccaccccaa ggctggagcc gctgcaccct 418 
gctgtccctc tccaggcctt ggcaatgacg atcccccaaa gage 4 62 



<210> 421 
<211> 547 
<212> DNA 
<213> Homo sapiens 



<220> 
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<221> CDS 
<222> 99. .479 



<400> 421 

tttcgaaggg aagggagccc cctataaaac agcctacagt ggacagtctg gtcggcagag 60 
ccgcaggtca gtcgtgaaga gggagctcta ttgccacc atg agt ttc tec ggc aag 116 

Met Ser Phe Ser Gly Lys 
1 5 



tac 


caa 


ctg 


cag 


age 


cag 


gaa 


aac 


ttt 


gaa 


gee 


ttc 


atg 


aag 


gca 


ate 


164 


Tyr 


Gin 


Leu 


Gin 


Ser 


Gin 


Glu 


Asn 


Phe 


Glu 


Ala 


Phe 


Met 


Lys 


Ala 


He 










10 










15 










20 








ggt 


ctg 


ccg 


gaa 


gag 


etc 


ate 


cag 


aag 


ggg 


aag 


gat 


ate 


aag 


ggg 


gtg 


212 


Gly 


Leu 


Pro 


Glu 


Glu 


Leu 


He 


Gin 


Lys 


Gly 


Lys 


Asp 


He 


Lys 


Gly 


Val 








25 










30 










35 










teg 


gaa 


ate 


gtg 


cag 


aat 


ggg 


aag 


cac 


ttc 


aag 


ttc 


acc 


ate 


acc 


get 


260 


Ser 


Glu 


He 


Val 


Gin 


Asn 


Gly 


Lys 


His 


Phe 


Lys 


Phe 


Thr 


He 


Thr 


Ala 






40 










45 










50 












ggg 


tec 


aaa 


gtg 


ate 


caa 


aac 


gaa 


ttc 


acg 


gtg 


ggg 


gag 


gaa 


tgt 


gag 


308 


Gly 


Ser 


Lys 


Val 


He 


Gin 


Asn 


Glu 


Phe 


Thr 


Val 


Gly 


Glu 


Glu 


Cys 


Glu 




55 










60 










65 










70 




ctg 


gag 


aca 


atg 


aca 


ggg 


gag 


aaa 


gtc 


aag 


aca 


gtg 


gtt 


cag 


ttg 


gaa 


356 


Leu 


Glu 


Thr 


Met 


Thr 


Gly 


Glu 


Lys 


Val 


Lys 


Thr 


Val 


Val 


Gin 


Leu 


Glu 












75 










80 










85 






ggt 


gac 


aat 


aaa 


ctg 


gtg 


aca 


act 


ttc 


aaa 


aac 


ate 


aag 


tct 


gtg 


acc 


404 


Gly 


Asp 


Asn 


Lys 


Leu 


Val 


Thr 


Thr 


Phe 


Lys 


Asn 


He 


Lys 


Ser 


Val 


Thr 










90 










95 










100 








gaa 


etc 


aac 


ggc 


gac 


ata 


ate 


ace 


aat 


acc 


atg 


aca 


ttg 


ggt 


gac 


att 


452 


Glu 


Leu 


Asn 


Gly 


Asp 


He 


He 


Thr 


Asn 


Thr 


Met 


Thr 


Leu 


Gly 


Asp 


He 








105 










110 










115 










gtc 


ttc 


aag 


aga 


ate 


age 


aag 


aga 


att 


taaacaagtc tgcatttcat 




499 


Val 


Phe 


Lys 


Arg 


He 


Ser 


Lys 


Arg 


He 



















120 125 
attattttag tgtgtaaaat taatgtaata aagtgaactt tgttttaa 547 



<210> 422 
<211> 669 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 29. .259 



<400> 422 

gaegcakage cgtggggaag ttttcgea atg gee gee gga acg teg ccg gee 52 

Met Ala Ala Gly Thr Ser Pro Ala 
1 5 



gat 


get 


etc 


tgc 


gac 


gga 


aag 


aaa 


gee 


gee 


gga 


ccc 


act 


tec 


ggt 


gcg 


100 


Asp 


Ala 


Leu 


Cys 


Asp 


Gly 


Lys 


Lys 


Ala 


Ala 


Gly 


Pro 


Thr 


Ser 


Gly 


Ala 






10 










15 










20 












ttt 


tac 


aat 


gaa 


etc 


egg 


gca 


etc 


gag 


teg 


etc 


ccc 


gga 


aac 


tat 


agt 


148 


Phe 


Tyr 


Asn 


Glu 


Leu 


Arg 


Ala 


Leu 


Glu 


Ser 


Leu 


Pro 


Gly 


Asn 


Tyr 


Ser 




25 










30 










35 










40 




tct 


get 


tec 


acc 


gtt 


gtc 


eta 


ctt 


tgg 


ata 


gca 


tea 


aga 


caa 


aaa 


tct 


196 


Ser 


Ala 


Ser 


Thr 


Val 


Val 


Leu 


Leu 


Trp 


He 


Ala 


Ser 


Arg 


Gin 


Lys 


Ser 












45 










50 










55 






cag 


tct 


teg 


agg 


aac 


tea 


cag 


tgt 


age 


aga 


aaa 


gac 


aaa 


gga 


ggt 


aaa 


244 


Gin 


Ser 


Ser 


Arg 


Asn 


Ser 


Gin 


Cys 


Ser 


Arg 


Lys 


Asp 


Lys 


Gly 


Gly 


Lys 





60 65 70 



cag gag gag cat gaa taattcatct tgatgtaatc atgeaagact cctaagaaga 299 
Gin Glu Glu His Glu 
75 

tgttaggaga ccctgtcatc attcttttgg cctctaatgt ccgttatctt catgtttgct 359 



WO 02/083898 



PCT/IB01/00914 



ggcttccttt 
aaacatcgtc 
acaaggtgca 
ccagtccagg 
acaagttcct 
tctttggcaa 



tactactatg 
agcagcatca 
cttgcctgtc 
agcattgcag 
tcaagtgagt 



gaaatacaac 
tcatcattaa 
atccttccat 
agaatgctag 
tcatcatgat 



243 

atatgcaaaa 
ctgcaccagg 
ctctttcttc 
gtgtgtctgc 
ggggatgata 



gtagtgtttt 
aagccatgac 
ctccttgaag 
agacttgact 
tttgtttgag 



cgctgtttaa 
tcctttctgg 
ttcacaatac 
ttaaaagaaa 
agaatgtgtg 



419 
479 
539 
599 
659 
669 



<210> 423 
<211> 546 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 78. .302 



<400> 423 

ctygccagct gccggtcttt cgggggctcc gtaactttct atccgtccgc gtcascyttg 60 
ccaccctcat ctccaat atg cct ggt ccg acc ccc agt ggc act aac gtg 110 

Met Pro Gly Pro Thr Pro Ser Gly Thr Asn Val 













1 








5 








10 






gga 


tec 


tea 


ggg 


cgc 


tct 


ccc 


age 


aaa 


gca 


gtg 


gee 


gee 


egg 


gcg 


egg 


158 


Gly 


Ser 


Ser 


Gly 


Arg 


Ser 


Pro 


Ser 


Lys 


Ala 


Val 


Ala 


Ala 


Arg 


Ala 


Arg 










15 










20 










25 








gat 


cca 


ctg 


tec 


ggc 


aga 


gga 


aaa 


atg 


cca 


get 


gtg 


gga 


caa 


gga 


gtg 


206 


Asp 


Pro 


Leu 


Ser 


Gly 


Arg 


Gly 


Lys 


Met 


Pro 


Ala 


Val 


Gly 


Gin 


Gly 


Val 








30 










35 










40 










cag 


gec 


gca 


caa 


cct 


egg 


cag 


gca 


ccg 


ggg 


gga 


tgt 


ggc 


gat 


tct 


aca 


254 


Gin 


Ala 


Ala 


Gin 


Pro 


Arg 


Gin 


Ala 


Pro 


Gly 


Gly 


Cys 


Gly 


Asp 


Ser 


Thr 






45 










50 










55 












cag 


aag 


att 


cac 


ctg 


grc 


tea 


aag 


ttg 


gee 


ctg 


ttc 


cag 


tat 


tgg 


tta 


302 


Gin 


Lys 


He 


His 


Leu 


Xaa 


Ser 


Lys 


Leu 


Ala 


Leu 


Phe 


Gin 


Tyr 


Trp 


Leu 




60 










65 










70 










75 





tgagtcttct gttcatcget tctgtattta tgttgcacat ttggggcaag tacactegtt 362 

egtagattea gttacatcca tctgtcatct gaagaakgag gaaawaaccc aacatttctt 422 

ggacccaaaa gtatagtgac tatctgttca tgagagaaat tttctgtaag cttgctgttt 482 

ttaamcaggg gatttatcaa ttaattgatt tgaggaatca gtttttttct atggctaata 542 

aact 546 



<210> 424 
<211> 499 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 16. . 183 



<400> 424 

tttattttgc cttct atg tea tct tgg age tta atg tgc atg caa gat atg 51 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 
15 10 
gga aat act aaa gca aga eta etc tat gaa gee aat ctt cca gag aac 99 
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 

15 20 " 25 

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc ate aga gat 147 
Phe Arg Arg Pro Gin Thr Asp Gin Ala Val Glu Phe Phe He Arg Asp 

30 35 40 

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 193 
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro 
45 50 55 

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 253 
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313 
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agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 373 

tcagcaactg gagcctaaaa aaagtaccag ccctaaaaaa gctgcggasc cactgtggat 433 

cttttaggac ttgtttggag aaagattgag aaggaaaaga gtgaaaggca gttagttgaa 493 
gtggaa 4 99 

<210> 425 
<211> 512 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 122. .487 

<400> 425 

aaacacatcc aagcttaaga cggtgaggtc agcttcacat tctcaggaac tctccttctt 60 
tgggtctggc tgaagttgag gatctcttac tctctaggcc acggaattaa cccgagcagg 120 
c atg gag gcc tct get etc acc tea tea gca gtg ace agt gtg gee aaa 169 
Met Glu Ala Ser Ala Leu Thr Ser Ser Ala Val Thr Ser Val Ala Lys 
15 10 15 

gtg gtc agg gtg gcc tct ggc tct gcc gta gtt ttg ccc ctg gcc agg 217 
Val Val Arg Val Ala Ser Gly Ser Ala Val Val Leu Pro Leu Ala Arg 

20 25 30 

att get aca gtt gtg att gga gga gtt gtg gcc atg gcg get gtg ccc 265 
He Ala Thr Val Val He Gly Gly Val Val Ala Met Ala Ala Val Pro 

35 40 45 

atg gtg etc agt gcc atg ggc ttc act gcg gcg gga ate gcc teg tec 313 
Met Val Leu Ser Ala Met Gly Phe Thr Ala Ala Gly He Ala Ser Ser 

50 55 60 

tec ata gca gcc aag atg atg tec gcg gcg gcc att gcc aat ggg ggt 361 
Ser He Ala Ala Lys Met Met Ser Ala Ala Ala He Ala Asn Gly Gly 
65 70 75 80 

gga gtt gcc teg ggc age ctt gtg get act ctg cag tea ctg gga gca 409 
Gly Val Ala Ser Gly Ser Leu Val Ala Thr Leu Gin Ser Leu Gly Ala 

8 5 90 95 

act gga etc tec gga ttg acc aag ttc ate ctg ggc tec att ggg tct 457 
Thr Gly Leu Ser Gly Leu Thr Lys Phe He Leu Gly Ser He Gly Ser 

100 105 110 

gcc att gcg get gtc att gcg agg ttc tac tagctccctg ccctcgcctg 507 
Ala He Ala Ala Val He Ala Arg Phe Tyr 

115 120 
cagag 512 

<210> 426 
<211> 502 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 57 . . 278 

<400> 426 

attcaccggc gaegegatae ggttcctcca ccgaggccca tgegaagett tccact atg 59 

Met 
1 

get tec age act gtc ccg gtg age get get ggc teg get aat gaa act 107 
Ala Ser Ser Thr Val Pro Val Ser Ala Ala Gly Ser Ala Asn Glu Thr 

5 10 15 

ccc gaa ata ccg gac aac gtg gga gat tgg ctt egg ggc gtc tac cgc 155 
Pro Glu He Pro Asp Asn Val Gly Asp Trp Leu Arg Gly Val Tyr Arg 

20 25 30 

ttt gcc act gat agg aat gac ttc egg agg aac ttg ata eta aat ttg 203 
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Phe Ala Thr Asp Arg Asn Asp Phe Arg Arg Asn Leu lie Leu Asn Leu 

35 4 0 4 5 

gga etc ttt get gcg gga gtt tgg ctg gec agg aac ttg agt gac att 251 
Gly Leu Phe Ala Ala Gly Val Trp Leu Ala Arg Asn Leu Ser Asp lie 
50 55 60 65 

gac etc atg gca cct cag cca ggg gtg tagecaagta gttctaatgc 298 
Asp Leu Met Ala Pro Gin Pro Gly Val 
70 

cacctgtcgt cttatcatct gattgeagae aaatggaatc ctgtgctgaa cccgaatctt 358 
ccaaaaaaca gcctacaatc tgtgaccacc acaagatgtg ccctgatggc agctgaagtt 418 
tgattcagat gggcactttt cttccccttc cctgcctagt ttccttttgt tccttgagtc 478 
aegcagaatt ccattctctg gtca 502 



<210> 427 
<211> 575 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 50. .340 



<400> 427 

gaggagaaga ggtgaccccg ctaaagataa taaatcatca 



gca cct gag cct cca aca ttc aaa aag gaa cca 
Ala Pro Glu Pro Pro Thr Phe Lys Lys Glu Pro 

5 10 
ttt caa age cca ggg etc aga ggg gtg 
Phe Gin Ser Pro Gly Leu Arg Gly Val 
20 25 
get gtg aat cca gag etc ttc att aaa 
Ala Val Asn Pro Glu Leu Phe lie Lys 
40 

ttc gga ttg gta act ctt tea ctt tgc 
Phe Gly Leu Val Thr Leu Ser Leu Cys 

55 60 
cat gca ata caa gag aat aaa aag gac 
His Ala lie Gin Glu Asn Lys Lys Asp 

70 75 
gag ggg cac agt tat atg agg egg aaa aca tec 
Glu Gly His Ser Tyr Met Arg Arg Lys Thr Ser 

85 90 
tagtagtget ggttagtgca ratggacctt tattaaaggt 
gaccttgtga gtgtacagta tcatgtttct tgttctagaa 
atagcagttg caaccagaca actgtcgtaa attttgtcct 
ctcattcttt ttccacagag tgagegtcat aatattttct 



cgc aca 
Arg Thr 
30 

cct aac 
Pro Asn 
45 

gtg gca 
Val Ala 

etc tat 
Leu Tyr 



ttgatcaag atg tct tea 
Met Ser Ser 
1 

ccc aaa gar aaa gag 
Pro Lys Glu Lys Glu 
15 

aca acc tta ttt cgt 
Thr Thr Leu Phe Arg 
35 

aaa cct gta atg get 
Lys Pro Val Met Ala 
50 

tat att ggt tat eta 
Tyr lie Gly Tyr Leu 
65 

gaa get att gat agt 
Glu Ala lie Asp Ser 
80 

aaa tgg gat 
Lys Trp Asp 
95 

tctgaaatct tcaaatgaaa 
catgetaatg aagagagaag 
ttcacagctg cagecattat 
ttccttactc ttata 



58 



106 



154 



202 



250 



298 



340 



400 
460 
520 
575 



<210> 428 
<211> 493 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 177. .470 



<400> 428 

atcgaccacg gaaatttgac acctccgggc ttggaagcag ctctctcctc cttccccgct 60 
gcttataaac ctcagccctg aggctccagc tcactctacc ccatctcctt geegggtcag 120 
ccctgacaaa ggtcagctag ccccttgagg acatcagctt tggectcagg gtccta atg 179 

Met 
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gca 


gca 


gaa 


cca 


ctg 


aca 


gag 


eta 


gag 


gag 


tec 


att 


gag 


aac 


gtg 


gtc 


227 


Ala 


Ala 


Glu 


Pro 


Leu 


Thr 


Glu 


Leu 


Glu 


Glu 


Ser 


He 


Glu 


Asn 


Val 


Val 










5 










10 










15 








acc 


acc 


ttc 


ttc 


acc 


ttt 


gca 


agg 


cag 


gag 


ggc 


egg 


aag 


gat 


age 


etc 


275 


Thr 


Thr 


Phe 


Phe 


Thr 


Phe 


Ala 


Arg 


Gin 


Glu 


Gly 


Arg 


Lys 


Asp 


Ser 


Leu 








20 










25 










30 










age 


gtc 


aac 


gag 


ttc 


aaa 


gag 


ctg 


gtt 


acc 


cag 


cag 


ttg 


ccc 


cat 


ctg 


323 


Ser 


Val 


Asn 


Glu 


Phe 


Lys 


Glu 


Leu 


Val 


Thr 


Gin 


Gin 


Leu 


Pro 


His 


Leu 






35 










40 










45 












etc 


aag 


gat 


gtg 


ggc 


tct 


ctt 


gat 


gag 


aag 


atg 


aag 


age 


ttg 


gat 


gtg 


371 


Leu 


Lys 


Asp 


Val 


Gly 


Ser 


Leu 


Asp 


Glu 


Lys 


Met 


Lys 


Ser 


Leu 


Asp 


Val 




50 










55 










60 










65 




aat 


cag 


gac 


teg 


gag 


etc 


aag 


ttc 


aat 


gag 


tac 


tgg 


aga 


ttg 


att 


ggg 


419 


Asn 


Gin 


Asp 


Ser 


Glu 


Leu 


Lys 


Phe 


Asn 


Glu 


Tyr 


Trp 


Arg 


Leu 


He 


Gly 












70 










75 










80 






gag 


ctg 


gec 


aag 


gaa 


ate 


agg 


aag 


aag 


aaa 


gac 


ctk 


aag 


ate 


agg 


aag 


467 


Glu 


Leu 


Ala 


Lys 


Glu 


He 


Arg 


Lys 


Lys 


Lys 


Asp 


Leu 


Lys 


He 


Arg 


Lys 





85 90 95 

aag taaagccgcc tggctgagat ggg 4 93 

Lys 



<210> 429 
<211> 505 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 114 . .458 



<400> 429 

agttccggct ctgcstgaag agetttgeat tgtgggaagt ytttytttct cgttccccgg 60 
ccatcttagc ggctgctgtt ggttgggggc cgtcccgctc ctaaggcagg aag atg 116 

Met 
1 



gtg 


gee 


gca 


aag 


aag 


acg 


aaa 


aag 


teg 


ctg 


gag 


teg 


ate 


aac 


tct 


agg 


164 


Val 


Ala 


Ala 


Lys 


Lys 


Thr 


Lys 


Lys 


Ser 


Leu 


Glu 


Ser 


He 


Asn 


Ser 


Arg 










5 










10 










15 








etc 


caa 


etc 


gtt 


atg 


aaa 


agt 


ggg 


aag 


tac 


gtc 


ctg 


ggg 


tac 


aag 


cag 


212 


Leu 


Gin 


Leu 


Val 


Met 


Lys 


Ser 


Gly 


Lys 


Tyr 


Val 


Leu 


Gly 


Tyr 


Lys 


Gin 








20 










25 










30 










act 


ctg 


aag 


atg 


ate 


aga 


caa 


ggc 


aaa 


gcg 


aaa 


ttg 


gtc 


att 


etc 


get 


260 


Thr 


Leu 


Lys 


Met 


He 


Arg 


Gin 


Gly 


Lys 


Ala 


Lys 


Leu 


Val 


He 


Leu 


Ala 






35 










40 










45 












aac 


aac 


tgc 


cca 


get 


ttg 


agg 


aaa 


tct 


gaa 


ata 


gag 


tac 


tat 


get 


atg 


308 


Asn 


Asn 


Cys 


Pro 


Ala 


Leu 


Arg 


Lys 


Ser 


Glu 


He 


Glu 


Tyr 


Tyr 


Ala 


Met 




50 










55 










60 










65 




ttg 


get 


aaa 


act 


ggt 


gtc 


cat 


cac 


tac 


agt 


ggc 


aat 


aat 


att 


gaa 


ctg 


356 


Leu 


Ala 


Lys 


Thr 


Gly 


Val 


His 


His 


Tyr 


Ser 


Gly 


Asn 


Asn 


He 


Glu 


Leu 












70 










75 










80 






ggc 


aca 


gca 


tgc 


gga 


aaa 


tac 


tac 


aga 


gtg 


tgc 


aca 


ctg 


get 


ate 


att 


404 


Gly 


Thr 


Ala 


Cys 


Gly 


Lys 


Tyr 


Tyr 


Arg 


Val 


Cys 


Thr 


Leu 


Ala 


He 


He 










85 










90 










95 








gat 


cca 


ggt 


gac 


tct 


gac 


ate 


att 


aga 


age 


atg 


cca 


gaa 


cag 


act 


ggt 


452 


Asp 


Pro 


Gly 


Asp 


Ser 


Asp 


He 


lie 


Arg 


Ser 


Met 


Pro 


Glu 


Gin 


Thr 


Gly 





100 105 110 



gaa aag taaacctttt cacctacaaa atttcacctg caaaaaaaaa aaaaaav 505 
Glu Lys 
115 



<210> 430 
<211> 772 
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<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 324 . . 629 



247 



PCT/IB01/00914 



<400> 430 

ataatcgcgt catttccggg aggggacgaa ggggtagttc tttcacctcg gctgggcgcc 
tagaaaagcc tagaaacagc tccttttttc ttccgcctcc gagtcttcgc gtcagcgtcc 
tgcgcagggc ccttggggcg aatcgcggtg cgcgtcgggg cgaccgccct ccctccctgg 
gaggggcgag ggggctagcg gcgacsgctg gggcgagcgc gcctgcgcgc tgggtgattt 
tttcacgtgt cgccagggcc ggactgcgag tctctttgcg gcgctacact agagcagagt 
acgagtctga ggcggaggga gta atg gca gga caa gcg ttt aga aag ttt ctt 

Met Ala Gly Gin Ala Phe Arg Lys Phe Leu 
15 10 
cca etc ttt gac cga gta ttg gtt gaa agg agt get get gaa act gta 
Pro Leu Phe Asp Arg Val Leu Val Glu Arg Ser Ala Ala Glu Thr Val 

15 20 25 

acc aaa gga ggc att atg ctt cca gaa aaa tct caa gga aaa gta ttg 
Thr Lys Gly Gly lie Met Leu Pro Glu Lys Ser Gin Gly Lys Val Leu 

30 35 4 0 

caa gca aca gta gtc get gtt gga teg ggt tct aaa gga aag ggt gga 
Gin Ala Thr Val Val Ala Val Gly Ser Gly Ser Lys Gly Lys Gly Gly 

45 50 55 

gag att caa cca gtt age gtg aaa gtt gga gat aaa gtt ctt etc cca 
Glu lie Gin Pro Val Ser Val Lys Val Gly Asp Lys Val Leu Leu Pro 

60 65 70 

gaa tat gga ggc acc aaa gta gtt eta gat rac aag gat tat ttc eta 
Glu Tyr Gly Gly Thr Lys Val Val Leu Asp Xaa Lys Asp Tyr Phe Leu 
75 ~ " 80 85 " 90 

ttt aga gat ggt gac att ctt gga aag tac gta gac tgaaataagt 
Phe Arg Asp Gly Asp lie Leu Gly Lys Tyr Val Asp 

95 100 
cactattgaa atggcatcaa catgatgetg cccattccac tgaagttctg aaatctttcg 
tcatgtaaat aatttccata tttctctttt ataataaact aatgataact aatgaccaaa 
aaaaaaaaaa aaa 



60 
120 
180 
240 
300 
353 



401 



449 



497 



545 



593 



639 



699 
759 
772 



<210> 431 
<211> 672 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 224. 



,529 



<400> 431 

cataagaaca gggaggttag aagtagggtc ttggtgacaa 
ggggagagtg cgtcatatgt tgttctagga agattgtagt 
taatgtttgt gtattegget atgaagaata ggccagggcc 
gcgctacact agagcagagt acgagtctga ggcggaggga 



gcg ttt aga 
Ala Phe Arg 
5 

agt get get 
Ser Ala Ala 

tct caa gga 
Ser Gin Gly 



aag ttt 
Lys Phe 

gaa act 
Glu Thr 

25 
aaa gta 
Lys Val 
40 



ctt cca 
Leu Pro 
10 

gta acc 
Val Thr 

ttg caa 
Leu Gin 



etc ttt 
Leu Phe 

aaa gga 
Lys Gly 

gca aca 
Ala Thr 
45 



gac cga 
Asp Arg 

15 
ggc att 
Gly lie 
30 

gta gtc 
Val Val 



aatatgttgt gtagagttca 
ggtgagggtg tttattataa 
ggactgcgag tctctttgcg 
gta atg gca gga caa 

Met Ala Gly Gin 

1 

gta ttg gtt gaa agg 
Val Leu Val Glu Arg 
20 

atg ctt cca gaa aaa 
Met Leu Pro Glu Lys 
35 

get gtt gga teg ggt 
Ala Val Gly Ser Gly 
50 



60 
120 
180 

235 



283 



331 



379 
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tct aaa gga aag ggt gga gag att caa cca gtt age gtg aaa gtt gga 427 
Ser Lys Gly Lys Gly Gly Glu lie Gin Pro Val Ser Val Lys Val Gly 

55 " 60 65 

gat aaa gtt ctt etc cca gaa tat gga ggc acc aaa gta gtt eta gat 475 
Asp Lys Val Leu Leu Pro Glu Tyr Gly Gly Thr Lys Val Val Leu Asp 

70 75 80 

gac aag gat tat ttc eta ttt aga gat ggt gac att ctt ggr aag tac 523 
Asp Lys Asp Tyr Phe Leu Phe Arg Asp Gly Asp lie Leu Gly Lys Tyr 
8 5 ' ^ ~ 90 95 100 

gta gac tgaaataagt cactattgaa atggcatcaa catgatgetg cccattccac 579 
Val Asp 

tgaagttctg aaatctttcg tcatgtaaat aatttccata tttctctttt ataataaact 639 
aatgataact aatgaccaaa aaaaaaaaaa aaa 672 

<210> 432 
<211> 517 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 97 . .444 

<400> 432 

atetttcegt cccgggcagc cagcgccagt cggagccagc gcgagccgcc gccgccatca 60 

ctgccgctgc caagtcctcc acccgctgcc cccgcc atg tct get acc get gee 114 

Met Ser Ala Thr Ala Ala 
1 5 

acg gee ccc cct get gee ccg get ggg gag ggt ggt ccc cct gca ccc 162 

Thr Ala Pro Pro Ala Ala Pro Ala Gly Glu Gly Gly Pro Pro Ala Pro 

10 15 20 

cct cca aac etc acc agt aac agg aga ctg cag cag acc cag gee cag 210 

Pro Pro Asn Leu Thr Ser Asn Arg Arg Leu Gin Gin Thr Gin Ala Gin 

25 30 35 

gtg gat gag gtg gtg gac ate atg agg gtg aac gtg gac aag gtc ctg 258 

Val Asp Glu Val Val Asp lie Met Arg Val Asn Val Asp Lys Val Leu 

40 45 50 

gag cga gac cag aag ctg teg gag ctg gac gac cgt gca gat gca etc 306 

Glu Arg Asp Gin Lys Leu Ser Glu Leu Asp Asp Arg Ala Asp Ala Leu 
55 60 65 70 

cag gcg ggg gee tec cag ttt gaa aca age gca gee aag etc aag cgc 354 

Gin Ala Gly Ala Ser Gin Phe Glu Thr Ser Ala Ala Lys Leu Lys Arg 

75 80 85 

aaa tac tgg tgg aaa aac etc aag atg atg ate ate ttg gga gtg att 402 

Lys Tyr Trp Trp Lys Asn Leu Lys Met Met lie lie Leu Gly Val lie 

90 " 95 100 

tgc gee ate ate etc ate ate ate ata gtt tac ttc age act 444 

Cys Ala lie lie Leu lie lie lie lie Val Tyr Phe Ser Thr 

105 110 115 

taaatccccg aggagtctgc cctgcctaga gaagggcetc tcccccaacc ctcagccgtt 504 
cctccacctc tct 517 

<210> 433 

<211> 485 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 21. . 311 



<400> 433 

tttgttttct aatagggcaa atg tct eta age ttg gtg ttt aga get get tea 



53 
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tat ttt 
Tyr Phe 

cag cct 
Gin Pro 

ttt gag 
Phe Glu 
45 

ggt aat 
Gly Asn 
60 

ttc cct 
Phe Pro 



aaa eta 
Lys Leu 

15 
cct ggg 
Pro Gly 
30 

agg ttt 
Arg Phe 

tta cag 
Leu Gin 

aag cag 
Lys Gin 



atg tgt ctt aca 
Met Cys Leu Thr 
95 

gggggaatct ttta 
catcaccata aacc 
tacaattttt tatt 



Met Ser Leu S 
1 

gtt cca ttc cac agt 
Val Pro Phe His Ser 
20 

tgg gtc gtc ttg acc 
Trp Val Val Leu Thr 
35 

tea tac cag aat gta 
Ser Tyr Gin Asn Val 
50 

cca gaa aca aat ata 
Pro Glu Thr Asn lie 
65 

ata age aga aac ttg 
lie Ser Arg Asn Leu 
8 0 

tat ttt tgactaaaag 
Tyr Phe 



249 

er Leu Val Phe Arg Al 
5 

tct agt tea aac cag 
Ser Ser Ser Asn Gin 
25 

caa act ctt gtg ttg 
Gin Thr Leu Val Leu 
40 

cca raa agt gca caa 
Pro Xaa Ser Ala Gin 
55 

cat ttg ttt cat ttc 
His Leu Phe His 



Phe 



70 



ttt aac tea tta ctt 
Phe Asn Ser Leu Leu 
85 

ttatagaatg ttattcctct 



a Ala Ser 
10 

ttt tta 
Phe Leu 

tta cat 
Leu His 

ggt aaa 
Gly Lys 

ttg act 
Leu Thr 
75 

tgt ttg 
Cys Leu 
90 



caggtg gaggaatggg gatagcagta ttgectcaga ttcaaactgg 
ctgtag gccagggtgg aatgaagtca gctccttttt atagttgaaa 
caccat tgtc 



101 



149 



197 



245 



293 



341 



401 
461 
485 



<210> 434 
<211> 511 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 31. .369 



<400> 434 
acttgetctg 



get gag gac 
Ala Glu Asp 

ttc gac cac 
Phe Asp His 
25 

gcg gat gat 
Ala Asp Asp 

ggc ' ttc ate 
Gly Phe lie 

cca gat gee 
Pro Asp Ala 
75 

get gga gac 
Ala Gly Asp 
90 

act ctg gtg 
Thr Leu Val 
105 

cctctctgcc 
gttcgtttat 
ct 



cccgctcaaa cagttgeagg atg teg atg aca gac ttg ctg aac 

Met Ser Met Thr Asp Leu Leu Asn 
1 5 
ate aag aag gcg gtg gga gee ttt age get acc gac tec 
lie Lys Lys Ala Val Gly Ala Phe Ser Ala Thr Asp Ser 

15 20 
aaa aag ttc ttc caa atg gtc ggc ctg aag aaa aag agt 
Lys Lys Phe Phe Gin Met Val Gly Leu Lys Lys Lys Ser 

30 35 4 0 

gtg aag aag gtg ttt cac atg ctg gac aag gac aaa agt 
Val Lys Lys Val Phe His Met Leu Asp Lys Asp Lys Ser 

4 5 50 55 

gag gag gat gag ctg gga ttc ate eta aaa ggc ttc tec 
Glu Glu Asp Glu Leu Gly Phe lie Leu Lys Gly Phe Ser 
60 65 70 

aga gac ctg tct get aaa gaa acc aag atg ctg atg get 
Arg Asp Leu Ser Ala Lys Glu Thr Lys Met Leu Met Ala 

80 85 
aaa gat ggg gac ggc aaa att ggg gtt gac gaa ttc tec 
Lys Asp Gly Asp Gly Lys lie Gly Val Asp Glu Phe Ser 

95 " 100 

get gaa ast aag aag cac tgactgcccc tggtcttcca 
Ala Glu Xaa Lys Lys His 
110 

ctgaacaccc aatctcggcc cctctcgcca ccctcctgca tttctgttca 
gttatttttt actcccccat cccctgtggc cctctaatga caccattctt 



54 



102 



150 



198 



246 



294 



342 



389 



449 
509 
511 



<210> 435 



WO 02/083898 
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250 

<211> 511 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 82. .417 



<400> 435 

cccttctgcg cggtcacgcg agccagcgcc tgggcctgga accgggccgt agcccccagt 60 
ttcgcccacc acctccctac c atg gac ccc cgc aaa gtg aac gag ctt egg 111 

Met Asp Pro Arg Lys Val Asn Glu Leu Arg 
15 10 
gec ttt gtg aaa atg tgt aag cag gat ccg age gtt ctg cac acc gag 159 
Ala Phe Val Lys Met Cys Lys Gin Asp Pro Ser Val Leu His Thr Glu 

15 20 25 

gaa atg cgc ttc ctg agg gag tgg gtg gag age atg ggt ggt aaa gta 207 
Glu Met Arg Phe Leu Arg Glu Trp Val Glu Ser Met Gly Gly Lys Val 

30 35 40 

cca cct get act caa aaa get aaa tea gaa gaa aat acc aag gaa gaa 255 
Pro Pro Ala Thr Gin Lys Ala Lys Ser Glu Glu Asn Thr Lys Glu Glu 

45 1 50 55 

aaa cct gat agt aag aag gtg gag gaa gac tta aag gca gac gaa cca 303 
Lys Pro Asp Ser Lys Lys Val Glu Glu Asp Leu Lys Ala Asp Glu Pro 

60 ~ 65 7 0 

tea agt gag gaa agt gat eta gaa att gat aaa gaa ggt gtg att gaa 351 
Ser Ser Glu Glu Ser Asp Leu Glu lie Asp Lys Glu Gly Val lie Glu 
75 80 85 90 

cca gac act gat get cct caa gaa atg gga gat gaa aat gcg gag ata 399 
Pro Asp Thr Asp Ala Pro Gin Glu Met Gly Asp Glu Asn Ala Glu lie 

95 100 105 

acg get caa cat ttt ttg tagecacagg cttccacgga cttcacgtca 447 
Thr Ala Gin His Phe Leu 
110 

ttattggctc aactttcctc actatctget tcatccgcca actaatattt cactttacat 507 
ccca 511 



<210> 436 
<211> 475 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 22 . . 375 



<400> 436 



agtcgageca gcgtcgccgc g 


atg 


gtg 


ttg 


ttg 


gag 


age 


gag 


cag 


ttc 


ctg 














Met 


Val 


Leu 


Leu 


Glu 


Ser 


Glu 


Gin 


Phe 


Leu 














1 








5 










10 


acg 


gag 


ctg 


acc 


aga 


ctt 


ttc 


cag 


aag 


tgc 


egg 


acg 


teg 


ggc 


age 


gtc 


Thr 


Glu 


Leu 


Thr 


Arg 


Leu 


Phe 


Gin 


Lys 


Cys 


Arg 


Thr 


Ser 


Gly 


Ser 


Val 










15 










20 










25 




tat 


ate 


acc 


ttg 


aag 


aag 


tat 


gac 


ggt 


cga 


acc 


aaa 


ccc 


att 


cca 


aag 


Tyr 


He 


Thr 


Leu 


Lys 


Lys 


Tyr 


Asp 


Gly 


Arg 


Thr 


Lys 


Pro 


He 


Pro 


Lys 








30 










35 










40 






aaa 


ggt 


act 


gtg 


gag 


ggc 


ttt 


gag 


ccc 


gca 


gac 


aac 


aag 


tgt 


ctg 


tta 


Lys 


Gly 


Thr 


Val 


Glu 


Gly 


Phe 


Glu 


Pro 


Ala 


Asp 


Asn 


Lys 


Cys 


Leu 


Leu 






45 










50 










55 








aga 


get 


acc 


gat 


ggg 


aag 


aag 


aag 


ate 


age 


act 


gtg 


gtg 


age 


tec 


aag 


Arg 


Ala 


Thr 


Asp 


Gly 


Lys 


Lys 


Lys 


He 


Ser 


Thr 


Val 


Val 


Ser 


Ser 


Lys 




60 










65 










70 










gaa 


gtg 


aat 


aag 


ttt 


cag 


atg 


get 


tat 


tea 


aac 


etc 


ctt 


aga 


get 


aac 



51 



99 



147 



195 



243 



291 
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Glu Val Asn Lys Phe Gin Met Ala Tyr Ser Asn Leu Leu Arg Ala Asn 
75 80 85 90 

atg gat ggg ttg aag aag aga gac aaa aag aac gaa act aag aag acc 
Met Asp Gly Leu Lys Lys Arg Asp Lys Lys Asn Glu Thr Lys Lys Thr 

95 100 105 

aaa gca gca gca gca gca gca gca aca gca gca cag taaagggcat 
Lys Ala Ala Ala Ala Ala Ala Ala Thr Ala Ala Gin 

110 115 
acatttcctg ctttcaccaa ttaaccactg aattgctatt ttttcctttt ggccagatag 
ctaggtttct ggttccccca cagtaggtgt 



339 



385 



445 
475 



<210> 437 
<211> 896 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 351 . . 677 

<220> 

<221> misc_f eature 
<222> 234 

<223> n=a, g, c or t 



<400> 437 

gcacgagacc 

taagtgaaaa 

ctgtacatgt 

atactgtgct 

atgattctgg 

ctccattcca 



attacataaa 
tgacttagaa 
ttctgatttc 
ttttccagca 
tctgataaca 
tactcctcct 



aacaaacagg 
gaactaaggg 
agtacagata 
aaggatctca 
ataaacagtt 
gaggaaccca 



aatgcattct 
tagatcacta 
atagtggatc 
aggaaaagga 
cacaagagca 
atgaatgtga 



agt 
Ser 

cga 
Arg 

act 
Thr 
35 
cag 
Gin 



tta cct 
Leu Pro 
5 

aat aga 
Asn Arg 
20 

gaa ttt 
Glu Phe 

tct tta 
Ser Leu 



aaa 


ata 


crt 


cga 


aaa 


Lys 


He 


Xaa 


Arg 


Lys 






10 






aat 


ctg 


gaa 


cca 


aat 


Asn 


Leu 


Glu 


Pro 


Asn 



25 

wtt aca era gaa gaa aac aga att 
Xaa Thr Xaa Glu Glu Asn Arg He 

40 45 
eta gac ttg ttt cag act agt gaa 
Leu Asp Leu Phe Gin Thr Ser Glu 
55 60 
ttg ggt ttc aca age tac aca gaa aag agt ggt 
Leu Gly Phe Thr Ser Tyr Thr Glu Lys Ser Gly 

70 75 
gat att tgg gaa gag gaa aat tea gat aat ctg 
Asp He Trp Glu Glu Glu Asn Ser Asp Asn Leu 

85 90 
teg tec cct tea act tct aca ttt act ggc ttt 
Ser Ser Pro Ser Thr Ser Thr Phe Thr Gly Phe 

100 105 
cttttcagaa gtgataagga tcatattctt gaaattttta 
taggattttt ttaccagctt tgtttacaga cccaaatgta 
gcaattttct acagaattga atacctgtta aagaaaaatt 
gtaaaaaaaa aaaaaaatg 



tgacatttcc gaacacacat 
taaatgtaac atacaggcat 
tcaaccaaaa cagaagtcag 
ccttcattca atanttactc 
cctaactgtt caggcaaagg 
cttcaagaat atg gat 

Met Asp 

1 

aaa ata ata tta gga 
Lys He He Leu Gly 
15 

gaa ttt gat aaa aga 
Glu Phe Asp Lys Arg 
30 

tgt agt tea ccg gta 
Cys Ser Ser Pro Val 
50 

gag aaa tea gaa ttt 
Glu Lys Ser Glu Phe 
65 

ata tgc aat gtt tta 
He Cys Asn Val Leu 
80 

tta aca gcg ttw ttc 
Leu Thr Ala Xaa Phe 
95 

tagaatttaa aaaatgeata 



taaatatgta tggaaattct 
aatattaaaa ataaatattt 
acagaataaa cttgtgactg 



60 
120 
180 
240 
300 
356 



404 



452 



500 



548 



596 



644 



697 



757 
817 
877 
896 



<210> 438 
<211> 576 
<212> DNA 
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<400> 438 

cattttaatt catatytgag tgggcggtgg cgattggtgt tggcggtctg gctcagctgg 60 

gcagggggta actttactga tttgggggtg gtttttagtt taatttttct tttctagctt 120 

cccatcgacg gtcagtgcgc acgttgtaat cagctgaggc c atg tea gga gac gga 176 

Met Ser Gly Asp Gly 
1 5 

gec acg gag cag gca get gag tat gtc cca gag aag gtg aag aaa gcg 224 

Ala Thr Glu Gin Ala Ala Glu Tyr Val Pro Glu Lys Val Lys Lys Ala 

10 15 20 

gaa aag aaa tta gaa gag aat cca tat gac ctt gat get tgg age att 272 

Glu Lys Lys Leu Glu Glu Asn Pro Tyr Asp Leu Asp Ala Trp Ser lie 

25 30 35 

etc att cga gag gca cag aat caa cct ata gac aaa gca egg aag act 320 

Leu lie Arg Glu Ala Gin Asn Gin Pro lie Asp Lys Ala Arg Lys Thr 

40 45 50 

tat gaa cgc ctt gtt gec cag ttc ccc agt tct ggc aga ttc tgg aaa 368 

Tyr Glu Arg Leu Val Ala Gin Phe Pro Ser Ser Gly Arg Phe Trp Lys 

55 60 65 

ctg tac att gaa gca gag gtt act att tta ttt tat ttt ttc tta tat 416 

Leu Tyr lie Glu Ala Glu Val Thr lie Leu Phe Tyr Phe Phe Leu Tyr 

70 "* 75 80 85 

cag tat tgc age att cac tgt agt gat aga aaa caa gtt agg aac ata 4 64 

Gin Tyr Cys Ser lie His Cys Ser Asp Arg Lys Gin Val Arg Asn lie 

90 95 100 

gec aat taggacaagg aggatttaaa tgtgtcttac ctttattttg taaaataggt 520 
Ala Asn 

ataaaggagt aattaaaatg aatttttgaa atttgggtct tttacaagct gatgat 57 6 



<210> 439 
<211> 484 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 32. .349 



<400> 439 

actaactagc aaaeggggae tagaaatagg g atg ctg aaa age aac ggg gag 52 

Met Leu Lys Ser Asn Gly Glu 
1 5 



aga 


cgc 


agt 


cgt 


aac 


gca 


ctt 


ccg 


gcg 


gtc 


tac 


gcg 


agg 


aag 


atg 


get 


100 


Arg 


Arg 


Ser 


Arg 


Asn 


Ala 


Leu 


Pro 


Ala 


Val 


Tyr 


Ala 


Arg 


Lys 


Met 


Ala 








10 










15 










20 










gca 


tec 


cag 


cag 


caa 


get 


tea 


gcg 


get 


tec 


tea 


get 


get 


ggt 


gta 


teg 


148 


Ala 


Ser 


Gin 


Gin 


Gin 


Ala 


Ser 


Ala 


Ala 


Ser 


Ser 


Ala 


Ala 


Gly 


Val 


Ser 






25 










30 










35 












ggt 


cct 


agt 


teg 


get 


ggc 


ggc 


ccg 


ggt 


ccc 


cag 


cag 


cag 


ccg 


caa 


ccg 


196 


Gly 


Pro 


Ser 


Ser 


Ala 


Gly 


Gly 


Pro 


Gly 


Pro 


Gin 


Gin 


Gin 


Pro 


Gin 


Pro 




40 










45 










50 










55 




cca 


gca 


caa 


ctg 


gtg 


ggc 


cct 


gec 


cag 


age 


ggc 


etc 


ctg 


cag 


caa 


cag 


244 


Pro 


Ala 


Gin 


Leu 


Val 


Gly 


Pro 


Ala 


Gin 


Ser 


Gly 


Leu 


Leu 


Gin 


Gin 


Gin 












60 










65 










70 






caa 


cag 


gac 


ttc 


gat 


cct 


gtg 


cag 


cgt 


tat 


aag 


atg 


etc 


ate 


ccg 


cag 


292 


Gin 


Gin 


Asp 


Phe 


Asp 


Pro 


Val 


Gin 


Arg 


Tyr 


Lys 


Met 


Leu 


He 


Pro 


Gin 










75 










80 










85 








ctg 


aag 


gag 


agt 


eta 


cag 


gtg 


att 


ggc 


ctt 


aag 


cag 


cga 


gaa 


gca 


aac 


340 
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Leu Lys Glu Ser Leu Gin Val lie Gly Leu Lys Gin Arg Glu Ala Asn 

90 95 100 

tgg att tgg tagtctagag gggcaggccg rgggccaggt tagtcggaga 389 
Trp lie Trp 
105 

gagcgctaag cagaaagagg aatagaacct gctgttttgg cctggctggt gcacgcctgt 449 
gttccagcac tttgggaggc cgaggcgkgc ggatc 484 

<210> 440 
<211> 523 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 114 . .371 



<400> 440 

ctctctcagc ttccggctgg tagtagttcc gcttcctgtc cgactgtggt gtctttgctg 60 
agggtcacat tgagctgcag gttgaatccg gggtgccttt aggattcagc acc atg 116 

Met 



gcg 


gaa 


gac 


atg 


gag 


acc 


aaa 


ate 


aag 


aac 


tac 


aag 


acc 


gec 


cct 


ttt 


164 


Ala 


Glu 


Asp 


Met 


Glu 


Thr 


Lys 


He 


Lys 


Asn 


Tyr 


Lys 


Thr 


Ala 


Pro 


Phe 










5 










10 










15 








gac 


age 


cgc 


ttc 


ccc 


aac 


cag 


aac 


cag 


act 


aga 


aac 


tgc 


tgg 


cag 


aac 


212 


Asp 


Ser 


Arg 


Phe 


Pro 


Asn 


Gin 


Asn 


Gin 


Thr 


Arg 


Asn 


Cys 


Trp 


Gin 


Asn 








20 










25 










30 










tac 


ctg 


gac 


ttc 


cac 


cgc 


tgt 


cag 


aag 


gca 


atg 


acc 


get 


aaa 


gga 


ggc 


260 


Tyr 


Leu 


Asp 


Phe 


His 


Arg 


Cys 


Gin 


Lys 


Ala 


Met 


Thr 


Ala 


Lys 


Gly 


Gly 






35 










40 










45 












gat 


ate 


tct 


gtg 


tgc 


gaa 


tgg 


tac 


cag 


cgt 


gtg 


tac 


cag 


tec 


etc 


tgc 


308 


Asp 


He 


Ser 


Val 


Cys 


Glu 


Trp 


Tyr 


Gin 


Arg 


Val 


Tyr 


Gin 


Ser 


Leu 


Cys 




50 










55 










60 










65 




ccc 


aca 


tec 


tgg 


gtc 


aca 


gac 


tgg 


gat 


gag 


caa 


egg 


get 


gaa 


ggc 


acg 


356 


Pro 


Thr 


Ser 


Trp 


Val 


Thr 


Asp 


Trp 


Asp 


Glu 


Gin 


Arg 


Ala 


Glu 


Gly 


Thr 












70 










75 










80 







ttt ccc ggg aag ate tgaactggct gcatctccct ttcctctgtc ctccatcctt 411 
Phe Pro Gly Lys He 
85 

ctcccaggat ggtgaagggg gacctggtac ccagtgatcc ccaccccagg atcctaaatc 471 
atgacttacc tgctaataaa aactcattgg aaaagyraaa aaaaaaaaaa aa 523 



<210> 441 
<211> 499 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 95. .409 



<400> 441 

ttcgaagtgg ttgtstctgc actatttact gaaaagttca tcctttcccc agtgatttgt 60 
aacactgcct ctttcataaa ttaaattttt gtgt atg tgt gtg ggt ctt ttg atg 115 

Met Cys Val Gly Leu Leu Met 
1 5 

gtt tct att ctg act gac ate aat ttg tct aat ctt gta gca gta cag 163 
Val Ser He Leu Thr Asp He Asn Leu Ser Asn Leu Val Ala Val Gin 

10 15 20 

tac agt cct gat tat tgc aac ttt agg aaa agg tct gat aaa aac caa 211 
Tyr Ser Pro Asp Tyr Cys Asn Phe Arg Lys Arg Ser Asp Lys Asn Gin 
25 ~ 30 35 
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gat gcc tec aca ttt tgt cat aat tgt aac caa ttt cac etc tgt etc 259 

Asp Ala Ser Thr Phe Cys His Asn Cys Asn Gin Phe His Leu Cys Leu 

40 45 50 55 

cag tat cac cac aaa att gtt ctt cct tgg agt gtc ttg get att ctt 307 

Gin Tyr His His Lys lie Val Leu Pro Trp Ser Val Leu Ala lie Leu 

60 65 70 

age caa ctg ttc etc cat att act tct aga att aga cca aca att tat 355 
Ser Gin Leu Phe Leu His lie Thr Ser Arg lie Arg Pro Thr lie Tyr 

75 80 85 

aaa tea aac aaa ccc aag age att gaa att ttg att gga ttt gta ttg 403 
Lys Ser Asn Lys Pro Lys Ser lie Glu lie Leu lie Gly Phe Val Leu 

90 95 100 

aat tta tagattaatc tggtaaacca tgtcatcttt acaatgttgt cttccaatac 459 
Asn Leu 
105 

atgaatatgg tacagctctt catttactta ggcctttgaa 499 



<210> 442 
<211> 460 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 75 . . 377 



<400> 442 

ccctcgcgga gcttactgag cgcggccccg agcccagctc cgccgccgag cgcctgtgcc 60 



ggcacggmwa cacc 


atg 


gag 


cgc 


ccg 


gat 


aag 


gcg 


gcg 


ctg 


aac 


gca 


ctg 


110 










Met 


Glu 


Arg 


Pro 


Asp 


Lys 


Ala 


Ala 


Leu 


Asn 


Ala 


Leu 












1 








5 










10 








cag 


cct 


cct 


gag 


ttc 


aga 


aat 


gaa 


age 


tea 


tta 


gca 


tct 


aca 


ctg 


aag 


158 


Gin 


Pro 


Pro 
15 


Glu 


Phe 


Arg 


Asn 


Glu 
20 


Ser 


Ser 


Leu 


Ala 


Ser 
25 


Thr 


Leu 


Lys 




acg 


etc 


ctg 


ttc 


ttc 


aca 


get 


tta 


atg 


ate 


act 


gtt 


cct 


att 


ggg 


tta 


206 


Thr 


Leu 
30 


Leu 


Phe 


Phe 


Thr 


Ala 
35 


Leu 


Met 


He 


Thr 


Val 
40 


Pro 


He 


Gly 


Leu 




tat 


ttc 


aca 


act 


aaa 


tct 


tac 


ata 


ttt 


gaa 


ggc 


gcc 


ctt 


ggg 


atg 


tec 


254 


Tyr 


Phe 


Thr 


Thr 


Lys 


Ser 


Tyr 


He 


Phe 


Glu 


Gly 


Ala 


Leu 


Gly 


Met 


Ser 




45 










50 










55 










60 




aat 


agg 


gac 


age 


tat 


ttt 


tac 


get 


get 


att 


gtt 


gca 


gtg 


gtc 


gcc 


gtc 


302 


Asn 


Arg 


Asp 


Ser 


Tyr 
65 


Phe 


Tyr 


Ala 


Ala 


He 
70 


Val 


Ala 


Val 


Val 


Ala 
75 


Val 




cat 


gtg 


gtg 


ctg 


gcc 


etc 


ttt 


gtg 


tat 


ktg 


gcc 


tgg 


aat 


gaa 


ggc 


tea 


350 


His 


Val 


Val 


Leu 
80 


Ala 


Leu 


Phe 


Val 


Tyr 
85 


Xaa 


Ala 


Trp 


Asn 


Glu 
90 


Gly 


Ser 




cga 


cag 


tgk 


cgt 


gaa 


ggc 


aaa 


cag 


gat 


taaagtgaac atcacctttt 




397 


Arg 


Gin 


Xaa 


Arg 


Glu 


Gly 


Lys 


Gin 


Asp 



















95 100 



tatagcatta aattcatttt ttaaaatgat aaatgctgga gggggecate tgatttgaat 457 
aaa 4 60 



<210> 443 
<211> 470 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 30 . . 305 



<400> 443 

agaatgtcgt actctcagga ggctccacc atg ttc agg gat ttc gga cgc cga 



53 
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Met 
1 


Phe 


Arg 


Asp 


Phe 
5 


Gly 


Arg 


Arg 




ctg 


cag 


agg 


gat 


ttg 


aag 


aga 


gtg 


gtg 


gat 


get 


agg 


ctg 


agg 


etc 


age 


101 


Leu 


Gin 


Arg 


Asp 


Leu 


Lys 


Arg 


Val 


Val 


Asp 


Ala 


Arg 


Leu 


Arg 


Leu 


Ser 






10 










15 










20 












gag 


gag 


etc 


age 


ggc 


ggg 


agg 


ate 


aag 


ccg 


aag 


cct 


gtg 


gag 


gtc 


cag 


149 


Glu 


Glu 


Leu 


Ser 


Gly 


Gly 


Arg 


He 


Lys 


Pro 


Lys 


Pro 


Val 


Glu 


Val 


Gin 




25 










30 










35 










40 




gtg 


gtc 


acg 


cat 


cac 


atg 


cag 


cgc 


tac 


gee 


gtg 


tgg 


ttc 


gga 


ggc 


tec 


197 


Val 


Val 


Thr 


His 


His 


Met 


Gin 


Arg 


Tyr 


Ala 


Val 


Trp 


Phe 


Gly 


Gly 


Ser 












45 










50 










55 






atg 


ctg 


gec 


teg 


act 


ccc 


gag 


ttc 


ttt 


cag 


gtc 


tgc 


cac 


acc 


aag 


aag 


245 


Met 


Leu 


Ala 


Ser 


Thr 


Pro 


Glu 


Phe 


Phe 


Gin 


Val 


Cys 


His 


Thr 


Lys 


Lys 










60 










65 










70 








gac 


tat 


gaa 


gag 


tac 


ggg 


ccc 


age 


ate 


tgc 


cgc 


cac 


aac 


ccc 


gtc 


ttt 


293 


Asp 


Tyr 


Glu 


Glu 


Tyr 


Gly 


Pro 


Ser 


He 


Cys 


Arg 


His 


Asn 


Pro 


Val 


Phe 





75 80 85 



gga gtc atg tec tagtgtctgc ctgaacgcgt cgttcgatgg tgtcacgttg 345 
Gly Val Met Ser 
90 

gggaacaagt gtccttcaga acccagagaa ggccgccgtt ctgtaaatag egaegteggt 405 

gttgctgccc agcagcgtgc ttgeattgee ggtgcatgag gcgcggcgcg ggccttcagt 4 65 

aaaag 470 



<210> 444 
<211> 521 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 162. .455 



<400> 444 

agaataegta caccaatgcg gacaagctcc tagaagcagc agagcagttg getcagaegg 60 
gggaatgtga ccccgaggag atctacaagg cagctcgaca cctggaggtg cgcatccaag 120 
acttcgtgcg cagggtggag cageggaage ttctcctgga c atg tct gtt tec ttc 176 

Met Ser Val Ser Phe 
1 5 



cac 


aca 


cac 


acc 


aaa 


gag 


ttg 


tgg 


aca 


tgg 


atg 


gaa 


gac 


ctt 


cag 


aag 


224 


His 


Thr 


His 


Thr 


Lys 
10 


Glu 


Leu 


Trp 


Thr 


Trp 
15 


Met 


Glu 


Asp 


Leu 


Gin 
20 


Lys 




gag 


atg 


ttg 


gag 


gat 


gtc 


tgt 


gca 


gat 


tct 


gtg 


gat 


gca 


gtc 


cag 


gaa 


272 


Glu 


Met 


Leu 


Glu 
25 


Asp 


Val 


Cys 


Ala 


Asp 
30 


Ser 


Val 


Asp 


Ala 


Val 
35 


Gin 


Glu 




ctg 


ate 


aag 


cag 


ttc 


cag 


cag 


cag 


cag 


acc 


gee 


act 


eta 


kat 


gee 


aca 


320 


Leu 


He 


Lys 
40 


Gin 


Phe 


Gin 


Gin 


Gin 
45 


Gin 


Thr 


Ala 


Thr 


Leu 
50 


Xaa 


Ala 


Thr 




etc 


aat 


gtc 


ate 


aag 


gaa 


ggc 


gaa 


gac 


ctt 


ate 


cag 


cag 


etc 


agg 


tea 


368 


Leu 


Asn 
55 


Val 


He 


Lys 


Glu 


Gly 
60 


Glu 


Asp 


Leu 


He 


Gin 
65 


Gin 


Leu 


Arg 


Ser 




gcg 


cct 


ccc 


tec 


etc 


ggg 


gag 


ccc 


age 


gag 


gee 


agg 


tea 


gca 


tgg 


gca 


416 


Ala 


Pro 


Pro 


Ser 


Leu 


Gly 


Glu 


Pro 


Ser 


Glu 


Ala 


Arg 


Ser 


Ala 


Trp 


Ala 




70 










75 










80 










85 




kag 


ctt 


tec 


agt 


ggg 


aaa 


tgc 


etc 


ggg 


eta 


gac 


gtg 


aga 


taagtcctgt 


465 


Xaa 


Leu 


Ser 


Ser 


Gly 
90 


Lys 


Cys 


Leu 


Gly 


Leu 
95 


Asp 


Val 


Arg 











cccagtcttg actcagcaaa atettgettt gtggtcttgt gtgagectte ggtttc 521 



<210> 445 
<211> 625 
<212> DNA 
<213> Homo sapiens 
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<220> 
<221> CDS 
<222> 281. .547 



<400> 445 

atgctcgcca gaatacccct cgttaaaggc aaggcggctt ctggctcttc cgcagctcag 60 
ttatgggttc ctgtgtgctt agtcatcggg tccgacacag gctggactga tctggggagc 120 
cgcgaagggc ctgccttcac aaagggacgt aacgcaagta ctgcgggcag tgtttgaata 180 
tggccctgaa caatgtgtcc ctgtcctccg gtgatcagag gagcagggtg gcctaccgct 240 
cttcccatgg cgacctcaga ccgcgggcgt casgttggcg atg gtc tec gga gac 295 

Met Val Ser Gly Asp 
1 5 

ggc ttc etc gtt tec agg cct gaa gcg att cat eta gga cct egg cag 343 
Gly Phe Leu Val Ser Arg Pro Glu Ala lie His Leu Gly Pro Arg Gin 

10 15 20 

gcg gtg cga cca age gtt egg gee gag age cgt cga gtg gat ggt ggc 391 
Ala Val Arg Pro Ser Val Arg Ala Glu Ser Arg Arg Val Asp Gly Gly 

25 30 35 

ggc egg age cca agg gaa cca gat ggc egg ggc egg age cgc caa ges 439 
Gly Arg Ser Pro Arg Glu Pro Asp Gly Arg Gly Arg Ser Arg Gin Ala 

40 4 5 50 

aga ttc tea cct tac cca ate cct gee gtt gaa ccc gat etc eta aga 487 
Arg Phe Ser Pro Tyr Pro lie Pro Ala Val Glu Pro Asp Leu Leu Arg 

55 60 65 

agt gtg ctg caa cag cgt ttg att gca tta gga ggt gtt ate gca get 535 
Ser Val Leu Gin Gin Arg Leu lie Ala Leu Gly Gly Val lie Ala Ala 
70 75 80 85 

cga att tea gtt taaacgaaca cctttcctct ggccctcact tagcttgtga 587 
Arg lie Ser Val 

acaggecttt ttaaaatcct ttcttggtgt agcagcaa 625 



<210> 446 
<211> 534 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 8. .523 



<400> 446 



agtcaag 


atg 


gcg 


gga 


gca 


get 


ace 


cag 


get 


tec 


ctg 


gag 


teg 


gee 


cca 






Met 


Ala 


Gly 


Ala 


Ala 


Thr 


Gin 


Ala 


Ser 


Leu 


Glu 


Ser 


Ala 


Pro 






1 








5 










10 










egg 


ate 


atg 


egg 


ctg 


gtg 


gee 


gaa 


tgc 


age 


cgc 


tec 


agg 


gee 


egg 


gca 


Arg 


lie 


Met 


Arg 


Leu 


Val 


Ala 


Glu 


Cys 


Ser 


Arg 


Ser 


Arg 


Ala 


Arg 


Ala 


15 










20 










25 










30 


ggc 


gag 


ctg 


tgg 


ctg 


ccg 


cat 


ggg 


aca 


gtg 


gee 


act 


cct 


gtg 


ttc 


atg 


Gly 


Glu 


Leu 


Trp 


Leu 


Pro 


His 


Gly 


Thr 


Val 


Ala 


Thr 


Pro 


Val 


Phe 


Met 








35 










40 










45 




cca 


gtg 


ggc 


acg 


cag 


gee 


ace 


atg 


aag 


ggc 


ate 


acg 


ace 


gaa 


cag 


ctg 


Pro 


Val 


Gly 


Thr 


Gin 


Ala 


Thr 


Met 


Lys 


Gly 


He 


Thr 


Thr 


Glu 


Gin 


Leu 








50 










55 










60 






gac 


get 


ctg 


ggt 


tgc 


cgc 


ate 


tgc 


ctg 


ggc 


aat 


ace 


tac 


cat 


ctg 


ggt 


Asp 


Ala 


Leu 


Gly 


Cys 


Arg 


He 


Cys 


Leu 


Gly 


Asn 


Thr 


Tyr 


His 


Leu 


Gly 






65 










70 










75 








eta 


agg 


ccg 


gga 


ccc 


gag 


ctg 


ate 


cag 


aaa 


gee 


aac 


ggt 


etc 


cac 


ggc 


Leu 


Arg 


Pro 


Gly 


Pro 


Glu 


Leu 


He 


Gin 


Lys 


Ala 


Asn 


Gly 


Leu 


His 


Gly 




80 










85 










90 










ttc 


atg 


aat 


tgg 


cct 


cat 


aat 


ctg 


eta 


acg 


gac 


age 


ggc 


ggt 


ttc 


cag 


Phe 


Met 


Asn 


Trp 


Pro 


His 


Asn 


Leu 


Leu 


Thr 


Asp 


Ser 


Gly 


Gly 


Phe 


Gin 


95 










100 










105 










110 
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atg gtg teg ctg gtg tct ctg tec 
Met Val Ser Leu Val Ser Leu Ser 
115 

ttc cgc tec ccc tac gac ggc aat 
Phe Arg Ser Pro Tyr Asp Gly Asn 
130 

tec gtg cag ate cag aat gcg ctg 
Ser Val Gin lie Gin Asn Ala Leu 
145 150 
gac gac gtg gtt age agt act gtg 
Asp Asp Val Val Ser Ser Thr Val 

160 165 
taggecatgt a 
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gag gtg acg 
Glu Val Thr 
120 

gag acc ctg 
Glu Thr Leu 
135 

ggc teg gac 
Gly Ser Asp 

act ggg cca 
Thr Gly Pro 



gag gag 
Glu Glu 

ctg age 
Leu Ser 

ate ate 
He He 
155 
cgt gtg 
Arg Val 
170 



ggc gtc cgc 
Gly Val Arg 

125 
ccg gag aaa 
Pro Glu Lys 
140 

atg cag ctg 
Met Gin Leu 

gag 
Glu 



385 



433 



481 



523 



534 



<210> 447 
<211> 111 
<212> DNA 
<213> Homo sapiens 

<400> 447 

agagctgagc tgaagcggga cccggagccc gagcagccgc cgccatggca atcaaatttc 60 
tggaagtcat caagcccttc tgtgtcatcc tgcaactaac atctgtgaag c 111 

<210> 448 
<211> 150 
<212> DNA 
<213> Homo sapiens 

<400> 448 

ttttctcaac aattcctcac cgcagagcct ctgaagctcc caccaggcca gctctcctcc 60 

acccaccgct aactctcagc cccagtcacc ctcttggagc ttccctgctt tgaattaaag 120 

accactcatg cgcaaaaaaa aaaaaaaaaa 150 

<210> 449 
<211> 465 
<212> DNA 
<213> Homo sapiens 



<400> 449 

aaccaaagca 

tcagccttca 

tggagattga 

tcaaggagct 

acaagatccg 

aggagaaegg 

taceggggga 

aggggtctct 

<210> 450 
<211> 394 
<212> DNA 
<213> Homo 

<400> 450 

agtctgaaga 

cagccggttg 

cactttgcac 

tttcacagag 

caacaggaaa 

cttccgcaaa 

ttaacataac 



tgggctcttt 
cttcctcctc 
cgtggatgag 
gctggttgac 
gggcatgcag 
tggctcccac 
ccctgcggcc 
tcccctgccc 



sapiens 



tggcggcctc 
cttttgtgag 
agttcggcca 
gtttgggttg 
atcatattat 
catctgatga 
tgacaaaaaa 



gtttcctctt 
actcccgggg 
ctcctggaca 
tgttacaaac 
aagctgagca 
aggacaawtc 
aggctggtgc 
cctcagtttt 



ageagegega 
aagaattcct 
tgtcagaagg 
ggttcagttt 
agatggagta 
tgaaaagaaa 
aaaaaaaaaa 



tgccaggcct 
agctgtgaca 
tggagagtga 
ccacagaggc 
caccccagaa 
gctgcccccc 
catgagcagg 
ccacttttgg 



gggtccttcc 
ggaagaggag 
cgatgcccgg 
cttcatttct 
gaagtgaggg 
aacct cgtag 
gctcctcgtg 
ggtyt 



cagtgacccc 
atcccagaac 
getgecaggg 
ggcctgctgg 
tccccgaccc 
caacakgeaa 
cccctggccc 



ggtgctgcgg 
tggactgegg 
tgeattttae 
tcttcagaag 
aaggtccagg 
gatttttgag 



egctgegtag 
egtcgagtea 
cttttgacaa 
aaggacttcg 
ttcacactag 
actgcagcct 



aagtatcaat 
gctgaaagaa 
ggagactggc 
gaatgeacta 
aaggccaaaa 
attaataaag 



60 
120 
180 
240 
300 
360 
420 
465 



60 
120 
180 
240 
300 
360 
394 



<210> 451 



WO 02/083898 

<211> 391 
<212> DNA 
<213> Homo sapiens 



PCT/IB01/00914 



258 



<400> 451 

aagtctgaag 

tcagccggtt 

acactttgca 

ctttcacaga 

acaacaggaa 

acttccgcaa 

gttaacataa 



atggcggcct 
gcttttgtga 
cagttcggcc 
ggtttgggtt 
aatcatatta 
acatctgatg 
ccaaaaaaaa 



cagcagcgcg 
gaagaattcc 
atgtcagaag 
gggttcagtt 
tagatggagt 
atgaaaagaa 
aaaaaaaaag 



aggtgctgcg 
ttggactccg 
gtgcatttta 
ttcttcagaa 
aaaggtccag 
agatttttga 
s 



gcgctgcgta 
gcgtcgagt c 
ccttttgaca 
gaaggacttc 
gttcacacta 
gactgcagcc 



gaagtatcaa 
agctgaaaga 
aggagactgg 
ggaatgcact 
gaaggccaaa 
tattaataaa 



60 
120 
180 
240 
300 
360 
391 



<210> 452 
<211> 427 
<212> DNA 
<213> Homo sapiens 



<400> 452 

cccartcaga 

gtacgacctc 

cagcttatgt 

gcagttctgg 

atgatgaggc 

agcgccagtg 

tctaaaaacc 

aaaaagg 



gggctgaaca 
attgccgtgt 
gctattttac 
ttcctctgat 
ttgcagcatg 
taatccccca 
agacaaggaa 



tgtccgagtt 
ccaatcatta 
caacgtcgag 
ggagggacac 
gacaccaact 
ggagaacatc 
atacccttct 



tgtctgtaac 
tggagccatg 
atgatgaatt 
gaccaagcag 
aatgctgact 
tttgacactc 
tttatgagca 



ctgtcagcaa 
ggggttggcc 
ttataagaca 
ctctcagcag 
ccacgatcct 
tgcagactgc 
gaaggaaaca 



ggccttatgt 
actactaaag 
ccttcactta 
ggctttgggg 
gccaccctgt 
tagtgttctg 
aaaaaaaaaa 



60 
120 
180 
240 
300 
360 
420 
427 



<210> 453 
<211> 438 
<212> DNA 
<213> Homo sapiens 



<400> 453 

agatgttagg 

agaggt caac 

aaataacact 

ccgaggcggg 

cctgtctcta 

gctgctcagg 

gctgagatgg 

aaaaaaaaaa 



ctttgccaaa 
tgttttgtca 
gtctggccgg 
cagatcacga 
ctaaaaatac 
aggctgaggc 
cgccactgca 
aaaaaaag 



aactggatct 
tttgtgtagt 
gatcggtggc 
ggtcaggagt 
aaaaattagc 
aggagaatcg 
ttccaggcct 



ttagtatcac 
aagaacaaaa 
tcacgcctgt 
tcgagaccag 
caggcatggt 
ctcgaacccg 
ggggggtgac 



ttgagcactc 
taaagtgtgt 
aatcccagca 
cctgaccaac 
ggtgcacacc 
ggaggtggag 
agagcaagac 



attggacaca 
ggcattaaag 
ctttgggagg 
atgatgaaac 
tgtagtccca 
gttgcagtga 
t cagtctcag 



60 
120 
180 
240 
300 
360 
420 
438 



<210> 454 
<211> 394 
<212> DNA 
<213> Homo sapiens 



<400> 454 

agtctgaaga 

cagccggttg 

cactttgcac 

tttcacagag 

caacaggaaa 

cttccgcaaa 

ttaacataac 



tggcggcctc 
cttttgtgag 
agttcggcca 
gtttgggttg 
atcatattat 
catctgatga 
tgamaaaaaa 



agcagcgcga 
aagaattcct 
tgtcagaagg 
ggttcagttt 
agatggagta 
tgaaaagaaa 
aaaaaaaaaa 



ggtgctgcgg 
tggactgcgg 
tgcattttac 
tcttcagaag 
aaggtccagg 
gatttttgag 
aggc 



cgctgcgtag 
cgtcgagtca 
cttttgacaa 
aaggacttcg 
ttcacactag 
actgcagcct 



aagtatcaat 
gctgaaagaa 
ggagactggc 
gaatgcacta 
aaggccaaaa 
attaataaag 



60 
120 
180 
240 
300 
360 
394 



<210> 455 
<211> 713 
<212> DNA 
<213> Homo sapiens 



<400> 455 
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atataggcgt 

ttacgcgcam. 

tatggaccgc 

acagccctga 

ctccggtggc 

gagtcggccc 

aaggtaaaga 

gcagcaccag 

gagtcagccg 

gacggaagca 

tggctgcctc 

ggagacgcgc 



ggctgaatac 
acgtacgttc 
tagcatctgt 
cacgagccgg 
ctcgctgggt 
tgaacgccca 
aaccaagtgg 
ctgtccccgc 
aatcctagcc 
gcagaccatc 
tccttccggc 
tggcgtcctc 



gggcctcctg 
ctcatgaaag 
cactggccac 
gtgctctctc 
cgcgaggagg 
cgactctcag 
aagagtgttt 
cctactccgg 
ccttcccttg 
aaggctacgg 
agccttatgc 
tgctgagcag 



259 

cgtacgtgcg 
ggacgacggg 
cggtttccgg 
ttctcaccgc 
cggaggactg 
ggtccagagg 
cctcctctgg 
accgccmcaa 
ggcctgctgt 
gaggcccggg 
tggctttgtc 
ccagcggaac 



tggtgtacgt 
agctgcatga 
gagtaagcgg 
ggcccacgtc 
tactctgagg 
ccgtgagacc 
ccgtaaagca 
agactccatg 
ggtgctcgac 
gcgcttgcga 
ttaaatggaa 
tgtaccatcg 



acgtgcgtga 
aagccgaagt 
cagctacctt 
tcctcgctgg 
ccaaaagcca 
ggccggctga 
ggtactctct 
ggatggacct 
atcagtgaca 
agatgaagtt 
tcaagactgt 
ccg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
713 



<210> 456 
<211> 128 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -34 . .-1 



<400> 456 

Met Ser Leu Arg 

Leu His Ala Leu 
-15 

Leu Ala Ser Ser 
1 

Lys Glu Glu Ser 
15 

Cys lie Lys Thr 

Glu Val lie Gly 
50 

Thr Leu Lys Asp 
65 

lie Lys Lys lie 
80 



Leu Asp Thr Thr 
-30 

Gin Val Leu Leu 

Thr Lys Gly Gin 
5 

Leu Asp Ser Asp 
20 

Thr Ser Gly lie 
35 

Lys Gly Thr His 

Gly Arg Lys lie 
70 

Val Gin Lys Lys 
85 



Pro Ser Cys Asn 
-25 

Leu Leu Ser Leu 
-10 

Thr Lys Arg Asn 
10 

Leu Tyr Ala Glu 
25 

His Pro Lys Asn 
4 0 

Cys Asn Gin Val 
55 

Cys Leu Asp Pro 

Leu Ala Gly Asp 
90 



Ser Ala Arg Pro 
-20 

Leu Leu Thr Ala 
-5 

Leu Ala Lys Gly 

Leu Arg Cys Met 
30 

lie Gin Ser Leu 
45 

Glu Val lie Ala 
60 

Asp Ala Pro Arg 
75 

Glu Ser Ala Asp 



<210> 457 
<211> 65 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -18. .-1 



<400> 457 
Met Lys Ala Ala 
-15 

Gin Ala Arg His 
1 

Asp Arg Val Lys 
15 

Ser Gly Arg Asp 
Ser 



Val Leu Thr Leu 

Phe Trp Gin Gin 
5 

Asp Leu Ala Thr 
20 

Tyr Val Ser Gin 
35 



Ala Val Leu Phe 
-10 

Asp Glu Pro Pro 
10 

Val Tyr Val Asp 
25 

Phe Glu Gly Ser 
40 



Leu Thr Gly Ser 
-5 

Gin Ser Pro Trp 

Val Leu Lys Asp 
30 

Ala Trp Glu Asn 
45 



<210> 458 
<211> 80 
<212> PRT 
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<213> Homo sapiens 



260 
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<220> 

<221> SIGNAL 
<222> -26. .-1 

<220> 

<221> UNSURE 
<222> 31 

<223> Xaa = Ala,Val 



<400> 458 
Met Gly Arg Ala 
-25 

Ala Leu Leu Leu 
-10 

Thr Ser Ser Asn 
10 

Asn Pro Thr Asn 
25 

Thr Ala Ser Leu 
40 



Met Val Ala Arg 
-20 

Pro Thr Gin lie 
-5 

Ser Ser Gin Ser 

Ala Thr Thr Lys 
30 

Phe Val Val Ser 
45 



Leu Gly Leu Gly 
-15 

Tyr Ser Ser Glu 
1 

Thr Ser Asn Ser 
15 

Xaa Ala Gly Gly 

Leu Ser Leu Leu 
50 



Leu Leu Leu Leu 

Thr Thr Thr Gly 
5 

Gly Leu Ala Pro 
20 

Ala Leu Gin Ser 
35 

His Leu Tyr Ser 



<210> 459 
<211> 101 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -27. .-1 



<400> 459 
Met Ser Ser Ala 
-25 

Leu Gly Leu Leu 
-10 

Ala Glu Glu Asp 

Gin Val Arg Pro 

25 

Pro His Cys Pro 
40 

Lys lie Cys Leu 
55 

Lys Leu Leu Glu 
70 



Ala Gly Phe Cys 
-20 

Leu Leu Pro Leu 
-5 

Gly Asp Leu Gin 
10 

Arg His He Thr 

Thr Ala Gin Leu 
45 

Asp Leu Gin Ala 
60 

Ser 



Ala Ser Arg Pro 

Val Val Ala Phe 
1 

Cys Leu Cys Val 
15 

Ser Leu Glu Val 
30 

He Ala Thr Leu 

Pro Leu Tyr Lys 
65 



Gly Leu Leu Phe 
-15 

Ala Ser Ala Glu 
5 

Lys Thr Thr Ser 
20 

He Lys Ala Gly 
35 

Lys Asn Gly Arg 
50 

Lys He lie Lys 



<210> 460 
<211> 86 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -18 . . -1 



<400> 460 

Met He Pro Ala Val Val Leu Leu 
-15 

Ala Ala Leu Gly Glu Pro Gin Leu 

1 5 
Phe Leu Tyr Gly He Val Leu Thr 



Leu Leu Leu Leu Val Glu Gin Ala 
-10 -5 
Cys Tyr He Leu Asp Ala He Leu 
10 

Leu Leu Tyr Cys Arg Leu Lys He 
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15 






20 










25 










30 


Gin Val Arg 


Lys 


Ala 


Ala 


lie 


Thr 


Ser 


Tyr 


Glu 


Lys 


Ser 


Asp 


Gly 


Val 






35 










40 










45 




Tyr Thr Gly 


Leu 


Ser 


Thr 


Arg 


Asn 


Gin 


Glu 


Thr 


Tyr 


Glu 


Thr 


Leu 


Lys 




50 










55 










60 






His Glu Lys 


Pro 


Pro 


Gin 






















65 




























<210> 461 




























<211> 92 




























<212> PRT 




























<213> Homo : 


sapiens 
























<220> 




























<221> SIGNAL 


























<222> -24 . . • 


-1 


























<400> 461 




























Met Val Met 


Gly 


Leu 


Gly 


Val 


Leu 


Leu 


Leu 


Val 


Phe 


Val 


Leu 


Gly 


Leu 






-20 










-15 










-10 




Gly Leu Thr 


Pro 


Pro 


Thr 


Leu 


Ala 


Gin 


Asp 


Asn 


Ser 


Arg 


Tyr 


Thr 


His 




-5 










1 








5 








Phe Leu Thr 


Gin 


His 


Tyr 


Asp 


Ala 


Lys 


Pro 


Gin 


Gly 


Arg 


Asp 


Asp 


Arg 


10 








15 










20 










Tyr Cys Glu 


Ser 


He 


Met 


Arg 


Arg 


Arg 


Gly 


Leu 


Thr 


Ser 


Pro 


Cys 


Lys 


25 






30 










35 










40 


Asp lie Asn 


Thr 


Phe 


He 


His 


Gly Asn 


Lys 


Arg 


Arg 


Ser 


Arg 


Pro 


Ser 






45 










50 










55 




Val Lys Thr 


Arg 


Met 


Glu 


Thr 


Leu 


Thr 


Glu 


Lys 


Thr 












60 










65 

















<210> 462 
<211> 111 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -16. .-1 

<220> 

<221> UNSURE 
<222> 50 

<223> Xaa = Asp, Asn 
<400> 462 



Met 


Leu 
-15 


Leu 


He 


Leu 


Leu 


Ser 
-10 


Val 


Ala 


Leu 


Leu 


Ala 
-5 


Phe 


Ser 


Ser 


Ala 


Gin 


Asp 


Leu 


Asp 


Glu 


Asp 


Val 


Ser 


Gin 


Glu 


Asp 


Val 


Pro 


Leu 


Val 


He 


1 








5 










10 










15 




Ser 


Asp 


Gly 


Gly 
20 


Asp 


Ser 


Glu 


Gin 


Phe 
25 


He 


Asp 


Glu 


Glu 


Arg 
30 


Gin 


Gly 


Pro 


Pro 


Leu 
35 


Gly 


Gly 


Gin 


Gin 


Ser 
40 


Gin 


Pro 


Ser 


Ala 


Gly 
45 


Asp 


Gly 


Asn 


Gin 


Xaa 
50 


Asp 


Gly 


Pro 


Gin 


Gin 
55 


Gly 


Pro 


Pro 


Gin 


Gin 
60 


Gly 


Gly 


Gin 


Gin 


Gin 


Gin 


Gly 


Pro 


Pro 


Pro 


Pro 


Gin 


Gly 


Lys 


Pro 


Gin 


Gly 


Pro 


Pro 


Pro 


65 










70 










75 










80 


Gin 


Gly 


Gly 


Arg 


Pro 
85 


Gin 


Gly 


Pro 


Pro 


Gin 
90 


Gly 


Gin 


Ser 


Pro 


Gin 
95 





<210> 463 
<211> 106 
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<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -16. .-1 



<400> 463 
Met Leu Leu lie 
-15 

Gin Asp Leu Asp 
1 

Ser Asp Gly Gly 
20 

Pro Pro Leu Gly 
35 

Gin Asp Asp Gly 
50 

Gin Gin Gly Pro 
65 

Gin Gly Gly Gin 



Leu Leu Ser Val 
-10 

Glu Asp Val Ser 
5 

Asp Ser Glu Gin 

Gly Gin Gin Ser 
40 

Pro Gin Gin Gly 

55 

Pro Pro Pro Gin 
70 

Ser Cys Cys Cys 
85 



Ala Leu Leu Ala 
-5 

Gin Glu Asp Val 
10 

Phe lie Asp Glu 
25 

Gin Pro Ser Ala 

Pro Pro Gin Gin 
60 

Gly Lys Pro Gin 
75 

Asp Lys 
90 



Phe Ser Ser Ala 

Pro Leu Val lie 
15 

Glu Arg Gin Gly 
30 

Gly Asp Gly Asn 
45 

Gly Gly Gin Gin 

Gly Pro Pro Gin 
80 



<210> 464 
<211> 141 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -24 . .-1 



<400> 464 

Met Pro Ser Ser Val Ser Trp Gly lie Leu Leu Leu Ala Gly Leu Cys 

-20 *" -15 -10 

Cys Leu Val Pro Val Ser Leu Ala Glu Asp Pro Gin Gly Asp Ala Ala 

-5 15 
Gin Lys Thr Asp Thr Ser His His Asp Gin Asp His Pro Thr Phe Asn 

10 15 20 

Lys lie Thr Pro Asn Leu Ala Glu Phe Ala Phe Ser Leu Tyr Arg Gin 
25 30 35 40 

Leu Ala His Gin Ser Asn Ser Thr Asn lie Phe Phe Ser Pro Val Ser 

45 50 55 

lie Ala Thr Ala Phe Ala Met Leu Ser Leu Gly Thr Lys Ala Asp Thr 

60 65 70 

His Asp Glu lie Leu Glu Gly Leu Asn Phe Asn Leu Thr Glu lie Pro 

75 80 85 

Glu Ala Gin lie His Glu Gly Phe Gin Glu Leu Leu Arg Thr Leu Asn 

90 95 100 

Gin Pro Asp Ser Gin Leu Gin Leu Thr Thr Gly Lys Asn 
105 110 115 



<210> 465 
<211> 124 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -30. .-1 



<400> 465 

Met Pro Ala Cys Arg Leu Gly Pro Leu Ala Ala Ala Leu Leu Leu Ser 
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-30 -25 
Leu Leu Leu Phe Gly Phe 
-10 

Thr Gly Val Cys Pro Glu 
5 

Cys Val Ser Asp Ser Glu 
20 

Gly Cys Ala Thr Phe Cys 
35 40 
Pro Gin Val Asn lie Asn 
55 

Cys Gin Val Asp Ser Gin 
70 

Gly Cys Gly Lys Val Ser 
85 



263 

-20 

Thr Leu Val Ser Gly Thr 
-5 

Leu Gin Ala Asp Gin Asn 
10 

Cys Ala Asp Asn Leu Lys 
25 " 30 

Ser Leu Pro Asn Asp Lys 
45 

Phe Pro Gin Leu Gly Leu 
60 

Cys Pro Gly Gin Met Lys 
75 

Cys Val Thr Pro Asn Phe 
90 



PCT/IB01/00914 

-15 



Gly 


Ala 


Glu 


Lys 






1 




Cys 


Thr 


Gin 


Glu 


15 








Cys 


Cys 


Ser 


Ala 


Glu 


Gly 


Ser 


Cys 








50 


Cys 


Arg 


Asp 


Gin 






65 




Cys 


Cys 


Arg 


Asn 




80 







<210> 466 
<211> 124 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -30. .-1 



<400> 466 

Met Pro Ala Cys 

-30 

Leu Leu Leu Phe 

Thr Gly Val Cys 
5 

Cys Val Ser Asp 
20 

Gly Cys Ala Thr 
35 

Pro Gin Val Asn 

Cys Gin Val Asp 
7 0 

Gly Cys Gly Lys 
85 



Arg Leu Gly Pro 
-25 

Gly Phe Thr Leu 
-10 

Pro Glu Leu Gin 
10 

Ser Glu Cys Ala 
25 

Phe Cys Ser Leu 
40 

lie Asn Phe Pro 
55 

Ser Gin Cys Pro 

Val Ser Cys Val 
90 



Leu Ala Ala Ala 
-20 

Val Ser Gly Thr 
-5 

Ala Asp Gin Asn 

Asp Asn lie Lys 
30 

Pro Asn Asp Lys 
4 5 

Gin Leu Gly Leu 
60 

Gly Gin Met Lys 
75 

Thr Pro Asn Phe 



Leu 


Leu 


Leu 


Ser 








-15 


Gly 


Ala 


Glu 


Lys 






1 




Cys 


Thr 


Gin 


Glu 


15 








Cys 


Cys 


Ser 


Ala 


Glu 


Gly 


Ser 


Cys 








50 


Cys 


Arg 


Asp 


Gin 






65 




Cys 


Cys 


Arg 


Asn 




80 







<210> 467 
<211> 69 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -17. .-1 



<400> 467 
Met Phe Pro Gly 
-15 

Ala Ala His Leu 
1 

Lys Val Cys Ser 

Glu Gly Ala Asn 
35 

His His Lys Gly 
50 



Phe Val Phe Leu 
-10 

Trp Val Leu Ala 
5 

Phe Thr Pro Glu 
20 

Asn Ser Arg Arg 
Leu 



Leu Phe Val lie 

Ala Phe Met Gly 
10 

Ala Ser Lys Thr 
25 

Thr Ala Phe Lys 
40 



Ser Leu Ala Ala 
-5 

Arg lie Thr Val 
15 

Val Ser Pro Pro 
30 

Ser Cys Asn Thr 
45 
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<210> 468 
<211> 77 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -33. .-1 



<400> 468 
Met Gly Pro Val 
-30 

He Ala Thr He 
-15 

Ala Phe Trp Trp 
1 

Gin Ser Leu Ala 

Arg Asp Ala Val 
35 



Lys Gin Leu Lys 

Met Val Leu Leu 
-10 

His Asn Met Gly 
5 

Leu Thr Trp Tyr 
20 

Lys Lys Cys Phe 



Arg Met Phe Glu 
-25 

Ser Phe Ala Leu 

Leu Ala Leu He 
10 

Ser Leu Ser Phe 
25 

Ala Val Cys Leu 
40 



Pro Thr Arg Leu 
-20 

Thr Leu Cys Ser 
-5 

Phe Cys He Leu 
15 

He Pro Phe Ala 
30 

Ala 



<210> 469 
<211> 146 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -26. .-1 



<400> 469 
Met Thr Met Arg 
-25 

Trp Ala Phe Cys 
-10 

Phe Ser Gin Pro 
10 

Gin Glu His He 
25 

Ala Glu Met Ser 
40 

Asp Tyr Asp Gly 
55 

He Thr His Val 

Ser Glu Asp Glu 
90 

Asp Lys Asn Asn 
105 

Leu Gin 
120 



Ser Leu Leu Arg 
-20 

Ala Pro Gly Ala 
-5 

Gly Ser Met Gly 

Met Glu His Leu 
30 

Pro Gin Glu Leu 
45 

Asn Asn Leu Leu 
60 

His Lys Glu Glu 
75 

Leu He Asn He 

Asp Gly Tyr He 
110 



Thr Pro Phe Leu 
-15 

Arg Ala Glu Glu 
1 

Leu Asp Lys Asn 
15 

Glu Gly Val He 

Gin Leu His Tyr 
50 

Asp Gly Leu Glu 
65 

Gly Ser Glu Gin 
80 

He Asp Gly Val 
95 

Asp Tyr Ala Glu 



Cys Gly Leu Leu 

Pro Ala Ala Ser 
5 

Thr Val His Asp 
20 

Asn Lys Pro Glu 
35 

Phe Lys Met His 

Leu Ser Thr Ala 
70 

Ala Pro Leu Met 
85 

Leu Arg Asp Asp 
100 

Phe Ala Lys Ser 
115 



<210> 470 
<211> 62 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -46. .-1 



<400> 470 
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Met Asp Gin Leu Val Phe Lys Glu Thr lie Trp Asn Asp Ala Phe Trp 

-45 -40 -35 

Gin Asn Pro Trp Asp Gin Gly Gly Leu Ala Val lie lie Leu Phe He 

-30 " -25 -20 -15 

Thr Ala Val Leu Leu Leu He Leu Phe Ala He Val Phe Gly Leu Leu 

-10 -5 1 

Thr Ser Thr Glu Asn Thr Gin Cys Glu Ala Gly Glu Glu Glu 

5 10 15 



<210> 471 
<211> 162 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -47 . .-1 



<400> 471 
Met Arg He Ala 
-45 

Gly Ala Gly Trp 
-30 

Thr Ser Leu Ala 
-15 

Met Gin Met Tyr 
5 

Gin Gly Gly Leu 
20 

Phe Asn Asn Leu 
35 

lie Phe Pro Glu 
50 

Gly Leu He His 

Val Gly Leu Tyr 
85 

Ala Ala Pro Val 
100 

Arg Asn 
115 



Asn Arg Thr Arg 
-40 

Thr His Gly Arg 
-25 

Leu Ser Ser Leu 
-10 

Ser Arg Gin Leu 

Leu Gly Ser Gly 
25 

Glu Asn Leu Val 
40 

He Leu Leu Cys 
55 

Arg Val Cys Val 
70 

Tyr He Asn Lys 

Leu Thr Pro Ala 
105 



Phe Ser Ser Pro 

Gly Met Met Val 
-20 

Leu Ser Leu Leu 
-5 

Ala Ser Thr Glu 
10 

Leu Phe Val Phe 

Phe Gly Lys Gly 
45 

Leu Leu Leu Ala 
60 

Thr Thr Cys Phe 
75 

He Ser Ser Thr 
90 

Lys Val Thr Gly 



Phe Leu Ala Arg 
-35 

Val Gly Thr Gly 

Leu Phe Ala Gly 
1 

Trp Leu Thr He 

15 

Ser Leu Thr Ala 
30 

Phe Gin Ala Lys 

Leu Phe Ala Ser 
65 

He Phe Ser Met 
80 

Leu Tyr Gin Ala 

95 

Lys Ser Lys Lys 
110 



<210> 472 
<211> 90 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -35. .-1 



<400> 472 

Met Glu Thr Gly 

-35 

Met Arg Arg He 

Ser Trp Ala Ala 
1 

He Val Ser Asp 
15 

Phe Gin Leu Leu 
30 

Ser Met Cys Phe 



Ala Ser Ala Ser 
-30 

Trp Ser Leu Gly 
-15 

Leu Phe His Asp 
5 

Leu Ser Thr Val 
20 

Asp Val Trp Thr 
35 

Val Phe Leu Leu 



He Pro Glu Leu 
-25 

Leu Gly Leu Val 
-10 

Gly Phe Ala Val 

Arg Phe Val Ala 
25 

Arg Asn Phe Arg 
40 

Leu Pro 



He Cys Glu Ala 
-20 

Thr Leu Thr Ala 
-5 

Leu Gly Gly Asn 
10 

Gin Gin Gin His 

Lys Pro Leu Gly 
45 
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55 
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<210> 473 
<211> 61 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -25. .-1 

<220> 

<221> UNSURE 
<222> 27 

<223> Xaa = Ala, Val 



<400> 473 



Met Arg Leu 


Leu 


Gin 


Leu 


Leu 


Phe 


Arg 


Ala 


Ser 


Pro 


Ala 


Thr 


Leu 


Leu 


-25 






-20 










-15 










-10 


Leu Val Leu 


Cys 


Leu 


Gin 


Leu 


Gly Ala Asn 


Lys 


Ala 


Gin 


Asp 


Asn 


Thr 






-5 










1 








5 






Arg Lys lie 


He 


He 


Lvs 

Jr 


Asn 


Phe 


Asp 


He 


Pro 


Lys 


Ser 


Val 


Arg 


Pro 


10 










15 










20 








Asn Asp Glu 


Xaa 


Leu 


Gin 


Cys 


Leu 


Gin 


Phe 


Lys 


Gin 


Asn 








25 








30 










35 










<210> 474 




























<211> 120 




























<212> PRT 




























<213> Homo sapiens 
























<220> 




























<221> SIGNAL 


























<222> -32. .- 


-1 


























<400> 474 




























Met Ala Ser 


Pro 


Lys 


Gly 


Phe 


Phe 


Asn 


Tyr 


Leu 


Thr 


Tyr 


Phe 


Leu 


Ala 


-30 










-25 










-20 








Ala Gly Ala 


Val 


Thr 


Leu 


Gly 


He 


Gly 


Phe 


Phe 


Ala 


Leu 


Ala 


Ser 


Ala 


-15 








-10 










-5 










Leu Trp Phe 


Leu 


He 


Cys 


Lys 


Arg 


Arg 


Glu 


He 


Phe 


Gin 


Asn 


Ser 


Lys 


1 




5 










10 










15 




Phe Lys Ala 


He 


Asp 


Glu 


Arg 


Cys 


Arg 


Gin 


Arg 


Pro 


Ser 


Met 


Ala 


Lys 




20 










25 










30 






lie Lys Ser 


His 


Ser 


Gin 


Cys 


Val 


Phe 


lie 


Ser 


Arg 


Asn 


Phe 


His 


Thr 


35 










40 










45 








Gly Arg Phe 


Gin 


Leu 


Gin 


Gin 


Leu 


Lys 


He 


He 


Leu 


Lys 


Met 


Asn 


Pro 


50 








55 










60 










Asn Leu Gin 


Gin 


Lys 


He 


Ser 


Phe 


Val 


He 


Pro 


Gin 


Arg 


Pro 


Ala 


Pro 


65 






70 










75 










80 


Gin Gin He 


Ala 


Ala 


Val 


Leu 


His 






















85 

























<210> 475 
<211> 78 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -19. . -1 



<400> 475 
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Met Lys Phe 


Phe 


Val 


Phe 


Ala 


Leu 


Val 


Leu 


Ala 


Leu 


Met 


lie 


Ser 


Met 






-15 










-10 










-5 




lie Ser Ala 


Asp 


Ser 


His 


Glu 


Lys 


Arg 


His 


His 


Gly 


Tyr 


Arg 


Arg 


Lys 




1 








5 










10 








Phe His Glu 


Lys 


His 


His 


Ser 


Tyr 


His 


He 


Thr 


Leu 


Leu 


Pro 


Leu 


Phe 


15 








20 










25 










Glu Glu Ser 


Ser 


Lys 


Ser 


Asn 


Ala 


Asn 


Glu 


Lys 


His 


Tyr 


Asn 


Leu 


Leu 


30 






35 










40 










45 


Tyr Thr Leu 


Cys 


Phe 


Arg 


He 


Leu 


Ala 


Phe 


Ser 


He 


Val 


Thr 










50 










55 














<210> 476 




























<211> 87 




























<212> PRT 




























<213> Homo sapiens 
























<220> 




























<221> SIGNAL 


























<222> -20. 


-1 


























<400> 476 




























Met Gin Lys 


Val 


Thr 


Leu 


Gly 


Leu 


Leu 


Val 


Phe 


Leu 


Ala 


Gly 


Phe 


Pro 


-20 






-15 










-10 










-5 


Val Leu Asp 


Ala 


Asn 


Asp 


Leu 


Glu 


Asp 


Lys 


Asn 


Ser 


Pro 


Phe 


Tyr 


Tyr 






1 








5 










10 






Asp Trp His 


Ser 


Leu 


Gin 


Val 


Gly 


Gly 


Leu 


lie 


Cys 


Ala 


Gly 


Val 


Leu 


15 










20 










25 








Cys Ala Met 


Gly 


He 


He 


He 


Val 


Met 


Ser 


Ala 


Lys 


Cys 


Lys 


Cys 


Lys 


30 








35 










40 










Phe Gly Gin 


Lys 


Ser 


Gly 


His 


His 


Pro 


Gly 


Glu 


Thr 


Pro 


Pro 


Leu 


lie 


45 






50 










55 










60 


Thr Pro Gly 


Ser 


Ala 


Gin 


Ser 
























65 
























<210> 477 




























<211> 105 




























<212> PRT 




























<213> Homo : 


sapiens 
























<220> 




























<221> SIGNAL 


























<222> -19. .• 


-1 


























<220> 




























<221> UNSURE 


























<222> 37 




























<223> Xaa = 


* 


t Lys 
























<400> 477 




























Met Arg Gly 


Ala 


Thr 


Arg 


Val 


Ser 


lie 


Met 


Leu 


Leu 


Leu 


Val 


Thr 


Val 






-15 










-10 










-5 




Ser Asp Cys 


Ala 


Val 


He 


Thr 


Gly Ala 


Cys 


Glu 


Arg 


Asp 


Val 


Gin 


Cys 




1 








5 










10 








Gly Ala Gly 


Thr 


Cys 


Cys 


Ala 


lie 


Ser 


Leu 


Trp 


Leu 


Arg 


Gly 


Leu 


Arg 


15 








20 










25 










Met Cys Thr 


Pro 


Leu 


Gly 


Arg 


Xaa 


Gly 


Glu 


Glu 


Cys 


His 


Pro 


Gly 


Ser 


30 






35 










40 










45 


His Lys lie 


Pro 


Phe 


Phe 


Arg 


Lys 


Arg 


Lys 


His 


His 


Thr 


Cys 


Pro 


Cys 






50 










55 










60 




Leu Pro Asn 


Leu 


Leu 


Cys 


Ser 


Arg 


Phe 


Pro 


Asp 


Gly 


Arg 


Tyr 


Arg 


Cys 




65 










70 










75 






Ser Met Asp 


Leu 


Lys 


Asn 


lie 


Asn 


Phe 

















80 85 
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<210> 478 
<211> 102 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -36. .-1 

<220> 

<221> UNSURE 
<222> 31 

<223> Xaa = His, Arg 



<400> 478 



Met 


Ser 
-35 


Asn 


Thr 


Thr 


Val 


Pro 
-30 


Asn 


Ala 


Pro 


Gin 


Ala 
-25 


Asn 


Ser 


Asp 


Ser 


Met 


Val 


Gly 


Tyr 


Val 


Leu 


Gly 


Pro 


Phe 


Phe 


Leu 


He 


Thr 


Leu 


Val 


Gly 


-20 










-15 










-10 










-5 


Val 


Val 


Val 


Ala 


Val 
1 


Val 


Met 


Tyr 


Val 
5 


Gin 


Lys 


Lys 


Lys 


Arg 
10 


Val 


Asp 


Arg 


Leu 


Arg 
15 


His 


His 


Leu 


Leu 


Pro 
20 


Met 


Tyr 


Ser 


Tyr 


Asp 
25 


Pro 


Ala 


Glu 


Glu 


Leu 
30 


Xaa 


Glu 


Ala 


Glu 


Gin 
35 


Glu 


Leu 


Leu 


Ser 


Asp 
40 


Met 


Gly 


Asp 


Pro 


Lys 


Val 


Val 


His 


Gly 


Trp 


Gin 


Ser 


Gly 


Tyr 


Gin 


His 


Lys 


Arg 


Met 


Pro 


45 










50 










55 










60 


Leu 


Leu 


Asp 


Val 


Lys 


Thr 























65 



<210> 479 
<211> 137 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -18. .-1 



<400> 479 



Met 


Lys 


Ala 


Leu 
-15 


He 


Val 


Leu 


Gly 


Leu 
-10 


Val 


Leu 


Leu 


Ser 


Val 
-5 


Thr 


Val 


Gin 


Gly 


Lys 
1 


Val 


Phe 


Glu 


Arg 
5 


Cys 


Glu 


Leu 


Ala 


Arg 
10 


Thr 


Leu 


Lys 


Arg 


Leu 


Gly 


Met 


Asp 


Gly 


Tyr 


Arg 


Gly 


He 


Ser 


Leu 


Ala 


Asn 


Trp 


Met 


Cys 


15 










20 










25 










30 


Leu 


Ala 


Lys 


Trp 


Glu 

35 


Ser 


Gly 


Tyr 


Asn 


Thr 
40 


Arg 


Ala 


Thr 


Asn 


Tyr 
45 


Asn 


Ala 


Gly 


Asp 


Arg 

50 


Ser 


Thr 


Asp 


Tyr 


Gly 
55 


He 


Phe 


Gin 


He 


Asn 
60 


Ser 


Arg 


Tyr 


Trp 


Cys 


Asn 


Asp 


Gly 


Lys 


Thr 


Pro 


Gly Ala 


Val 


Asn 


Ala 


Cys 


His 






65 










70 










75 








Leu 


Ser 
80 


Cys 


Ser 


Gly 


Trp 


His 
85 


Gly 


Glu 


He 


Val 


Val 
90 


Lys 


Thr 


Glu 


Met 


Ser 


Val 


Ser 


Met 


Phe 


Lys 


Val 


Val 


Glu 


Cys 


Asn 


Ser 


Arg 


He 


Phe 


Leu 


95 










100 










105 










110 


Leu 


Gin 


Leu 


He 


Leu 


Ser 


Leu 


Ser 


His 

















115 



<210> 480 
<211> 101 
<212> PRT 
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<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -29. .-1 

<400> 480 



Met 


Arg 


Glu 


Glu 


Lys 
-25 


Lys 


Pro 


Phe 


Glu 


Arg 
-20 


Glu 


Arg 


Glu Ser 


Val 
-15 


Cys 


Val 


Cys 


Met 


Cys 
-10 


Val 


Phe 


Ser 


Thr 


Gin 
-5 


Gly 


Ala 


Leu 


Gly Glu 
1 


Met 


Ala 


Ala 


His 
5 


Phe 


He 


Asp 


Glu 


Lys 
10 


Leu 


Arg 


Pro 


Ser 


Glu 
15 


Gly Asn 


Gly 


His 


Arg 


Gly 


Thr 


Leu 


Asp 


Ser 


Leu 


Ser 


Ser 


Asp 


Gin 


Glu 


Ser Tyr 


He 


Pro 


20 










25 










30 








35 


Ser 


Thr 


Ala 


Asp 


Pro 
40 


Thr 


Gin 


Ala 


Gly 


Pro 
45 


Glu 


Leu 


Leu His 


Lys 
50 


Asn 


Leu 


Pro 


Val 


Thr 
55 


Ser 


Arg 


Ser 


Gin 


Pro 
60 


Leu 


Pro 


Ser 


Asp Leu 
65 


Ala 


He 


Pro 


Ala 


Ala 
70 


Ala 


Leu 























<210> 481 




























<211> 128 




























<212> PRT 




























<213> Homo sapiens 
























<220> 




























<221> SIGNAL 


























<222> -26. . ■ 


-1 


























<400> 481 




























Met Cys Trp 


Leu 


Arg 


Ala 


Trp 


Gly 


Gin 


He 


Leu 


Leu 


Pro 


Val 


Phe 


Leu 


-25 








-20 










-15 










Ser Leu Phe 


Leu 


He 


Gin 


Leu 


Leu 


He 


Ser 


Phe 


Ser 


Glu 


Asn 


Gly 


Phe 


-10 






-5 










1 








5 




He His Ser 


Pro 


Arg 


Asn 


Asn 


Gin 


Lys 


Pro 


Arg 


Asp 


Gly 


Asn 


Glu 


Glu 




10 










15 










20 






Glu Cys Ala 


Val 


Lys 


Lys 


Ser 


Cys 


Gin 


Leu 


Cys 


Thr 


Glu 


Asp 


Lys 


Lys 


25 










30 










35 








Cys Val Trp 


Cys 


Ser 


Glu 


Glu 


Lys 


Ala 


Cys 


Lys 


Lys 


Tyr 


Cys 


Phe 


Pro 


40 








45 










50 










Tyr Phe Gly 


Cys 


Arg 


Phe 


Ser 


Ser 


He 


Tyr 


Trp 


Leu 


Asn 


Cys 


Lys 


Val 


55 






60 










65 










70 


Asp Met Phe 


Gly 


He 


Met 


Met 


Leu 


Leu 


Leu 


He 


Ala 


Val 


Leu 


lie 


Thr 






75 










80 










85 




Gly Phe Val 


Trp 


Tyr 


Cys 


Cys 


Ala 


Tyr 


His 


Phe 


Tyr 


Leu 


Gin 


Asp 


lie 



90 95 100 



<210> 482 
<211> 142 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -24 . . -1 

<220> 

<221> UNSURE 
<222> 80 

<223> Xaa = Leu, Arg, Trp 
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<220> 

<221> UNSURE 
<222> 79 

<223> Xaa = Lys,Asn 
<400> 482 



Met 


Arg 


Leu 


Ser 


Trp 


Phe 


Arg 


Val 


Leu 


Thr 


Val 


Leu 


Ser 


lie 


Cys 


Leu 










-20 










-15 










-10 




Ser 


Ala 


Val 


Ala 


Thr 


Ala 


Thr 


Gly 


Ala 


Glu 


Gly 


Lys 


Arg 


Lys 


Leu 


Gin 








-5 










1 








5 








He 


Gly 


Val 


Lys 


Lys 


Arg 


Val 


Asp 


His 


Cys 


Pro 


He 


Lys 


Ser 


Arg 


Lys 




10 










15 










20 










Gly 


Asp 


Val 


Leu 


His 


Met 


His 


Tyr 


Thr 


Gly 


Lys 


Leu 


Glu 


Asp 


Gly 


Thr 


25 










30 










35 










40 


Glu 


Phe 


Asp 


Ser 


Ser 


Leu 


Pro 


Gin 


Asn 


Gin 


Pro 


Phe 


Val 


Phe 


Ser 


Leu 








45 










50 










55 




Gly 


Thr 


Gly 


Gin 


Val 


He 


Lys 


Gly 


Trp 


Asp 


Gin 


Gly 


Leu 


Leu 


Gly 


Met 








60 










65 










70 






Cys 


Glu 


Gly 


Glu 


Lys 


Arg 


Xaa 


Xaa 


Val 


He 


Pro 


Ser 


Glu 


Leu 


Gly 


Tyr 






75 










80 










85 








Gly 


Glu 


Arg 


Gly 


Ala 


Pro 


Pro 


Lys 


He 


Pro 


Gly 


Gly 


Ala 


Thr 


Leu 


Val 




90 










95 










100 










Phe 


Glu 


Val 


Glu 


Leu 


Leu 


Lys 


He 


Glu 


Arg 


Arg 


Thr 


Glu 


Leu 






105 










110 










115 













<210> 483 
<211> 70 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -24. .-1 



<400> 483 



Met 


Ala 


Ser 


Ser 


Thr 
-20 


Ser 


Val 


Ser 


Thr 


Gly 
-15 


Gin 


lie 


Leu 


Leu 


Thr 
-10 


Pro 


Ala 


Cys 


Leu 
-5 


Gly 


Ser 


Trp 


Ala 


Glu 
1 


He 


Arg 


Ser 


Pro 
5 


Val 


Arg 


He 


Ser 
10 


He 


Ala 


Ser 


Asp 


Phe 
15 


Pro 


Thr 


Ala 


Arg 


Val 
20 


Ser 


Leu 


Trp 


Pro 


Pro 


Ala 


Pro 


Gly 


Met 


Val 


Pro 


He 


Lys 


He 


Ser 


Gly 


Cys 


Ala 


25 










30 










35 










Trp 


Ala 


Phe 


Ser 


Pro 
45 


Ala 





















<210> 484 
<211> 55 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -23. .-1 



<400> 484 
Met Ala Gly Pro 
-20 

Ala Ala Ala Leu 
-5 

Gin Met Pro Thr 
10 

Ser Tyr Thr Ala 



Ala 


Ala 


Ala 


Phe 


Gly 


Phe 


Ala 


Ser 








1 


Gly 


Arg 


Ser 


Cys 




15 






Trp 


Pro 


Cys 





Arg 


Arg 


Leu 


Gly 


-15 








Tyr 


Gly 


Ala 


His 

5 


Leu 


Thr 


Arg 


Pro 



Ala Leu Ser Gly 
-10 

Gly Ala Asn Ser 

Thr Asn Thr Thr 
25 
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30 

<210> 485 
<211> 55 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -23. .-1 

<400> 485 

Met Ala Gly Pro Ala Ala Ala 
-20 

Ala Ala Ala Leu Gly Phe Ala 
-5 

Gin Met Pro Thr Gly Arg Ser 
10 15 
Ser Tyr Thr Ala Trp Pro Cys 
30 

<210> 486 
<211> 54 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -17. .-1 

<400> 486 
Met Phe Arg Arg 
-15 

Gly lie Ala Gly 
1 

Ala Lys Asp Gin 

Ser Glu Glu Lys 
35 

<210> 487 
<211> 83 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -15. .-1 

<400> 487 



Met 


Arg 


Leu 


Phe 


Leu 


Ser 


Leu 


Pro 


Val 


Leu 


Val 


Val 


Val 


Leu 


Ser 


He 


-15 










-10 










-5 










1 


Val 


Leu 


Glu 


Gly 


Pro 


Ala 


Pro 


Ala 


Gin 


Gly 


Thr 


Pro 


Asp 


Val 


Ser 


Ser 








5 










10 










15 






Ala 


Leu 


Asp 


Lys 


Leu 


Lys 


Glu 


Phe 


Gly 


Asn 


Thr 


Leu 


Glu 


Asp 


Lys 


Ala 






20 










25 










30 








Arg 


Glu 


Leu 


He 


Ser 


Arg 


He 


Lys 


Gin 


Ser 


Glu 


Leu 


Ser 


Ala 


Lys 


Met 




35 










40 










45 










Arg 


Glu 


Trp 


Phe 


Ser 


Glu 


Thr 


Phe 


Gin 


Lys 


Val 


Lys 


Asp 


Lys 


Leu 


Lys 



50 55 60 65 

He Asp Ser 
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Phe Arg Arg 

-15 
Ser Tyr Gly 
1 

Cys Leu Thr 



Leu Gly Ala 

Ala His Gly 
5 

Arg Pro Thr 
20 



Leu Ser Gly 
-10 

Ala Asn Ser 

Asn Thr Thr 
25 



Leu Thr Phe Ala Gin Leu 
-10 

Gly Val Tyr He Phe Gin 
5 

Lys Glu Leu Lys Glu Lys 
20 25 
Lys Ser 



Leu Phe Ala Thr Val Leu 
-5 

Pro Val Phe Glu Gin Tyr 
10 15 
Met Gin Leu Val Gin Glu 
30 



<210> 488 
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<211> 98 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 

<222> -43. .-1 

<400> 488 



Met Phe Arg 


He 


Glu 


Gly 


Leu 


Ala 


Pro 


Lys 


Leu 


Asp 


Pro 


Glu 


Glu 


Met 




-40 










-35 










-30 






Lys Arg Lys 


Met 


Arg 


Glu 


Asp 


Val 


He 


Ser 


Ser 


He 


Arg 


Asn 


Phe 


Leu 


-2 5 










-20 










-15 








He Tyr Val 


Ala 


Leu 


Leu 


Arg 


Val 


Ser 


Glu 


Cys 


Leu 


Pro 


Gly 


Cys 


Asp 


-10 








-5 










1 








5 


Cys Asp Thr 


Ser 


Gly 


Glu 


Leu 


Thr 


Asp 


Gly 


His 


Pro 


Leu 


Thr 


Leu 


Arg 






10 










15 










20 




Gly His Arg 


Gly 


Leu 


Arg 


Thr 


Glu 


Leu 


Asn 


Gly 


Ser 


Gly 


Glu 


Gin 


Gly 




25 










30 










35 






Gly Ser He 


Tyr 


Leu 


Lys 


Glu 


He 


Gly 


Gin 


His 


Met 


Lys 


Thr 


Gly 


His 


40 










45 










50 








His He 




























55 




























<210> 489 




























<211> 82 




























<212> PRT 




























<213> Homo sapiens 
























<220> 




























<221> SIGNAL 


























<222> -43. 


-1 


























<400> 489 




























Met Ala Lys 


Tyr 


Gin 


Gly 


Glu 


Val 


Gin 


Ser 


Leu 


Lys 


Leu 


Asp 


Asp 


Asp 




-40 










-35 










-30 






Ser Val He 


Glu 


Gly 


Val 


Ser 


Asp 


Gin 


Val 


Leu 


Val 


Ala 


Val 


Val 


Val 


-25 










-20 










-15 








Ser Phe Ala 


Leu 


He 


Ala 


Thr 


Leu 


Val 


Tyr 


Ala 


Leu 


Phe 


Arg 


Asn 


Val 


-10 








-5 










1 








5 


His Gin Asn 


He 


His 


Pro 


Glu 


Asn 


Gin 


Glu 


Leu 


Val 


Arg 


Val 


Leu 


Arg 






10 










15 










20 




Glu Gin Leu 


Gin 


Thr 


Glu 


Gin 


Asp 


Ala 


Pro 


Ala 


Asp 


Ser 


Thr 


Ala 


Val 




25 










30 










35 






Leu His 




























<210> 490 




























<211> 99 




























<212> PRT 




























<213> Homo ; 


sapiens 
























<220> 




























<221> SIGNAL 


























<222> -28 . . 


-1 


























<400> 490 




























Met Thr Ser 


Ala 


Leu 


Thr 


Gin 


Gly 


Leu 


Glu 


Arg 


He 


Pro 


Asp 


Gin 


Leu 




-25 










-20 










-15 






Gly Tyr Leu 


Val 


Leu 


Ser 


Glu 


Gly 


Ala 


Val 


Leu 


Ala 


Ser 


Ser 


Gly 


Asp 


-10 










-5 










1 








Leu Glu Asn 


Asp 


Glu 


Gin 


Ala 


Ala 


Ser 


Ala 


He 


Ser 


Glu 


Leu 


Val 


Ser 


5 






10 










15 










20 


Thr Ala Cys 


Gly 


Phe 


Arg 


Leu 


His 


Arg 


Gly 


Met 


Asn 


Val 


Pro 


Phe 


Lys 
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25 

Arg Leu Ser Val Val Phe Gly Glu His 

40 45 
Gly Gin Arg Val Phe Val Val Lys Arg 

55 60 
lie Asp Val 
70 
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30 35 
Thr Leu Leu Val Thr Val Ser 
50 

Gin Asn Arg Gly Arg Glu Pro 
65 



<210> 491 
<211> 186 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -15. .-1 



<220> 

<221> UNSURE 
<222> 148 

<223> Xaa = Glu, Gin 



<400> 491 

Met His Val Met 

-15 

Val Leu Leu Gin 
5 

Thr Ser Gly Ser 
20 

Gin Ala Thr Gly 

35 

Glu Arg Pro Gly 
50 

Gin Arg lie Pro 

Ala Gin Arg His 
85 

Ala Cys Ser Thr 
100 

Leu Pro Pro Pro 
115 

Leu Leu Gin Glu 
130 

Glu Leu Xaa Gly 

Tyr Val Asp Gly 
165 



Ala Ala Ser Met 
-10 

Ala Ala Arg Gly 

Gly Glu Gly Val 
25 

Ser Arg Pro Ala 
40 

Asp Val Val Asn 
55 

Val Ser Gly Arg 
70 

Gly Val Asp Leu 

Cys His Val Tyr 
105 

Glu Glu Arg Glu 
120 

Asn Ser Arg Leu 
135 

Ala Glu Phe Thr 
150 

His Val Pro Lys 



Ala 


Arg 


Gly 


Gly 






-5 




Thr 


Trp 


Trp 


Asn 


10 








Ala 


Leu 


Gly 


Thr 


Gly 


Glu 


Glu 


Asp 








45 


Val 


Val 


Phe 


Val 






60 




Val 


Gly Asp 


Asn 




75 






Glu 


Gly 


Ala 


Cys 


90 








Val 


Ser 


Glu 


Asp 


Asp 


Asp 


Met 


Leu 








125 


Gly 


Cys 


Gin 


He 






140 




Leu 


Pro 


Lys 


He 




155 






Pro 


His 







Val Ser Ala Arg 
1 

Arg Pro Gly Gly 
15 

Thr Arg Lys Phe 
30 

Ala Gly Gly Pro 

Asp Arg Ser Gly 
65 

Val Leu His Leu 
80 

Glu Ala Ser Leu 
95 

His Leu Asp Leu 
110 

Asp Met Ala Pro 

Val Leu Thr Pro 
145 

Thr Arg Asn Phe 
160 



<210> 492 
<211> 123 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -48. .-1 



<400> 492 
Met Glu Ala Leu 
-45 

Leu Glu Asp Phe 
-30 

Val Ser Gly Leu 



Gly Lys Leu Lys 

Arg Val Lys Thr 
-25 

Leu Met Leu Leu 



Gin Phe Asp Ala 
-40 

Cys Gly Gly Ala 
Leu Phe Leu Ser 



Tyr Pro Lys Thr 
-35 

Thr Val Thr He 
-20 

Glu Leu Gin Tyr 
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-15 










-10 






Tyr 


Leu 


Thr 


Thr 


Glu 


Val 


His 


Pro 


Glu 


1 








5 










Gly 


Asp 


Lys 


Leu 


Lys 


He 


Asn 


He 


Asp 








20 










25 


Cys 


Ala 


Tyr 


Leu 


Ser 


He 


Asp 


Ala 


Met 






35 










40 




Leu 


Asp 


Val 


Glu 


His 


Asn 


Leu 


Phe 


Lys 




50 










55 






He 


Pro 


Val 


Ser 


Ser 


Glu 


Ala 


Glu 


Arg 


65 










70 
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-5 

Leu Tyr Val Asp Lys Ser Arg 
10 15 
Val Leu Phe Pro His Met Pro 
30 

Asp Val Ala Gly Glu Gin Gin 
4 5 

Gin Arg Leu Asp Lys Asp Gly 
60 

His Asp 
75 



<210> 493 
<211> 77 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -29. .-1 



<220> 

<221> UNSURE 
<222> 36 

<223> Xaa = Gin, Arg 
<400> 493 

Met Ala Ala Thr Asp Phe Val Gin Glu Met Arg Ala Val Gly Glu Arg 

-25 -20 -15 

Leu Leu Leu Lys Leu Gin Arg Leu Pro Gin Ala Glu Pro Val Glu He 

-10 ~ -5 1 

Val Ala Phe Ser Val He He Leu Phe Thr Ala Thr Val Leu Leu Leu 

5 10 15 

Leu Leu He Ala Cys Ser Cys Cys Cys Thr His Cys Cys Cys Pro Glu 
20 25 30 35 

Xaa Arg Gly Arg Lys Val Gin Val Gin Pro Thr Pro Pro 
40 45 



<210> 494 
<211> 107 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -27 . .-1 



<400> 494 
Met Leu Phe Glu 
-25 

Ser Thr Leu Ser 
-10 

Leu Leu Ser Pro 

Leu Gly Leu Ala 
25 

Pro Asp Pro Pro 
40 

Leu Arg He Leu 
55 

Gin Pro Val Trp 
70 



Glu Ala Leu Pro 
-20 

Leu Val Gin Phe 
-5 

Gly Val Glu Asn 
10 

Cys Cys Leu Leu 

Glu Gly Pro Gly 
45 

Gin Leu Pro Gly 
60 

Ser Arg Thr Pro 
75 



Leu Ser Cys Ser 

Ser Pro Ser Gly 
1 

Leu Ser Val Leu 
15 

Thr Leu Thr Met 
30 

Thr Gin Arg Val 

Ala Pro Asp Gly 
65 

Asp Leu Lys 
80 



Asp Pro Val Leu 
-15 

Arg Thr Gin Asp 
5 

Asp Val Ser Pro 
20 

Ser Cys Pro Gly 
35 

Trp Gin Gly Ala 
50 

Val Ser Pro Tyr 



WO 02/083898 



PCT/IB01/00914 



275 

<210> 495 
<211> 112 
<212> PRT 

<213> Homo sapiens 



<220> 

<22l> SIGNAL 
<222> -39. .-1 



<220> 

<221> UNSURE 
<222> -37 

<223> Xaa = * , Glu, Lys , Gin 
<400> 495 

Met Leu Xaa His Leu Ser Ser Leu Pro Thr Gin Met Asp Tyr Lys Gly 

-35 -30 -25 

Gin Lys Leu Ala Glu Gin Met Phe Gin Gly lie lie Leu Phe Ser Ala 

-20 -15 -10 

He Val Gly Phe He Tyr Gly Tyr Val Ala Glu Gin Phe Gly Trp Thr 

-5 15 
Val Tyr He Val Met Ala Gly Phe Ala Phe Ser Cys Leu Leu Thr Leu 
10 15 20 25 

Pro Pro Trp Pro He Tyr Arg Arg His Pro Leu Lys Trp Leu Pro Val 

30 35 40 

Gin Ala Gin Thr Thr Arg Asn Gin Gly Lys Glu Lys Leu Arg Gly Met 

45 50 55 

Leu Lys He He Glu Val Phe Met He Gin His Leu Leu Leu Phe Leu 
60 65 70 



<210> 496 
<211> 103 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -21. .-1 



<400> 496 
Met Ala Leu Cys 
-20 

Pro Pro Thr Val 
-5 

Met Met Asn Asn 
15 

Pro Cys Arg Pro 
30 

Ser Trp Lys Ser 
45 

Lys Ser Gly Gly 
60 

Ser Arg Pro Ser 



Ala Leu Thr Arg 
-15 

Ala Ala Pro Ala 
1 

Gly Leu Leu Gin 

Val Leu Thr Ser 
35 

Arg Thr Lys Tyr 
50 

Arg Asp His Thr 
65 

He Gin Arg 
80 



Ala Leu Arg Ser 
-10 

Pro Ser Leu Phe 
5 

Gin Pro Ser Ala 
20 

Val Ala Leu Asn 

Thr He Thr Pro 
55 

Gly Ala Gly Asn 
70 



Leu Asn Leu Ala 

Pro Ala Ala Gin 
10 

Leu Met Leu Leu 
25 

Ala Asn Phe Val 
40 

Val Lys Met Arg 

Val Arg Ser Asn 
75 



<210> 497 
<211> 92 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -15 . . -1 



WO 02/083898 



PCT/IB01/00914 



276 



<400> 497 

Met Ala Val Leu 

-15 

Ala Leu Leu Gly 
5 

Pro Asp Ala Trp 
20 

Arg Gly Asn Glu 
35 

Gly Trp Val Arg 
50 

Arg Arg Met Leu 



Ala Gly Ser Leu 
-10 

Gly Arg Trp Leu 

Gly Leu Pro Thr 
25 

Tyr Gin Pro Ser 
40 

Arg Leu Ser Thr 
55 

Lys Gly Arg Lys 
70 



Leu Gly Pro Thr 
-5 

Gin Pro Arg Ala 
10 

Pro Gin Gin Ala 

Asn lie Lys Arg 
4 5 

Pro Ala Gly Val 
60 

Ser Leu Ser His 
75 



Ser Arg Ser Ala 
1 

Trp Leu Gly Phe 
15 

Arg Gly Lys Ala 
30 

Lys Asn Lys His 

Gin Val lie Leu 
65 



<210> 498 
<211> 87 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -39. .-1 



<400> 498 

Met Lys Val Leu 

Tyr Trp Asp Asp 
-20 

Leu Leu Thr Gly 
-5 

Tyr Ala Arg Leu 
10 

Leu Leu Gin Lys 

Gly Arg Gly Lys 
45 



Leu Leu Thr Gly 
-35 

Asn Phe Asp Pro 

Ala Asn Ala Gly 

1 

Gly Ser His Leu 
15 

Ala Arg Trp Leu 
30 

Trp lie Thr 



Leu Gly Ala Leu 
-30 

Ala Ser Leu Gin 
-15 

Val Gly Glu Glu 
5 

Val Leu Thr Ala 
20 

Thr Leu Val Val 
35 



Phe Phe Ala Tyr 
-25 

Gly Ala Arg Val 
-10 

Leu Ala Tyr His 

His Thr Glu Ala 
25 

Ser Thr Leu Gly 
40 



<210> 499 
<211> 125 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -15. .-1 



<220> 

<221> UNSURE 
<222> 100 

<223> Xaa = Asn, Thr 



<400> 499 

Met Lys Leu Leu 

-15 

Gly Ser Arg Gly 
5 

Cys Pro Val Glu 
20 

Val Glu Trp Ser 
35 

Gin Val Pro Lys 

50 



Thr His Asn Leu 
-10 

Phe Pro Leu Arg 

Phe Asn Pro Asn 
25 

Ala Phe Leu Glu 
40 

Gly Pro Val Glu 
55 



Leu Ser Ser His 
-5 

Leu Gin Ala Thr 
10 

Phe Val Ala Arg 

Ala Ala Asp Asn 
45 

Gly Tyr Glu Glu 
60 



Val Arg Gly Val 
1 

Glu Val Arg lie 
15 

Met lie Pro Lys 
30 

Leu Arg Leu lie 

Asn Glu Glu Phe 
65 
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Leu Arg Thr Met His His 
70 

Thr Leu Gin Cys Pro Glu 
85 

lie Pro Xaa Met Leu Leu 
100 



277 

Leu Leu Leu Glu Val Glu 
75 

Ser Gly Arg Met Phe Pro 
90 

Ser Glu Glu Glu Thr Glu 
105 
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Val lie Glu Gly 
80 

lie Ser Arg Gly 
95 

Ser 
110 



<210> 500 
<211> 52 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -22. .-1 



<220> 

<221> UNSURE 
<222> 26 

<223> Xaa = Ile,Asn 



<400> 500 

Met Lys lie Ser Val Ala Ala lie Pro Phe Phe Leu Leu lie Thr lie 

-20 -15 -10 

Ala Leu Gly Thr Lys Thr Glu Ser Ser Ser Arg Gly Pro Tyr His Pro 

-5 1 5 10 

Ser Glu Cys Cys Phe Thr Tyr Thr Thr Tyr Lys lie Pro Arg Gin Xaa 
15 " 20 25 

Tyr Gly Leu Leu 
30 



<210> 501 
<211> 72 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -42. .-1 



<400> 501 

Met Asn Arg Leu Ala Gly Val Gly 
-40 -35 
Ser Ser Leu Leu Gin Ser Val Glu 

-25 -20 
Glu Val Ala Ala Ala Pro Gly Thr 
-10 -5 
Leu Ser Ala Asp Lys Phe Gin Val 
10 

Gin Thr Leu Met Ser Ser Leu Gly 
25 30 



Trp Arg Val Asp Tyr Thr Leu Ser 
-30 

Glu Pro Met Val His Leu Arg Leu 
-15 

Pro Ala Gin Pro Val Ala Met Ser 

1 5 
Leu Leu Ala Glu Leu Lys Gin Ala 
15 20 



<210> 502 
<211> 84 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -20. .-1 



<400> 502 

Met Ala Ala Ala Ala Val Pro Ser Leu Leu Leu Ser Leu Pro Pro His 



WO 02/083898 



PCT/IB01/00914 



-20 

Gin Gly Leu Thr 

Val Leu His Pro 
15 

Ser Arg Gin Leu 
30 

Leu Cys Gin Thr 
45 

Thr Val Arg Thr 



-15 

Phe Ser Asn Lys 
1 

Glu Pro Gly Leu 
20 

Arg Val Ala Leu 

35 

Gin Leu Pro Ala 
50 



278 

-10 

lie Gin Pro Phe 
5 

Arg Asp Trp Leu 

Pro Glu Lys Gly 
40 

Thr Pro Cys Phe 
55 



-5 

Gly Ala Gin Gly 
10 

Leu Pro Thr Cys 
25 

Ser Glu Gly Ser 

Leu Pro Ser Asn 
60 



<210> 503 
<211> 54 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -19. .-1 



<400> 503 

Met Lys Phe Asn 

Thr Phe Gin Ser 
1 

Arg Asn Gly Leu 
15 

Ser Thr Gly Ser 
30 



Phe Val Leu Leu 
-15 

Val Lys Lys Trp 
5 

Met Trp Pro Gly 
20 

Arg Leu 
35 



lie Leu Ser Lys 
-10 

Leu Asn Tyr Phe 

Ala Val Ala His 
25 



Val Leu Asp Asp 
-5 

Gin Phe Thr Leu 
10 

Ala Cys Asn Pro 



<210> 504 
<211> 91 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -19. .-1 



<220> 

<221> UNSURE 
<222> 29 

<223> Xaa = Met, Thr 



<400> 504 

Met Tyr Gly Lys 

lie Ser Ala Leu 

1 

Ser Ser Val Thr 
15 

Gly Gin Leu Val 
30 

Leu lie lie Leu 

lie Ser Tyr Ser 
65 



He He Phe Val 
-15 

Ser Thr Thr Glu 
5 

Lys Ser Tyr He 
20 

His Arg Phe Thr 
35 

Cys Val Met Ala 
50 

He Arg Arg Leu 



Leu Leu Leu Ser 
-10 

Val Ala Met His 

Ser Ser Gin Thr 
25 

Val Pro Ala Pro 
40 

Gly He He Gly 
55 

He Lys Ala 
70 



Glu He Val Ser 
-5 

Thr Ser Thr Ser 
10 

Asn Gly Glu Xaa 

Val Val He He 
45 

Thr He Leu Leu 
60 



<210> 505 
<211> 114 
<212> PRT 

<213> Homo sapiens 
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<220> 

<221> SIGNAL 
<222> -32. .-1 



<400> 505 



Met 


Gin 


Met 


Tyr 


Ser 


Arg 


Gin 


Leu 


Ala 


Ser 


Thr 


Glu 


Trp 


Leu 


Thr 


He 






-30 










-25 










-20 








Gin 


Gly 


Gly 


Leu 


Leu 


Gly 


Ser 


Gly 


Leu 


Phe 


Val 


Phe 


Ser 


Leu 


Thr 


Ala 




-15 










-10 










-5 










Phe 


Asn 


Asn 


Leu 


Glu 


Asn 


Leu 


Val 


Phe 


Gly 


Lys 


Gly 


Phe 


Gin 


Ala 


Lys 


1 








5 










10 










15 




lie 


Phe 


Pro 


Glu 


He 


Leu 


Leu 


Cys 


Leu 


Leu 


Leu 


Ala 


Leu 


Phe 


Ala 


Ser 








20 










25 










30 






Gly 


Leu 


He 


His 


Arg 


Val 


Cys 


Val 


Thr 


Thr 


Cys 


Phe 


He 


Phe 


Ser 


Met 






35 










40 










45 








Val 


Gly 


Leu 


Tyr 


Tyr 


He 


Asn 


Lys 


He 


Ser 


Ser 


Thr 


Leu 


Tyr 


Gin 


Ala 




50 










55 










60 










Ala 


Ala 


Pro 


Val 


Leu 


Thr 


Pro 


Ala 


Lys 


Val 


Thr 


Gly 


Lys 


Ser 


Lys 


Lys 



65 70 75 80 

Arg Asn 



<210> 506 

<211> 53 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -13. .-1 



<400> 506 
Met Ala Val Ala 
-10 

Phe Arg Val Ala 
5 

Asn Tyr Asp Val 
20 

Phe Gin Ser Val 



Phe Val Leu Ser 

Asp Gin Arg Lys 
10 

Met Lys Asp Phe 
25 

Lys 



Leu Gly Val Ala 
-5 

Lys Ala Tyr Ala 
15 

Glu Glu Met Arg 
30 



Ala Leu Tyr Lys 
1 

Asp Phe Tyr Arg 

Lys Ala Gly He 
35 



<210> 507 
<211> 108 
<212> PRT 
<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -40. .-1 



<400> 507 

Met Gin Lys Phe 

-40 

Val Lys Ala Ser 

Met Val Ala Pro 
-5 

Gin Glu Ala Ala 
10 

Gin He Glu Glu 
25 

Phe Gly Asn Pro 
Pro Leu Ala Val 



Arg Lys Met Ser 
-35 

Ser Pro Trp Leu 
-20 

Val Thr Phe Ala 

Leu lie Arg Ser 
15 

His Ala He Gin 
30 

Glu Glu Phe Val 
45 

Pro Met Trp Asn 



Glu Thr His His 
-30 

Ser Ser Ser Val 
-15 

Ser He Val Glu 

1 

Arg Glu Lys Pro 
20 

Asp Leu Leu Val 
35 

lie Val Glu Arg 
50 

Lys His Gly Cys 



Ser Val lie Ser 
-25 

Thr Ala Pro Ser 
-10 

Glu Glu Leu Gin 
5 

Leu Ala Leu He 

Phe Tyr Glu Ala 
40 

Thr Pro Gin Gly 
55 
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<210> 508 
<211> 127 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -47. .-1 



<220> 

<221> UNSURE 
<222> -22 

<223> Xaa = * , Cys 
<220> 

<221> UNSURE 
<222> -39 

<223> Xaa = * , Tyr 
<220> 

<221> UNSURE 
<222> 57 

<223> Xaa = lie, Leu 
<220> 

<221> UNSURE 
<222> -43,-27,63 
<223> Xaa = Leu, Met 

<220> 

<221> UNSURE 
<222> 3 

<223> Xaa = Lys,Gln 
<220> 

<221> UNSURE 
<222> -42 

<223> Xaa = Lys,Thr 



<400> 508 
Met Phe Thr Pro 
-45 

lie Gly Val Asp 
-30 

lie Gly Leu Ser 
-15 

lie Xaa Asp Ala 
5 

Ser Gly Lys Val 
20 

Leu Val Asn Gin 
35 

Leu Asn Ser Asn 
50 

Leu Thr Gin Ser 



Xaa Xaa Val Lys 
-40 

Xaa lie Met Lys 
-2 5 

Ser Asp Leu Gin 
-10 

Leu Ser Thr Val 

Ser Ala Asp Asn 
25 

Val Pro Lys lie 
40 

lie Asn Asp Xaa 
55 

Gin lie Ala Leu 
70 



Xaa Ala Tyr Tyr 

Thr Xaa Phe Ser 
-20 

Gin Val Gly Gly 
-5 

Leu Gin Tyr Ala 
10 

Thr Val Gly Arg 

Val Pro Asp Asp 
45 

Leu Met Val Thr 
60 

Asn Glu Lys Leu 
75 



Asp Thr Glu Arg 
-35 

Pro Asn Arg Val 

Ala Ser Ala Arg 
1 

Glu Asp Val Leu 
15 

Phe Leu Met Ser 
30 

Phe Glu Thr Met 

Tyr Xaa Ala Asn 
65 

Val Asn Leu 
80 



<210> 509 
<211> 85 
<212> PRT 

<213> Homo sapiens 
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<220> 

<221> SIGNAL 
<222> -32. .-1 



<400> 509 

Met Lys Ala Ala Leu 
-30 

Gly Ala Gin Leu Ala 
-15 

Gin Leu Gly Asp Val 
1 5 
Gin Arg Leu Met Asp 
20 

Tyr Arg Ser Leu Leu 
35 

Ala Ser Lys Val Leu 
50 



Glu Asp Thr Leu Ala Glu 
-25 

His lie Gin Ala Leu lie 
-10 

Arg Ala Asp Ser Glu Arg 
10 

lie Lys Ser Arg Leu Glu 
25 

Glu Gly Gin Glu Asp His 
40 



Thr Glu Ala Arg Phe 
-20 

Ser Gly lie Glu Ala 
-5 

Gin Asn Gin Glu Tyr 
15 

Gin Glu lie Ala Thr 

30 

Tyr Asn Asn Leu Ser 
45 



<210> 510 
<211> 101 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -25. .-1 



<400> 510 




























Met 


Val 


Asp 


Arg 


Glu 


Leu 


Ala 


Asp 


He 


His 


Glu 


Asp 


Ala 


Lys 


Thr 


Cys 


-25 










-20 










-15 










-10 


Leu 


Val 


Leu 


Cys 


Ser 
-5 


Arg 


Val 


Leu 


Ser 


Val 
1 


He 


Ser 


Val 


Lys 
5 


Glu 


He 


Lys 


Thr 


Gin 
10 


Leu 


Ser 


Leu 


Gly 


Arg 

15 


His 


Pro 


He 


He 


Ser 
20 


Asn 


Trp 


Phe 


Asp 


Tyr 
25 


He 


Pro 


Ser 


Thr 


Arg 
30 


Tyr 


Lys 


Asp 


Pro 


Cys 
35 


Glu 


Leu 


Leu 


His 


Leu 


Cys 


Arg 


Leu 


Thr 


He 


Arg 


Asn 


Gin 


Leu 


Leu 


Thr 


Asn 


Asn 


Met 


Leu 


40 










45 










50 










55 


Pro 


Asp 


Gly 


He 


Phe 
60 


Ser 


Leu 


Leu 


He 


Pro 
65 


Ala 


Arg 


Leu 


Gin 


Asn 
70 


Tyr 


Leu 


Asn 


Leu 


Glu 


He 

























75 



<210> 511 
<211> 68 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -47 . . -1 



<400> 511 

Met Val Lys Ser Ser 
-45 

Arg Glu Lys Glu Gly 
-30 

Thr Met Pro Leu Leu 
-15 

Ser Ser Arg Val Ser 
5 

Arg Trp Cys Ser 



Leu Gin Arg He Leu Asn 
-40 

Asp Lys Pro Ser Ala Thr 
-25 

Ser Leu His Ser Arg Gly 
-10 -5 
Leu His Cys Cys Ser Asn 
10 



Ser His Cys Phe Ala 
-35 

He His Ala Ser Arg 
-20 

Gly Ser Ser Ser Glu 
1 

Pro Gly Pro Gly Pro 
15 



WO 02/083898 

20 



282 



PCT/IB01/00914 



<210> 512 
<211> 68 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -47. .-1 

<220> 

<221> UNSURE 
<222> 9 

<223> Xaa = Cys, Trp 
<400> 512 

Met Val Lys Ser Ser 
-45 

Arg Glu Lys Glu Gly 
-30 

Thr Met Pro Leu Leu 
-15 

Ser Ser Arg Val Ser 
5 

Arg Trp Cys Ser 
20 

<210> 513 
<211> 129 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -16. .-1 

<400> 513 



Met 


Ala 


Val 


Leu 


Leu 




-15 








Pro 


Gly 


Pro 


Arg 


Pro 


1 








5 


Phe 


Pro 


Ala 


Arg 


Pro 








20 




Pro 


Gly 


Asp 


Leu 


Ala 






35 






Leu 


Arg 


Leu 


Leu 


Thr 




50 








Lys 


Gin 


His 


Leu 


Val 


65 










Gly 


Thr 


Phe 


Tyr 


Phe 










85 


Lys 


Asp 


Lys 


Ser 


Lys 








100 




He 











<210> 514 
<211> 109 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 



Leu Gin Arg He Leu 
-40 

Asp Lys Pro Ser Ala 
-2 5 

Ser Leu His Ser Arg 
-10 

Leu His Xaa Cys Ser 
10 



Asn Ser His Cys Phe Ala 
-35 

Thr He His Ala Ser Arg 
-20 

Gly Gly Ser Ser Ser Glu 
-5 1 
Asn Pro Gly Pro Gly Pro 
15 



Leu 


Leu 


Leu 


Arg 


Ala 


Leu 


Arg 


Arg 


Gly 


Pro 


Gly 




-10 










-5 










Leu 


Trp 


Gly 


Pro 


Gly 


Pro 


Ala 


Trp 


Ser 


Pro 


Gly 










10 










15 




Gly 


Arg 


Gly Arg 


Pro 


Tyr 


Met 


Ala 


Ser 


Arg 


Pro 








25 










30 






Glu 


Ala 


Gly 


Gly 


Arg 


Ala 


Leu 


Gin 


Ser 


Leu 


Gin 






40 










45 








Pro 


Thr 


Phe 


Glu 


Gly 


He 


Asn 


Gly 


Leu 


Leu 


Leu 




55 










60 










Gin 


Asn 


Pro 


Val 


Arg 


Leu 


Trp 


Gin 


Leu 


Leu 


Gly 


70 










75 










80 


Asn 


Thr 


Ser 


Arg 


Leu 


Lys 


Gin 


Lys 


Asn 


Lys 


Glu 










90 










95 




Gly 


Lys 


Ala 


Pro 


Glu 


Glu 


Asp 


Glu 


Gly 


He 


Phe 



105 110 
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<220> 

<221> UNSURE 
<222> 43 

<223> Xaa = Asp, Gly , Asn, Ser 



<220> 

<221> UNSURE 
<222> 22 

<223> Xaa = Glu, Gly 
<220> 

<221> UNSURE 
<222> 42 

<223> Xaa = Pro,Arg 



<400> 514 
Met Ala Ala Ser 
-30 

Asn Val Ala Pro 
-15 

Gly Gin Arg Leu 
1 

Pro Thr Ser Ser 

Ala Leu Gly Gly 
35 

Ser Ser Gin Arg 
50 

Gly Gly Gly Pro 
65 



Gly Ala Pro Arg 

Leu Ala Val Phe 
-10 

Ala Ser Glu Pro 
5 

Val Pro Xaa Thr 
20 

Ala Leu Ala Leu 

Met Pro Arg Gin 
55 

Gly Lys Ser Pro 
70 



lie Leu Val Asp 
-25 

Gin Met Leu Lys 

Gin Asp Pro Ala 
10 

Arg Gly Arg Asn 
25 

Ala Glu Xaa Xaa 
40 

Pro Ser Ala Thr 

Thr Arg Gly Ser 
75 



Leu Leu Lys Leu 
-20 

Ser Met Cys Ala 
-5 

Ala Val Ser Leu 
15 

Lys Gly Ser Ala 
30 

Ser Arg Glu Gly 
45 

Arg Leu Pro Lys 

60 

Thr 



<210> 515 
<211> 103 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -36. .-1 



<400> 515 
Met Ala Asn Leu 
-35 

Ser Arg His Thr 
-20 

Ser lie Arg Gly 

Arg Ser Leu Leu 
15 

Gly Phe Lys Asn 
30 

Leu Val Lys Arg 
45 

Arg His Lys Gin 



Phe lie Arg Lys 
-30 

Val Lys Pro Arg 
-15 

Ala Ala Pro Val 
1 

Ser Pro Gly Leu 
20 

Lys Thr Val Leu 
35 

Arg Gly Arg Trp 
50 

Arg Gin Met 
65 



Met Val Asn Pro 
-25 

Ala Leu Ser Thr 
-10 

Ala Val Glu Pro 
5 

Leu Pro His Leu 

Lys Lys Arg Cys 
40 

Tyr Val Tyr Cys 
55 



Leu Leu Tyr Leu 

Phe Leu Phe Gly 
-5 

Gly Ala Ala Val 
10 

Leu Pro Ala Leu 
25 

Lys Asp Cys Tyr 

Lys Thr His Pro 
60 



<210> 516 
<211> 86 
<212> PRT 

<213> Homo sapiens 
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<220> 

<221> SIGNAL 
<222> -36. . -1 

<400> 516 



Met Gly Pro 


Ala 


Leu 


Arg 


Ser 


Leu 


Gin 


Val 


Lys 


Lys 


Gly 


Thr 


Glu 


His 


-35 








-30 










-25 










Ala Asp Pro 


Leu 


Pro 


Phe 


Pro 


Ser 


Val 


Ser 


Leu 


Ser 


Gly 


Phe 


Thr 


Val 


-20 






-15 










-10 










-5 


Gly Thr Leu 


Ser 


Glu 


Thr 


Ser 


Thr 


Gly 


Gly 


Pro 


Ala 


Thr 


Pro 


Thr 


Trp 






1 








5 










10 






Lys Glu Cys 


Pro 


He 


Cys 


Lys 


Glu 


Arg 


Phe 


Pro 


Ala 


Glu 


Ser 


Asp 


Lys 


15 










20 










25 








Asp Ala Leu 


Glu 


Asp 


His 


Met 


Asp 


Gly 


His 


Phe 


Phe 


Phe 


Ser 


Thr 


Gin 


30 








35 










40 










Asp Pro Phe 


Thr 


Phe 


Glu 






















45 






50 






















<210> 517 




























<211> 52 




























<212> PRT 




























<213> Homo sapiens 
























<220> 




























<221> SIGNAL 


























<222> -30. .- 


-1 


























<400> 517 




























Met Ser Ser 


Cys 


Gly 


He 


Val 


Gly 


Ser 


Ser 


Val 


Ser 


Phe 


Gin 


Leu 


Asp 


-30 






-25 










-20 










-15 


Ala Val Lys 


Leu 


Leu 


Leu 


Lys 


Met 


Val 


Ser 


Ser 


Ala 


Thr 


Thr 


Glu 


Arg 






-10 










-5 










1 




Cys Cys Asn 


Gly 


Ser 


Ala 


Asn 


Phe 


His 


Lys 


Asn 


Leu 


Cys 


Ala 


Thr 


Gly 


5 










10 










15 








lie Lys Asn 


Phe 


























20 




























<210> 518 




























<211> 182 




























<212> PRT 




























<213> Homo sapiens 
























<400> 518 




























Met Ser Ala 


Thr 


Val 


Val 


Asp 


Ala 


Val 


Asn 


Ala 


Ala 


Pro 


Leu 


Ser 


Gly 


1 




5 










10 










15 




Ser Lys Glu 


Met 


Ser 


Leu 


Glu 


Glu 


Pro 


Lys 


Lys 


Met 


Thr 


Arg 


Glu 


Asp 




20 










25 










30 






Trp Arg Lys 


Lys 


Lys 


Glu 


Leu 


Glu 


Glu 


Gin 


Arg 


Lys 


Leu 


Gly 


Asn 


Ala 


35 










40 










45 








Pro Ala Glu 


Val 


Asp 


Glu 


Glu 


Gly 


Lys 


Asp 


He 


Asn 


Pro 


His 


He 


Pro 


50 








55 










60 










Gin Tyr lie 


Ser 


Ser 


Val 


Pro 


Trp 


Tyr 


He 


Asp 


Pro 


Ser 


Lys 


Arg 


Pro 


65 






70 










75 










80 


Thr Leu Lys 


His 


Gin 


Arg 


Pro 


Gin 


Pro 


Glu 


Lys 


Gin 


Lys 


Gin 


Phe 


Ser 






85 










90 










95 




Ser Ser Gly 


Glu 


Trp 


Tyr 


Lys 


Arg 


Gly 


Val 


Lys 


Glu 


Asn 


Ser 


He 


He 




100 










105 










110 






Thr Lys Tyr 


Arg 


Lys 


Gly 


Ala 


Cys 


Glu 


Asn 


Cys 


Gly 


Ala 


Met 


Thr 


His 


115 










120 










125 








Lys Lys Lys 


Asp 


Cys 


Phe 


Glu 


Arg 


Pro 


Arg 


Arg 


Val 


Gly 


Ala 


Lys 


Phe 


130 








135 










140 










Thr Gly Thr 


Asn 


He 


Ala 


Pro 


Asp 


Glu 


His 


Val 


Gin 


Pro 


Gin 


Leu 


Met 


145 






150 










155 










160 
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Phe Asp Tyr Asp Gly Lys Arg Asp Arg Trp Asn Gly Tyr Asn Pro Glu 

165 " 170 175 

Glu His Met Lys lie Val 
180 



<210> 519 
<211> 147 
<212> PRT 

<213> Homo sapiens 



<400> 519 



Met 


Asp 


Val 


Leu 


1 








Glu 


Glu 


He 


Lys 








20 


Gly 


Cys 


Leu 


Gly 






35 




Leu 


Lys 


Leu 


Lys 




50 






Glu 


Arg 


Asn 


Lys 


65 








Gin 


Glu 


Val 


Phe 


Asn 


Pro 


Pro 


Leu 








100 


Tyr 


Gin 


Cys 


Asn 






115 




Asp 


Ser 


Trp 


Leu 




130 






Leu 


Gin 


Lys 





145 



Val Ser Glu Cys 
5 

Ser Leu Thr Ala 

Ala Ser Pro Asn 
40 

Tyr Arg Leu Asn 
55 

Pro Thr Lys Asn 
70 

Gly His Ala He 
85 

Leu Val Thr Pro 

Ser Ala Met Gly 
120 

Phe Asp Phe Phe 
135 



Ser Ala Arg Leu 
10 

Glu He Asp Arg 
25 

Leu Glu Gin Leu 

He Leu Arg Lys 
60 

Met He Asn He 
75 

Lys Ala Ala Tyr 
90 

Ser Gin Gin Ala 
105 

He Ser Gin Val 

Lys Tyr Tyr Tyr 
140 



Leu Gin Gin Glu 
15 

Leu Lys Asn Cys 
30 

Gin Glu Glu Asn 
45 

Ser Leu Gin Ala 

He Ser Arg Leu 
80 

Pro Asp Leu Glu 
95 

Lys Phe Gly Asp 
110 

Met Tyr Cys His 
125 

His His Cys His 



<210> 520 
<211> 65 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -28 . .-1 

<400> 520 

Met Asn Leu Phe Arg Phe Leu Gly Asp Leu Ser His Leu Leu Ala He 

-25 ^ -20 -15 

He Leu Leu Leu Leu Lys He Trp Lys Ser Arg Ser Cys Ala Ala His 

-10 ~ -5 1 

Pro Gin Leu Pro Leu Ser Phe Cys Leu Ser Val Cys Leu Ser Val Ser 
5 10 15 20 

Leu Ser Leu Ser Val Ser Leu Ser Leu Ser Phe Ser Val Ser Lys Lys 
25 30 35 

Lys 

<210> 521 
<211> 140 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -40 . . -1 



<400> 521 

Met Val Ser Ser Phe Arg Val Ser Glu Leu Gin Val Leu Leu Gly Phe 
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-40 






-35 










-30 










-25 


Ala Gly Arg 


Asn 


Lys 


Ser 


Gly 


Arg 


Lys 


His 


Asp 


Leu 


Leu 


Met 


Arg 


Ala 






-20 










-15 










-10 




Leu His Leu 


Leu 


Lys 


Ser 


Gly 


Cys 


Ser 


Pro 


Ala 


Val 


Gin 


He 


Lys 


He 




-5 










1 








5 








Arg Glu Leu 


Tyr 


Arg 


Arg 


Arg 


Tyr 


Pro 


Arg 


Thr 


Leu 


Glu 


Gly 


Leu 


Ser 


10 








15 










20 










Asp Leu Ser 


Thr 


lie 


Lys 


Ser 


Ser 


Val 


Phe 


Ser 


Leu 


Asp 


Gly 


Gly 


Ser 


25 






30 










35 










40 


Ser Pro Val 


Glu 


Pro 


Asp 


Leu 


Ala 


Val 


Ala 


Gly 


He 


His 


Ser 


Leu 


Pro 






45 










50 










55 




Ser Thr Ser 


Val 


Thr 


Pro 


His 


Ser 


Pro 


Ser 


Ser 


Pro 


Val 


Gly 


Ser 


Val 




60 










65 










70 






Leu Leu Gin 


Asp 


Thr 


Lys 


Pro 


Thr 


Phe 


Glu 


Met 


Gin 


Gin 


Pro 


Ser 


Pro 


75 










80 










85 








Pro lie Pro 


Pro 


Val 


His 


Pro 


Asp 


Val 


Gin 


Leu 


Lys 










90 








95 










100 










<210> 522 




























<211> 154 




























<212> PRT 




























<213> Homo sapiens 
























<220> 




























<221> UNSURE 


























<222> 135 




























<223> Xaa = 


His, Asn, 


Pro, 


Thr 




















<220> 




























<221> UNSURE 


























<222> 137 




























<223> Xaa = 


lie, 


Leu 
























<220> 




























<221> UNSURE 


























<222> 139 




























<223> Xaa = 


Pro, Arg 
























<220> 




























<221> UNSURE 


























<222> 136 




























<223> Xaa - 


Pro, 


Gin 
























<400> 522 




























Met Ala Glu 


Val 


Glu 


Gin 


Lys 


Lys 


Lys 


Arg 


Thr 


Phe 


Arg 


Lys 


Phe 


Thr 


1 




5 










10 










15 




Tyr Arg Gly 


Val 


Asp 


Leu 


Asp 


Gin 


Leu 


Leu 


Asp 


Met 


Ser 


Tyr 


Glu 


Gin 




20 










25 










30 






Leu Met Gin 


Leu 


Tyr 


Ser 


Ala 


Arg 


Gin 


Ala 


Ala 


Ala 


Glu 


Pro 


Gly 


Pro 


35 










40 










45 








Ala Ala Glu 


Ala 


Ala 


Leu 


Pro 


Ala 


Glu 


Ala 


Pro 


Ala 


Gin 


Gly 


Gin 


Glu 


50 








55 










60 










Gly Gly Ala 


Ala 


His 


Gly 


Glu 


Ala 


Gly 


Ser 


Gly 


Glu 


Asp 


Ala 


Pro 


Ala 


65 






70 










75 










80 


Gly His Asp 


His 


Pro 


Thr 


Arg 


Asp 


Gly 


Gly 


Gin 


His 


Gly 


Gly 


Arg 


Leu 






85 










90 










95 




Gin Arg Gin 


Asp 


Leu 


Gin 


Pro 


Gly 


Gly 


Asp 


Gin 


Ala 


Arg 


Asp 


Asp 


Arg 




100 










105 










110 






Pro Leu Pro 


Gly 


Arg 


Val 


Leu 


His 


His 


Leu 


Gin 


Ala 


Arg 


Lys 


Ala 


Trp 


115 










120 










125 








Pro Ala Arg 


His 


Arg 


Gly 


Xaa 


Xaa 


Xaa 


Leu 


Xaa 


Leu 


His 


Pro 


Ser 


Gin 


130 








135 










140 










Val Met Ala 


Gin 


Leu 


He 


Lys 


Ala 


His 


Met 
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<210> 523 
<211> 65 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> UNSURE 
<222> 63 

<223> Xaa = * , Ala, Glu, Gly, lie, Lys, Leu, Pro, Gin, Arg, Ser, Thr, Val 



<400> 523 

Met Ala Glu Glu Gin Gly Arg Glu Arg Asp Ser Val Pro Lys Pro Ser 

15 10 15 

Val Leu Phe Leu His Pro Asp Leu Gly Val Gly Gly Ala Glu Arg Leu 

20 25 30 

Val Leu Leu Pro Phe Pro Thr Glu Arg Arg Leu Leu Phe Cys Thr Ala 

35 40 45 

Pro Arg Ser Leu Asn Ser Leu Pro Leu Lys Lys Lys Lys lie Xaa Lys 
50 55 60 

Ser 
65 



<210> 524 
<211> 133 
<212> PRT 

<213> Homo sapiens 



<400> 524 

Met Glu Tyr Val 

1 

Gin Phe Thr Val 
20 

Val Ser Arg Asp 
35 

Val Cys Tyr Lys 
50 

Leu lie Val lie 
65 

Val lie Gin Glu 

Leu Pro Gin Lys 
100 

Lys Leu Gly Leu 
115 

Asp Glu Ser Asp 
130 



Leu Glu Val Lys 
5 

Cys Asp Val Pro 

Arg Cys Gin Glu 
40 

Lys Ala Val Pro 
55 

lie Ala Gly Ala 
70 

Ser Arg Lys Glu 
85 

Ser Ser Glu Lys 

Lys Leu Arg Val 
120 

Glu 



Asn Ser Pro Arg 
10 

Leu Tyr Asp lie 
25 

Leu Gly Cys Cys 

lie Tyr lie His 
60 

Phe Val lie Thr 
75 

Lys Ala lie Pro 
90 

Ala Glu Leu Ala 
105 

Leu Gly Leu Lys 



His Leu Leu Lys 
15 

Cys Asp Tyr Asn 
30 

Phe Tyr Glu Gly 
45 

Val Phe Ser Ala 

lie lie Tyr Arg 
80 

Val Tyr Val Ala 
95 

Ser Ser Ser Ser 
110 

Val Leu Gly Leu 
125 



<210> 525 
<211> 149 
<212> PRT 

<213> Homo sapiens 



Met 


Arg 


Tyr 


Asn 


Glu 


Lys 


Glu 


Leu 


Gin 


Ala 


Leu 


Ser 


Arg 


Gin 


Pro 


Ala 


1 








5 










10 










15 




Glu 


Met 


Ala 


Ala 


Glu 


Leu 


Gly 


Met 


Arg 


Gly 


Pro 


Lys 


Lys 


Gly 


Ser 


Val 








20 










25 










30 






Leu 


Lys 


Arg 


Arg 


Leu 


Val 


Lys 


Leu 


Val 


Val 


Asn 


Phe 


Leu 


Phe 


Tyr 


Phe 






35 










40 










45 








Arg 


Thr 


Asp 


Glu 


Ala 


Glu 


Pro 


Val 


Gly 


Ala 


Leu 


Leu 


Leu 


Glu 


Arg 


Cys 



50 55 60 
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Arg 


Val 


Val 


Arg 


Glu 


Glu 


Pro 


Gly 


Thr 


65 










70 








Asp 


Pro 


Glu 


Arg 


Lys 


Tyr 


His 


Phe 


Glu 










85 










Gin 


Glu 


Trp 


Met 


Glu 


Ala 


Leu 


Arg 


Arg 








100 










105 


Arg 


Ser 


Leu 


He 


Phe 


Tyr 


Arg 


Asn 


Glu 






115 










120 




Asp 


Pro 


Leu 


Glu 


Gin 


Phe 


Gly 


He 


Ser 




130 










135 






Ser 


Gly 


Leu 


Gin 


Ala 











145 



<210> 526 
<211> 92 
<212> PRT 

<213> Homo sapiens 



<400> 526 



Met 


Pro 


Val 


Val 


Pro 


Ala 


Leu 


Gly Arg 


1 








5 










Arg 


Ser 


Gly 


Val 
20 


Arg 


Asp 


Gin 


Pro 


Gly 
25 


Ser 


Leu 


Leu 
35 


Lys 


He 


Gin 


Lys 


Leu 
40 


Ala 


Trp 


Ser 
50 


Gin 


Leu 


Leu 


Gly 


Arg 
55 


Leu 


Arg 


Gly 


Gly 


Gly 


Gly 


Cys 


Ser 


Glu 


Pro 


Arg 


65 










70 








Trp 


Val 


Thr 


Glu 


Gin 
85 


Asp 


Ser 


He 


Ser 



<210> 527 
<211> 60 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -21. .-1 

<220> 

<221> UNSURE 
<222> -5 

<223> Xaa = Ser, Thr 
<400> 527 

Met Asp Pro Asn Cys Ser Cys Ala Ala 

-20 -15 
Xaa Pro Ala Ser Ala Lys Ser Ala Asn 
-5 1 
Ala Ala Pro Gly Ala Leu Ala Pro Pro 

15 20 
Cys His Leu Ser Val Cys Pro Lys Glu 
30 35 

<210> 528 
<211> 120 
<212> PRT 

<213> Homo sapiens 
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Phe 


Ser 


He 


Ser 


Phe 


He 


Glu 




75 










80 


Cys 


Ser 


Ser 


Glu 


Glu 


Gin 


Cys 


90 










95 




Ala 


Ser 


Tyr 


Glu 


Phe 


Met 


Arg 










110 






He 


Arg 


Lys 


Val 


Thr 


Gly 


Lys 








125 








Glu 


Glu 


Ala 


Arg 


Phe 


Gin 


Leu 






140 










Pro 


Arg 


Trp 


Ala 


Asp 


His 


Leu 


10 










15 




Gin 


Pro 


Gly 


Glu 


Ala 


Pro 


Pro 










30 






Gly 


Tyr 


Gly 


Gly 


Gly 


Cys 


Leu 








45 








Arg 


Glu 


Asn 


His 


Leu 


Ser 


Pro 






60 










Leu 


Cys 


His 


Cys 


Thr 


Pro 


Ala 




75 










80 


Lys 


He 


Glu 










90 














Gly 


Val 


Ser 


Cys 


Thr 


Cys 


Ala 






-10 










Ala 


Pro 


Pro 


Ala 


Arg 


Arg 


Ala 


5 










10 




Ser 


Trp 


Thr 


Pro 


Ala 


Leu 


Pro 










25 






Val 


Leu 


Val 











<220> 

<221> SIGNAL 
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<222> -35. . -1 
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<400> 528 




























Met 


Ser 


Lys 


His 


Leu 


His 


Cys 


Lys 


Val 


Leu 


Gly 


Leu 


Pro 


Leu 


Arg 


Leu 


-35 










-30 










-25 










-20 


Glu 


Val 


Ser 


Ala 


Ser 
-15 


Cys 


Leu 


Ser 


Gin 


Ser 
-10 


Leu 


Ala 


Met 


Ser 


Pro 
-5 


Gin 


Thr 


His 


Ser 


Gin 
1 


Thr 


Cys 


He 


Arg 
5 


Asn 


Leu 


Val 


Thr 


Cys 
10 


He 


Asn 


Tyr 


Pro 


Arg 
15 


Thr 


Ser 


Thr 


Gly 


Cys 
20 


Lys 


Gly 


Thr 


Thr 


Thr 
25 


Gin 


Arg 


He 


Met 


Glu 


Pro 


Val 


Glu 


Leu 


Glu 


Val 


Glu 


Gly 


Thr 


Glu 


Gin 


Asp 


Asn 


Ala 


Lys 


30 










35 










40 










45 


Thr 


Cys 


Gly 


Ser 


Leu 


Gly Arg 


Gly 


Asn 


Glu 


Asn 


Thr 


Met 


Leu 


Arg 


Gly 










50 










55 










60 




Gly 


Phe 


Ser 


Met 
65 


Asn 


Thr 


Thr 


Val 


Gly 
70 


Gin 


Gly 


He 


Ser 


Lys 
75 


Gin 


Thr 


His 


His 


Thr 


Ser 


Thr 


Thr 


Ser 


Ser 



















80 85 



<210> 529 
<211> 65 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -14. .-1 



<400> 529 

Met Leu Cys Trp 

Leu Asn Trp Ala 
5 

Leu Ser Leu He 
20 

Arg Leu Val Lys 

35 

Pro 



Leu Lys Leu Val 
-10 

Gin His Ala Glu 
10 

Gin Val Thr Leu 
25 

Ser Lys Pro Phe 
40 



Ser He Thr Thr 
-5 

Asn Thr Thr Glu 

Thr Ser Leu Glu 
30 

His Leu Gin Asn 
45 



Val Ala He He 
1 

Cys Ala His Trp 
15 

Ala Cys Gin Asn 

Phe Thr Cys Lys 
50 



<210> 530 
<211> 99 
<212> PRT 

<213> Homo sapiens 



<400> 530 

Met Lys Lys Lys 

1 

Pro Gin Tyr Gin 
20 

Cys Leu Arg Glu 
35 

Leu Gly Lys Thr 
50 

Val Phe Leu Ser 
65 

Ala Glu Gin Pro 
Leu Ser Leu 



Glu Glu Thr Thr 
5 

Leu Val Asn Ala 

Val He Gly Glu 
40 

Glu Ser Gin Gly 
55 

Phe Leu He Phe 
70 

Phe Phe Lys Glu 
85 



Leu Ser Glu Met 
10 

Glu Ser Thr Ser 
25 

Tyr Ser Val His 

He Gly Leu Trp 
60 

Ser Thr Ser Phe 
75 

Pro Pro Thr Glu 
90 



Glu Pro Val Glu 
15 

Pro Phe Leu His 
30 

Glu Phe Ser Leu 
45 

He Ala Leu Val 

Tyr He Ser Asn 
80 

Ala Ala Lys Glu 
95 



<210> 531 
<211> 128 
<212> PRT 
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<220> 

<221> UNSURE 
<222> 70 

<223> Xaa - Pro, Thr 



<400> 531 




























Met 


Ala 


Gly 


Asn 


Leu 


Leu 


Ser 


Gly Ala 


Gly Arg Arg 


Leu 


Trp 


Asp 


Trp 


1 








5 










10 










15 




Val 


Pro 


Leu 


Ala 
20 


Cys 


Arg 


Ser 


Phe 


Ser 
25 


Leu 


Gly 


Val 


Pro 


Arg 
30 


Leu 


He 


Gly 


He 


Arg 
35 


Leu 


Thr 


Leu 


Pro 


Pro 
40 


Pro 


Lys 


Val 


Val 


Asp 
45 


Arg 


Trp 


Asn 


Glu 


Lys 
50 


Arg 


Ala 


Met 


Phe 


Gly 
55 


Val 


Tyr 


Asp 


Asn 


He 
60 


Gly 


He 


Leu 


Gly 


Asn 


Phe 


Glu 


Lys 


His 


Xaa 


Lys 


Glu 


Leu 


He 


Arg 


Gly 


Pro 


He 


Trp 


Leu 


65 










70 










75 










80 


Arg 


Gly 


Trp 


Lys 


Gly 
85 


Asn 


Glu 


Leu 


Gin 


Arg 
90 


Cys 


He 


Arg 


Lys 


Arg 
95 


Lys 


Met 


Val 


Gly 


Ser 
100 


Arg 


Met 


Phe 


Ala 


Asp 
105 


Asp 


Leu 


His 


Asn 


Leu 
110 


Asn 


Lys 


Arg 


He 


Arg 
115 


Tyr 


Leu 


Tyr 


Lys 


His 
120 


Phe 


Asn 


Arg 


His 


Gly 
125 


Lys 


Phe 


Arg 



<210> 532 
<211> 52 
<212> PRT 

<213> Homo sapiens 



<400> 532 

Met Phe Leu Thr Leu Ala Asp Thr 
1 5 

Leu Leu Asn Val Tyr Glu Gly Glu 
20 

Leu Leu Glu Ser Trp He He Val 

35 40 
He Ser Ser His 
50 



Cys Lys Leu Arg Gly Met Ser Phe 

10 15 
Ala Thr Val Ser Ser Val Leu Glu 
25 30 
Gly Asn Glu Arg Tyr Phe Asp Gly 
45 



<210> 533 
<211> 84 
<212> PRT 

<213> Homo sapiens 



<400> 533 



Met 


Lys 


Val 


Lys 


He 


Lys 


Cys 


Trp 


Asn 


Gly Val 


Ala 


Thr 


Trp 


Leu 


Trp 


1 








5 










10 








15 




Val 


Ala 


Asn 


Asp 


Glu 


Asn 


Cys 


Gly 


He 


Cys Arg 


Met 


Ala 


Phe 


Asn 


Gly 








20 










25 








30 






Cys 


Cys 


Pro 


Asp 


Cys 


Lys 


Val 


Pro 


Gly Asp Asp 


Cys 


Pro 


Leu 


Val 


Trp 






35 










40 








45 








Gly 


Gin 


Cys 


Ser 


His 


Cys 


Phe 


His 


Met 


His Cys 


He 


Leu 


Lys 


Trp 


Leu 




50 










55 








60 










His 


Ala 


Gin 


Gin 


Val 


Gin 


Gin 


His 


Cys 


Pro Met 


Cys 


Arg 


Gin 


Glu 


Trp 



65 70 75 80 

Lys Phe Lys Glu 



<210> 534 
<211> 80 
<212> PRT 

<213> Homo sapiens 
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<400> 534 

Met Ala Thr Pro Thr Gin Thr Pro 
1 5 

Pro Phe Tyr Tyr Asp Tyr Asn Thr 
20 

Ala Thr lie Leu Phe Leu Leu Gly 

35 40 
Val Lys Cys Arg Lys Ala Asp Ser 

50 55 
Ser Cys Lys Ser Glu Leu Pro Ser 
65 ^ 7Q 



291 



Thr Lys Ala Pro Glu Glu Pro Asp 

10 15 
Val Gin Thr Val Gly Met Thr Leu 
25 30 
lie Leu lie Val lie Ser Lys Lys 
4 5 

Arg Ser Glu Ser Pro Thr Cys Lys 
60 

Ser Ala Pro Gly Gly Gly Gly Val 
75 80 



<210> 535 
<211> 127 
<212> PRT 

<213> Homo sapiens 



<400> 535 

Met Ser Phe Ser 

1 

Ala Phe Met Lys 
20 

Lys Asp lie Lys 
35 

Lys Phe Thr lie 
50 

Val Gly Glu Glu 
65 

Thr Val Val Gin 

Asn lie Lys Ser 
100 

Met Thr Leu Gly 
115 



Gly Lys Tyr Gin 
5 

Ala lie Gly Leu 

Gly Val Ser Glu 
40 

Thr Ala Gly Ser 
55 

Cys Glu Leu Glu 
70 

Leu Glu Gly Asp 
85 

Val Thr Glu Leu 

Asp lie Val Phe 
120 



Leu Gin Ser Gin 
10 

Pro Glu Glu Leu 
25 

lie Val Gin Asn 

Lys Val lie Gin 
60 

Thr Met Thr Gly 
75 

Asn Lys Leu Val 
90 

Asn Gly Asp lie 
105 

Lys Arg lie Ser 



Glu Asn Phe Glu 
15 

lie Gin Lys Gly 
30 

Gly Lys His Phe 
4 5 

Asn Glu Phe Thr 

Glu Lys Val Lys 
80 

Thr Thr Phe Lys 
95 

lie Thr Asn Thr 
110 

Lys Arg lie 
125 



<210> 536 
<211> 77 
<212> PRT 

<213> Homo sapiens 



<400> 536 

Met Ala Ala Gly Thr Ser Pro Ala 
1 5 

Ala Ala Gly Pro Thr Ser Gly Ala 
20 

Glu Ser Leu Pro Gly Asn Tyr Ser 

35 40 
Trp lie Ala Ser Arg Gin Lys Ser 

50 55 
Ser Arg Lys Asp Lys Gly Gly Lys 
65 70 



Asp Ala Leu Cys Asp Gly Lys Lys 

10 15 
Phe Tyr Asn Glu Leu Arg Ala Leu 
25 30 
Ser Ala Ser Thr Val Val Leu Leu 
45 

Gin Ser Ser Arg Asn Ser Gin Cys 
60 

Gin Glu Glu His Glu 
75 



<210> 537 
<211> 75 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 65 

<223> Xaa = Asp, Gly 



<400> 537 
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Met Pro Gly Pro Thr Pro Ser Gly Thr 
1 ' 5 

Ser Pro Ser Lys Ala Val Ala Ala Arg 

20 25 
Arg Gly Lys Met Pro Ala Val Gly Gin 

35 40 
Arg Gin Ala Pro Gly Gly Cys Gly Asp 

50 ~ 55 

Xaa Ser Lys Leu Ala Leu Phe Gin Tyr 
65 70 
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Asn Val Gly Ser Ser Gly Arg 
10 15 
Ala Arg Asp Pro Leu Ser Gly 
30 

Gly Val Gin Ala Ala Gin Pro 
45 

Ser Thr Gin Lys lie His Leu 
60 

Trp Leu 
75 



<210> 538 
<211> 56 
<212> PRT 

<213> Homo sapiens 



<400> 538 

Met Ser Ser Trp 

1 

Ala Arg Leu Leu 

20 

Gin Thr Asp Gin 
35 

Lys Lys Tyr Tyr 
50 



Ser Leu Met Cys 
5 

Tyr Glu Ala Asn 

Ala Val Glu Phe 
40 

Glu Lys Met Pro 
55 



Met Gin Asp Met 
10 

Leu Pro Glu Asn 
25 

Phe lie Arg Asp 



Gly Asn Thr Lys 
15 

Phe Arg Arg Pro 
30 

Lys Tyr Glu Lys 
45 



<210> 539 
<211> 122 
<212> PRT 

<213> Homo sapiens 



<400> 539 

Met Glu Ala Ser 

1 

Val Val Arg Val 
20 

He Ala Thr Val 
35 

Met Val Leu Ser 
50 

Ser He Ala Ala 
65 

Gly Val Ala Ser 

Thr Gly Leu Ser 
100 

Ala He Ala Ala 
115 



Ala Leu Thr Ser 

5 

Ala Ser Gly Ser 

Val He Gly Gly 
40 

Ala Met Gly Phe 
55 

Lys Met Met Ser 
7 0 

Gly Ser Leu Val 
85 

Gly Leu Thr Lys 

Val He Ala Arg 
120 



Ser Ala Val Thr 
10 

Ala Val Val Leu 
25 

Val Val Ala Met 

Thr Ala Ala Gly 
60 

Ala Ala Ala He 
75 

Ala Thr Leu Gin 
90 

Phe He Leu Gly 
105 

Phe Tyr 



Ser Val Ala Lys 
15 

Pro Leu Ala Arg 
30 

Ala Ala Val Pro 
45 

He Ala Ser Ser 

Ala Asn Gly Gly 
80 

Ser Leu Gly Ala 
95 

Ser He Gly Ser 
110 



<210> 540 
<211> 74 
<212> PRT 

<213> Homo sapiens 



<400> 540 

Met Ala Ser Ser 

1 

Thr Pro Glu He 
20 

Arg Phe Ala Thr 
35 

Leu Gly Leu Phe 
50 



Thr Val Pro Val 
5 

Pro Asp Asn Val 

Asp Arg Asn Asp 
4 0 

Ala Ala Gly Val 
55 



Ser Ala Ala Gly 
10 

Gly Asp Trp Leu 
25 

Phe Arg Arg Asn 

Trp Leu Ala Arg 
60 



Ser Ala Asn Glu 
15 

Arg Gly Val Tyr 
30 

Leu He Leu Asn 
45 

Asn Leu Ser Asp 
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lie Asp Leu Met Ala Pro Gin Pro Gly Val 
65 70 



<210> 541 
<211> 97 
<212> PRT 

<213> Homo sapiens 



<400> 541 

Met Ser Ser Ala 

1 

Glu Lys Glu Phe 

20 

Leu Phe Arg Ala 
35 

Val Met Ala Phe 
50 

Gly Tyr Leu His 
65 

lie Asp Ser Glu 
Asp 



Pro Glu Pro Pro 
5 

Gin Ser Pro Gly 

Val Asn Pro Glu 
40 

Gly Leu Val Thr 
55 

Ala lie Gin Glu 
70 

Gly His Ser Tyr 
85 



Thr Phe Lys Lys 
10 

Leu Arg Gly Val 
25 

Leu Phe lie Lys 

Leu Ser Leu Cys 
60 

Asn Lys Lys Asp 
75 

Met Arg Arg Lys 
90 



Glu Pro Pro Lys 
15 

Arg Thr Thr Thr 
30 

Pro Asn Lys Pro 
45 

Val Ala Tyr lie 

Leu Tyr Glu Ala 
80 

Thr Ser Lys Trp 
95 



<210> 542 
<211> 98 
<212> PRT 

<213> Homo sapiens 



<400> 542 

Met Ala Ala Glu 

1 

Val Thr Thr Phe 
20 

Leu Ser Val Asn 
35 

Leu Leu Lys Asp 
50 

Val Asn Gin Asp 
65 

Gly Glu Leu Ala 
Lys Lys 



Pro Leu Thr Glu 
5 

Phe Thr Phe Ala 

Glu Phe Lys Glu 
40 

Val Gly Ser Leu 
55 

Ser Glu Leu Lys 
70 

Lys Glu lie Arg 
85 



Leu Glu Glu Ser 
10 

Arg Gin Glu Gly 
25 

Leu Val Thr Gin 

Asp Glu Lys Met 
60 

Phe Asn Glu Tyr 
75 

Lys Lys Lys Asp 
90 



lie Glu Asn Val 
15 

Arg Lys Asp Ser 
30 

Gin Leu Pro His 
45 

Lys Ser Leu Asp 

Trp Arg Leu lie 
80 

Leu Lys lie Arg 
* 95 



<210> 543 
<211> 115 
<212> PRT 
<213> Homo sapiens 



<400> 543 

Met Val Ala Ala 

1 

Arg Leu Gin Leu 
2 0 

Gin Thr Leu Lys 
35 

Ala Asn Asn Cys 
50 

Met Leu Ala Lys 
65 

Leu Gly Thr Ala 

lie Asp Pro Gly 
100 



Lys Lys Thr Lys 
5 

Val Met Lys Ser 

Met lie Arg Gin 
40 

Pro Ala Leu Arg 
55 

Thr Gly Val His 
70 

Cys Gly Lys Tyr 
85 

Asp Ser Asp lie 



Lys Ser Leu Glu 
10 

Gly Lys Tyr Val 
25 

Gly Lys Ala Lys 

Lys Ser Glu lie 
60 

His Tyr Ser Gly 
75 

Tyr Arg Val Cys 
90 

lie Arg Ser Met 
105 



Ser lie Asn Ser 
15 

Leu Gly Tyr Lys 

30 

Leu Val lie Leu 
45 

Glu Tyr Tyr Ala 

Asn Asn lie Glu 
80 

Thr Leu Ala lie 
95 

Pro Glu Gin Thr 
110 
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Gly Glu Lys 
115 

<210> 544 
<211> 102 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 85 

<223> Xaa = Asp,Asn 
<400> 544 

Met Ala Gly Gin Ala Phe Arg Lys Phe Leu Pro Leu Phe Asp Arg Val 

1 5 10 15 

Leu Val Glu Arg Ser Ala Ala Glu Thr Val Thr Lys Gly Gly lie Met 

20 25 30 

Leu Pro Glu Lys Ser Gin Gly Lys Val Leu Gin Ala Thr Val Val Ala 

35 4 0 4 5 

Val Gly Ser Gly Ser Lys Gly Lys Gly Gly Glu lie Gin Pro Val Ser 

50 55 60 

Val Lys Val Gly Asp Lys Val Leu Leu Pro Glu Tyr Gly Gly Thr Lys 
65 70 75 80 

Val Val Leu Asp Xaa Lys Asp Tyr Phe Leu Phe Arg Asp Gly Asp lie 

85 90 95 

Leu Gly Lys Tyr Val Asp 
100 

<210> 545 
<211> 102 
<212> PRT 

<213> Homo sapiens 
<400> 545 

Met Ala Gly Gin Ala Phe Arg Lys Phe Leu Pro Leu Phe Asp Arg Val 

15 10 15 

Leu Val Glu Arg Ser Ala Ala Glu Thr Val Thr Lys Gly Gly lie Met 

20 25 30 

Leu Pro Glu Lys Ser Gin Gly Lys Val Leu Gin Ala Thr Val Val Ala 

35 40 45 

Val Gly Ser Gly Ser Lys Gly Lys Gly Gly Glu lie Gin Pro Val Ser 

50 " 55 ' 60 

Val Lys Val Gly Asp Lys Val Leu Leu Pro Glu Tyr Gly Gly Thr Lys 
65 70 75 80 

Val Val Leu Asp Asp Lys Asp Tyr Phe Leu Phe Arg Asp Gly Asp lie 

85 90 ~ 95 

Leu Gly Lys Tyr Val Asp 
100 

<210> 546 
<211> 116 
<212> PRT 

<213> Homo sapiens 
<400> 546 

Met Ser Ala Thr Ala Ala Thr Ala Pro Pro Ala Ala Pro Ala Gly Glu 

15 10 15 

Gly Gly Pro Pro Ala Pro Pro Pro Asn Leu Thr Ser Asn Arg Arg Leu 

20 25 30 

Gin Gin Thr Gin Ala Gin Val Asp Glu Val Val Asp lie Met Arg Val 

35 40 45 

Asn Val Asp Lys Val Leu Glu Arg Asp Gin Lys Leu Ser Glu Leu Asp 



WO 02/083898 



PCT/IB01/00914 



50 

Asp Arg Ala Asp 
65 

Ala Ala Lys Leu 

lie lie Leu Gly 
100 

Tyr Phe Ser Thr 
115 



55 

Ala Leu Gin Ala 
70 

Lys Arg Lys Tyr 
8 5 

Val lie Cys Ala 



295 

60 

Gly Ala Ser Gin 
75 

Trp Trp Lys Asn 
90 

lie lie Leu lie 
105 



Phe Glu Thr Ser 
80 

Leu Lys Met Met 
95 

lie lie lie Val 
110 



<210> 547 
<211> 97 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 54 

<223> Xaa = Glu, Lys 



<400> 547 

Met Ser Leu Ser 

1 

Pro Phe His Ser 
20 

Val Val Leu Thr 
35 

Tyr Gin Asn Val 
50 

Glu Thr Asn He 
65 

Ser Arg Asn Leu 
Phe 



Leu Val Phe Arg 
5 

Ser Ser Ser Asn 

Gin Thr Leu Val 
40 

Pro Xaa Ser Ala 
55 

His Leu Phe His 
70 

Phe Asn Ser Leu 
85 



Ala Ala Ser Tyr 
10 

Gin Phe Leu Gin 
25 

Leu Leu His Phe 

Gin Gly Lys Gly 
60 

Phe Leu Thr Phe 
75 

Leu Cys Leu Met 
90 



Phe Lys Leu Val 
15 

Pro Pro Gly Trp 
30 

Glu Arg Phe Ser 
45 

Asn Leu Gin Pro 

Pro Lys Gin He 
80 

Cys Leu Thr Tyr 
95 



<210> 548 
<211> 113 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 110 

<223> Xaa = Ser, Thr 



<400> 548 

Met Ser Met Thr 

1 

Gly Ala Phe Ser 
20 

Met Val Gly Leu 
35 

His Met Leu Asp 
50 

Gly Phe He Leu 
65 

Lys Glu Thr Lys 

Lys He Gly Val 
100 

His 



Asp Leu Leu Asn 
5 

Ala Thr Asp Ser 

Lys Lys Lys Ser 
40 

Lys Asp Lys Ser 
55 

Lys Gly Phe Ser 
70 

Met Leu Met Ala 
85 

Asp Glu Phe Ser 



Ala Glu Asp He 
10 

Phe Asp His Lys 
25 

Ala Asp Asp Val 

Gly Phe He Glu 
60 

Pro Asp Ala Arg 
75 

Ala Gly Asp Lys 
90 

Thr Leu Val Ala 
105 



Lys 


Lys 


Ala 


Val 






15 




Lys 


Phe 


Phe 


Gin 




30 






Lys 


Lys 


Val 


Phe 


45 








Glu 


Asp 


Glu 


Leu 


Asp 


Leu 


Ser 


Ala 








80 


Asp 


Gly 


Asp 


Gly 






95 




Glu 


Xaa 


Lys 


Lys 




110 







<210> 549 
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<211> 112 
<212> PRT 

<213> Homo sapiens 



<400> 549 



Met 


Asp 


Pro 


Arg 


Lys 


Val 


Asn 


Glu 


Leu 


Arg 


Ala 


Phe 


Val 


Lys 


Met 


Cys 


1 








5 










10 










15 




Lys 


Gin 


Asp 


Pro 
20 


Ser 


Val 


Leu 


His 


Thr 

25 


Glu 


Glu 


Met 


Arg 


Phe 
30 


Leu 


Arg 


Glu 


Trp 


Val 
35 


Glu 


Ser 


Met 


Gly 


Gly 
40 


Lys 


Val 


Pro 


Pro 


Ala 

45 


Thr 


Gin 


Lys 


Ala 


Lys 
50 


Ser 


Glu 


Glu 


Asn 


Thr 
55 


Lys 


Glu 


Glu 


Lys 


Pro 
60 


Asp 


Ser 


Lys 


Lys 


Val 


Glu 


Glu 


Asp 


Leu 


Lys 


Ala 


Asp 


Glu 


Pro 


Ser 


Ser 


Glu 


Glu 


Ser 


Asp 


65 










70 










75 










80 


Leu 


Glu 


He 


Asp 


Lys 
85 


Glu 


Gly 


Val 


He 


Glu 
90 


Pro 


Asp 


Thr 


Asp 


Ala 
95 


Pro 


Gin 


Glu 


Met 


Gly 
100 


Asp 


Glu 


Asn 


Ala 


Glu 
105 


He 


Thr 


Ala 


Gin 


His 
110 


Phe 


Leu 



<210> 550 
<211> 118 
<212> PRT 

<213> Homo sapiens 



<400> 550 



Met 


Val 


Leu 


Leu 


Glu 


Ser 


Glu 


Gin 


Phe 


Leu 


Thr 


Glu 


Leu 


Thr 


Arg 


Leu 


1 








5 










10 










15 




Phe 


Gin 


Lys 


Cys 
20 


Arg 


Thr 


Ser 


Gly 


Ser 
25 


Val 


Tyr 


He 


Thr 


Leu 
30 


Lys 


Lys 


Tyr 


Asp 


Gly 
35 


Arg 


Thr 


Lys 


Pro 


He 
40 


Pro 


Lys 


Lys 


Gly 


Thr 
45 


Val 


Glu 


Gly 


Phe 


Glu 
50 


Pro 


Ala 


Asp 


Asn 


Lys 
55 


Cys 


Leu 


Leu 


Arg 


Ala 
60 


Thr 


Asp 


Gly 


Lys 


Lys 


Lys 


He 


Ser 


Thr 


Val 


Val 


Ser 


Ser 


Lys 


Glu 


Val 


Asn 


Lys 


Phe 


Gin 


65 










70 










75 










80 


Met 


Ala 


Tyr 


Ser 


Asn 

85 


Leu 


Leu 


Arg 


Ala 


Asn 
90 


Met 


Asp 


Gly 


Leu 


Lys 
95 


Lys 


Arg 


Asp 


Lys 


Lys 
100 


Asn 


Glu 


Thr 


Lys 


Lys 
105 


Thr 


Lys 


Ala 


Ala 


Ala 
110 


Ala 


Ala 


Ala 


Ala 


Thr 
115 


Ala 


Ala 


Gin 























<210> 551 
<211> 109 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 40 

<223> Xaa = Gin, Arg 
<220> 

<221> UNSURE 
<222> 10 

<223> Xaa = His, Arg 
<220> 

<221> UNSURE 
<222> 38 

<223> Xaa = Phe, He 
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<220> 

<221> UNSURE 
<222> 97 

<223> Xaa = Phe, Leu 



<400> 551 

Met Asp Ser Leu 

1 

Leu Gly Arg Asn 
20 

Lys Arg Thr Glu 
35 

Pro Val Gin Ser 
50 

Glu Phe Leu Gly 
65 

Val Leu Asp lie 

Xaa Phe Ser Ser 
100 



Pro Ser Gly Lys 
5 

Arg Lys Glu Asn 

Phe Xaa Thr Xaa 
40 

Leu Leu Asp Leu 
55 

Phe Thr Ser Tyr 
70 

Trp Glu Glu Glu 
85 

Pro Ser Thr Ser 



lie Xaa Arg Lys 
10 

Leu Glu Pro Asn 
25 

Glu Glu Asn Arg 

Phe Gin Thr Ser 
60 

Thr Glu Lys Ser 
75 

Asn Ser Asp Asn 
90 

Thr Phe Thr Gly 
105 



Val Lys lie lie 
15 

Ala Glu Phe Asp 
30 

lie Cys Ser Ser 
45 

Glu Glu Lys Ser 

Gly lie Cys Asn 
80 

Leu Leu Thr Ala 
95 

Phe 



<210> 552 
<211> 103 
<212> PRT 

<213> Homo sapiens 



<400> 552 

Met Ser Gly Asp 

1 

Lys Val Lys Lys 
20 

Asp Ala Trp Ser 

35 

Lys Ala Arg Lys 
50 

Gly Arg Phe Trp 
65 

Tyr Phe Phe Leu 

Gin Val Arg Asn 
100 



Gly Ala Thr Glu 
5 

Ala Glu Lys Lys 

lie Leu lie Arg 
40 

Thr Tyr Glu Arg 
55 

Lys Leu Tyr lie 
70 

Tyr Gin Tyr Cys 
85 

lie Ala Asn 



Gin Ala Ala Glu 
10 

Leu Glu Glu Asn 

25 

Glu Ala Gin Asn 

Leu Val Ala Gin 
60 

Glu Ala Glu Val 
75 

Ser lie His Cys 
90 



Tyr Val Pro Glu 
15 

Pro Tyr Asp Leu 
30 

Gin Pro lie Asp 
45 

Phe Pro Ser Ser 

Thr lie Leu Phe 
80 

Ser Asp Arg Lys 
95 



<210> 553 
<211> 106 
<212> PRT 
<213> Homo sapiens 



<400> 553 

Met Leu Lys Ser 

1 

Val Tyr Ala Arg 
20 

Ser Ser Ala Ala 
35 

Pro Gin Gin Gin 
50 

Ser Gly Leu Leu 
65 

Tyr Lys Met Leu 

Leu Lys Gin Arg 
100 



Asn Gly Glu Arg 
5 

Lys Met Ala Ala 

Gly Val Ser Gly 
40 

Pro Gin Pro Pro 
55 

Gin Gin Gin Gin 
70 

lie Pro Gin Leu 
85 

Glu Ala Asn Trp 



Arg Ser Arg Asn 
10 

Ser Gin Gin Gin 
25 

Pro Ser Ser Ala 

Ala Gin Leu Val 
60 

Gin Asp Phe Asp 
75 

Lys Glu Ser Leu 

90 
lie Trp 
105 



Ala Leu Pro Ala 
15 

Ala Ser Ala Ala 
30 

Gly Gly Pro Gly 
45 

Gly Pro Ala Gin 

Pro Val Gin Arg 
80 

Gin Val He Gly 
95 
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<210> 554 
<211> 86 
<212> PRT 

<213> Homo sapiens 



<400> 554 












Met 


Ala 


Glu 


Asp 


Met 


Glu 


Thr 


Lys 


1 








5 








Phe 


Asp 


Ser 


Arg 
20 


Phe 


Pro 


Asn 


Gin 


Asn 


Tyr 


Leu 
35 


Asp 


Phe 


His 


Arg 


Cys 
40 


Gly 


Asp 
50 


He 


Ser 


Val 


Cys 


Glu 
55 


Trp 


Cys 


Pro 


Thr 


Ser 


Trp 


Val 


Thr 


Asp 


65 










70 






Thr 


Phe 


Pro 


Gly 


Lys 
85 


He 







He Lys Asn Tyr Lys Thr Ala Pro 

10 15 
Asn Gin Thr Arg Asn Cys Trp Gin 
25 30 
Gin Lys Ala Met Thr Ala Lys Gly 
4 5 

Tyr Gin Arg Val Tyr Gin Ser Leu 
60 

Trp Asp Glu Gin Arg Ala Glu Gly 
75 80 



<210> 555 
<211> 105 
<212> PRT 

<213> Homo sapiens 



<400> 555 

Met Cys Val Gly 

1 

Ser Asn Leu Val 
20 

Lys Arg Ser Asp 
35 

Asn Gin Phe His 
50 

Trp Ser Val Leu 
65 

Arg He Arg Pro 

He Leu He Gly 
100 



Leu Leu Met Val 
5 

Ala Val Gin Tyr 

Lys Asn Gin Asp 
4 0 

Leu Cys Leu Gin 
55 

Ala He Leu Ser 
70 

Thr He Tyr Lys 
85 

Phe Val Leu Asn 



Ser He Leu Thr 
10 

Ser Pro Asp Tyr 
25 

Ala Ser Thr Phe 

Tyr His His Lys 
60 

Gin Leu Phe Leu 
75 

Ser Asn Lys Pro 
90 

Leu 
105 



Asp He Asn Leu 
15 

Cys Asn Phe Arg 
30 

Cys His Asn Cys 
4 5 

He Val Leu Pro 

His He Thr Ser 
80 

Lys Ser He Glu 
95 



<210> 
<211> 
<212> 
<213> 



556 
101 
PRT 
Homo 



sapiens 



<220> 

<221> UNSURE 
<222> 95 

<223> Xaa = Cys, Trp 
<220> 

<221> UNSURE 
<222> 86 

<223> Xaa = Leu, Val 



<400> 556 

Met Glu Arg Pro 

1 

Phe Arg Asn Glu 
20 

Phe Thr Ala Leu 
35 

Lys Ser Tyr He 



Asp Lys Ala Ala 
5 

Ser Ser Leu Ala 

Met lie Thr Val 
40 

Phe Glu Gly Ala 



Leu Asn Ala Leu 
10 

Ser Thr Leu Lys 
25 

Pro lie Gly Leu 
Leu Gly Met Ser 



Gin Pro Pro Glu 
15 

Thr Leu Leu Phe 
30 

Tyr Phe Thr Thr 
45 

Asn Arg Asp Ser 
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50 55 60 

Tyr Phe Tyr Ala Ala lie Val Ala Val Val Ala Val His Val Val Leu 
65 70 75 80 

Ala Leu Phe Val Tyr Xaa Ala Trp Asn Glu Gly Ser Arg Gin Xaa Arg 

85 90 95 

Glu Gly Lys Gin Asp 
100 



<210> 557 
<211> 92 
<212> PRT 

<213> Homo sapiens 



<400> 557 

Met Phe Arg Asp 

1 

Val Asp Ala Arg 
20 

Lys Pro Lys Pro 
35 

Tyr Ala Val Trp 
50 

Phe Gin Val Cys 
65 

lie Cys Arg His 



Phe Gly Arg Arg 
5 

Leu Arg Leu Ser 

Val Glu Val Gin 
40 

Phe Gly Gly Ser 
55 

His Thr Lys Lys 
70 

Asn Pro Val Phe 
85 



Leu Gin Arg Asp 
10 

Glu Glu Leu Ser 
25 

Val Val Thr His 

Met Leu Ala Ser 
60 

Asp Tyr Glu Glu 
75 

Gly Val Met Ser 
90 



Leu Lys Arg Val 
15 

Gly Gly Arg lie 
30 

His Met Gin Arg 
45 

Thr Pro Glu Phe 

Tyr Gly Pro Ser 
80 



<210> 558 
<211> 98 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 86 

<223> Xaa = * , Glu 
<220> 

<221> UNSURE 
<222> 51 

<223> Xaa = Asp, Tyr 
<400> 558 

Met Ser Val Ser Phe His Thr His Thr Lys Glu Leu Trp Thr Trp Met 

15 10 15 

Glu Asp Leu Gin Lys Glu Met Leu Glu Asp Val Cys Ala Asp Ser Val 

20 25 30 

Asp Ala Val Gin Glu Leu lie Lys Gin Phe Gin Gin Gin Gin Thr Ala 

35 40 45 

Thr Leu Xaa Ala Thr Leu Asn Val lie Lys Glu Gly Glu Asp Leu lie 

50 55 60 

Gin Gin Leu Arg Ser Ala Pro Pro Ser Leu Gly Glu Pro Ser Glu Ala 
65 70 75 80 

Arg Ser Ala Trp Ala Xaa Leu Ser Ser Gly Lys Cys Leu Gly Leu Asp 
8 5 90 95 

Val Arg 

<210> 559 
<211> 89 
<212> PRT 

<213> Homo sapiens 



<400> 559 
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Met 


Val 


Ser 


Gly 


Asp 


Gly 


Phe 


Leu 


Val 


Ser 


Arg 


Pro 


Glu 


Ala 


He 


His 


1 








5 










10 










15 




Leu 


Gly 


Pro 


Arg 


Gin 


Ala 


Val 


Arg 


Pro 


Ser 


Val 


Arg 


Ala 


Glu 


Ser 


Arg 








20 










25 










30 






Arg 


Val 


Asp 


Gly 


Gly 


Gly 


Arg 


Ser 


Pro 


Arg 


Glu 


Pro 


Asp 


Gly 


Arg 


Gly 






35 










40 










45 








Arg 


Ser 


Arg 


Gin 


Ala 


Arg 


Phe 


Ser 


Pro 


Tyr 


Pro 


lie 


Pro 


Ala 


val 


blU 




50 










55 










60 










Pro 


Asp 


Leu 


Leu 


Arg 


Ser 


Val 


Leu 


Gin 


Gin 


Arg 


Leu 


He 


Ala 


Leu 


Gly 


65 










70 










75 










80 


Gly Val 


He 


Ala 


Ala 


Arg 


He 


Ser 


Val 
























85 
























<210> 560 




























<211> 172 




























<212> PRT 




























<213> Homo sapiens 
























<400> 560 




























Met 


Ala 


Gly 


Ala 


Ala 


Thr 


Gin 


Ala 


Ser 


Leu 


Glu 


Ser 


Ala 


Pro 


Arg 


He 


1 








5 










10 










15 




Met 


Arg 


Leu 


Val 


Ala 


Glu 


Cys 


Ser 


Arg 


Ser 


Arg 


Ala 


Arg 


Ala 


Gly 


Glu 








20 










25 










30 






Leu 


Trp 


Leu 


Pro 


His 


Gly 


Thr 


Val 


Ala 


Thr 


Pro 


Val 


Phe 


Met 


Pro 


Val 






35 










40 










45 








Gly 


Thr 


Gin 


Ala 


Thr 


Met 


Lys 


Gly 


He 


Thr 


Thr 


Glu 


Gin 


Leu 


Asp 


Ala 




50 










55 










60 










Leu 


Gly 


Cys 


Arg 


He 


Cys 


Leu 


Gly 


Asn 


Thr 


Tyr 


His 


Leu 


Gly 


Leu 


Arg 


65 










70 










75 










80 


Pro 


Gly 


Pro 


Glu 


Leu 


He 


Gin 


Lys 


Ala 


Asn 


Gly 


Leu 


His 


Gly 


Phe 


Met 










85 










90 










95 




Asn 


Trp 


Pro 


His 


Asn 


Leu 


Leu 


Thr 


Asp 


Ser 


Gly 


Gly 


Phe 


Gin 


Met 


Val 








100 










105 










110 






Ser 


Leu 


Val 


Ser 


Leu 


Ser 


Glu 


Val 


Thr 


Glu 


Glu 


Gly 


Val 


Arg 


Phe 


Arg 






115 










120 










125 








Ser 


Pro 


Tyr 


Asp 


Gly 


Asn 


Glu 


Thr 


Leu 


Leu 


Ser 


Pro 


Glu 


Lys 


Ser 


Val 




130 










135 










140 










Gin 


He 


Gin 


Asn 


Ala 


Leu 


Gly 


Ser 


Asp 


He 


He 


Met 


Gin 


Leu 


Asp 


Asp 


145 










150 










155 










160 


Val 


Val 


Ser 


Ser 


Thr 


Val 


Thr 


Gly 


Pro 


Arg 


Val 


Glu 


















165 










170 















<210> 561 
<211> 528 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 57. .527 

<220> 

<221> sig_peptide 

<222> 57 . . 143 

<223> Von Heijne matrix 

score 14.3000001907349 

seq LLVCCLLTPGVQG/QE 

<400> 561 

cttttttccc ctgcctgccc ttcgggcacc tcaggaaggc accttcctct gtcaga atg 59 

Met 

get acc atg gta cca tec gtg ttg tgg ccc agg gec tgc tgg act ctg 107 
Ala Thr Met Val Pro Ser Val Leu Trp Pro Arg Ala Cys Trp Thr Leu 
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-25 -20 -15 

ctg gtc tgc tgt ctg ctg acc cca ggt gtc cag ggg cag gag ttc ctt 155 

Leu Val Cys Cys Leu Leu Thr Pro Gly Val Gin Gly Gin Glu Phe Leu 

-10 -5 1 

ttg egg gtg gag ccc cag aac cct gtg etc tct get gga ggg tec ctg 203 

Leu Arg Val Glu Pro Gin Asn Pro Val Leu Ser Ala Gly Gly Ser Leu 

5 10 15 20 

ttt gtg aac tgc agt act gat tgt ccc age tct gag aaa ate gee ttg 251 

Phe Val Asn Cys Ser Thr Asp Cys Pro Ser Ser Glu Lys lie Ala Leu 

25 30 35 

gag acg tec eta tea aag gag ctg gtg gee agt ggc atg ggc tgg gca 299 

Glu Thr Ser Leu Ser Lys Glu Leu Val Ala Ser Gly Met Gly Trp Ala 

40 45 50 

gee ttc aat etc age aac gtg act ggc aac agt egg ate etc tgc tea 347 

Ala Phe Asn Leu Ser Asn Val Thr Gly Asn Ser Arg lie Leu Cys Ser 

55 60 65 

gtg tac tgc aat ggc tec cag ata aca ggc tec tct aac ate acc gtg 395 

Val Tyr Cys Asn Gly Ser Gin lie Thr Gly Ser Ser Asn lie Thr Val 

70 75 80 

tac agg etc ccg gag cgt gtg gag ctg gca ccc ctg cct cct tgg cag 443 

Tyr Arg Leu Pro Glu Arg Val Glu Leu Ala Pro Leu Pro Pro Trp Gin 

85 90 95 100 

ccg gtg ggc cag aac ttc acc cct gcg ctg cca agt gga gga tgg gtc 4 91 

Pro Val Gly Gin Asn Phe Thr Pro Ala Leu Pro Ser Gly Gly Trp Val 

105 110 115 

gee ccg gac cak cct cac ggt ggt get get teg ctg g 528 

Ala Pro Asp Xaa Pro His Gly Gly Ala Ala Ser Leu 

120 125 



<210> 562 
<211> 682 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 230. .682 



<220> 

<221> sig_peptide 
<222> 230. .277 
<223> Von Heijne matrix 
score 13 

seq LLLLLSALAGLFG/AA 



<400> 562 

gtgtttcgat acagactaag ettttaggee aaccctcctg actggatggg ggeggeggge 60 
gtggcatgca tgaaaagtaa acatcagaga cctgaagaag cttataaaat agcttgggag 120 
aggecagtea ccaagacagg catctcaaat eggctgatte tgcatctgga aactgccttc 180 
atcttgaaag aaaagctcca ggtcccttct ccagccaccc agccccaag atg gtg atg 238 

Met Val Met 
-15 



ctg 


ctg 


ctg 


ctg 


ctt 


tec 


gca 


ctg 


get 


ggc 


etc 


ttc 


ggt 


gcg 


gca 


gag 


286 


Leu 


Leu 


Leu 


Leu 
-10 


Leu 


Ser 


Ala 


Leu 


Ala 
-5 


Gly 


Leu 


Phe 


Gly 


Ala 
1 


Ala 


Glu 




gga 


caa 


gca 


ttt 


cat 


ctt 


ggg 


aag 


tgc 


ccc 


aat 


cct 


ccg 


gtg 


cag 


gag 


334 


Gly 


Gin 


Ala 


Phe 


His 


Leu 


Gly 


Lys 


Cys 


Pro 


Asn 


Pro 


Pro 


Val 


Gin 


Glu 




5 










10 










15 












aat 


ttt 


gac 


gtg 


aat 


aag 


tat 


etc 


gga 


aga 


tgg 


tac 


gaa 


att 


gag 


aag 


382 


Asn 


Phe 


Asp 


Val 


Asn 


Lys 


Tyr 


Leu 


Gly 


Arg 


Trp 


Tyr 


Glu 


lie 


Glu 


Lys 




20 










25 










30 










35 




ate 


cca 


aca 


acc 


ttt 


gag 


aat 


gga 


cgc 


tgc 


ate 


cag 


gee 


aac 


tac 


tea 


430 


lie 


Pro 


Thr 


Thr 


Phe 


Glu 


Asn 


Gly Arg 


Cys 


lie 


Gin 


Ala 


Asn 


Tyr 


Ser 
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40 










45 










50 






eta 


atg 


gaa 


aac 


gga 


aag 


ate 


aaa 


gtg 


tta 


aac 


cag 


gag 


ttg 


aga 


get 


478 


Leu 


Met 


Glu 


Asn 
55 


Gly 


Lys 


He 


Lys 


Val 
60 


Leu 


Asn 


Gin 


Glu 


Leu 
65 


Arg 


Ala 




gat 


gga 


act 


gtg 


aat 


caa 


ate 


gaa 


ggt 


gaa 


gee 


acc 


cca 


gtt 


aac 


etc 


526 


Asp 


Gly 


Thr 
70 


Val 


Asn 


Gin 


He 


Glu 

75 


Gly 


Glu 


Ala 


Thr 


Pro 
80 


Val 


Asn 


Leu 




aca 


gag 


cct 


gee 


aag 


ctg 


gaa 


gtt 


aag 


ttt 


tec 


tgg 


ttt 


atg 


cca 


teg 


574 


Thr 


Glu 
85 


Pro 


Ala 


Lys 


Leu 


Glu 
90 


Val 


Lys 


Phe 


Ser 


Trp 

95 


Phe 


Met 


Pro 


Ser 




gca 


ccg 


tac 


tgg 


ate 


ctg 


gee 


acc 


gac 


tat 


gag 


aac 


tat 


gee 


etc 


gtg 


622 


Ala 


Pro 


Tyr 


Trp 


.He 


Leu 


Ala 


Thr 


Asp 


Tyr 


Glu 


Asn 


Tyr 


Ala 


Leu 


Val 




100 










105 










110 










115 




tat 


tec 


tgt 


acc 


tgc 


ate 


ate 


caa 


ctt 


ttt 


cac 


gtg 


gat 


ttt 


get 


tgg 


670 


Tyr 


Ser 


Cys 


Thr 


Cys 
120 


He 


He 


Gin 


Leu 


Phe 
125 


His 


Val 


Asp 


Phe 


Ala 
130 


Trp 




ate 


ttg 


gca 


aga 


























682 


He 


Leu 


Ala 


Arg 
135 





























<210> 563 
<211> 500 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 72 . . 500 



<220> 

<221> sig_peptide 

<222> 72 . . 122 

<223> Von Heijne matrix 

score 10.6999998092651 
seq VLMLLAVLIWTGA/EN 



<220> 

<221> misc_feature 
<222> 1. .3 

<223> n=a, g, c or t 



<400> 563 

nnngaacatc taatcgaaaa cttgtttcct gagttgtcct gtgctggagg tetgetcaga 60 
cgaaggtctc c atg gcg tta gaa gtc ttg atg etc etc get gtc ttg att 110 
Met Ala Leu Glu Val Leu Met Leu Leu Ala Val Leu He 
-15 -10 -5 



tgg 


acc 


ggt 


get 


gag 


aac 


etc 


cat 


gtg 


aaa 


ata 


agt 


tgc 


tct 


ctg 


gac 


158 


Trp 


Thr 


Gly 


Ala 


Glu 
1 


Asn 


Leu 


His 


Val 
5 


Lys 


lie 


Ser 


Cys 


Ser 
10 


Leu 


Asp 




tgg 


ttg 


atg 


gtc 


tea 


gtt 


ate 


cca 


gtt 


gca 


gaa 


age 


aga 


aat 


ctg 


tat 


206 


Trp 


Leu 


Met 


Val 


Ser 


Val 


He 


Pro 


Val 


Ala 


Glu 


Ser 


Arg 


Asn 


Leu 


Tyr 






15 










20 










25 










ata 


ttt 


gcg 


gat 


gaa 


tta 


cat 


ctg 


gga 


atg 


ggc 


tgc 


cct 


gca 


aat 


egg 


254 


He 


Phe 
30 


Ala 


Asp 


Glu 


Leu 


His 
35 


Leu 


Gly 


Met 


Gly 


Cys 
40 


Pro 


Ala 


Asn 


Arg 




ata 


cat 


aca 


tat 


gta 


tat 


gag 


ttt 


ata 


tat 


ctt 


gtt 


cgt 


gat 


tgt 


ggc 


302 


He 


His 


Thr 


Tyr 


Val 


Tyr 


Glu 


Phe 


He 


Tyr 


Leu 


Val 


Arg 


Asp 


Cys 


Gly 




45 










50 










55 










60 




ate 


agg 


aca 


agg 


gta 


gtt 


tct 


gag 


gaa 


act 


etc 


ctt 


ttt 


caa 


acc 


gag 


350 


He 


Arg 


Thr 


Arg 


Val 
65 


Val 


Ser 


Glu 


Glu 


Thr 
70 


Leu 


Leu 


Phe 


Gin 


Thr 
75 


Glu 




ctg 


tac 


ttt 


acc 


cca 


agg 


aat 


ata 


gat 


cat 


gac 


cct 


cag 


gaa 


ate 


cat 


398 


Leu 


Tyr 


Phe 


Thr 


Pro 


Arg 


Asn 


He 


Asp 


His 


Asp 


Pro 


Gin 


Glu 


He 


His 
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80 85 
ttg gag tgt tec acc tct agg aaa tea gtg tgg 
Leu Glu Cys Ser Thr Ser Arg Lys Ser Val Trp 

95 100 
act gag aat gaa ata aaa ttg gat cct agt cct 
Thr Glu Asn Glu lie Lys Leu Asp Pro Ser Pro 

110 115 
cag aca 
Gin Thr 
125 

<210> 564 
<211> 497 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 52. .495 

<220> 

<221> sig_peptide 

<222> 52. .102 

<223> Von Heijne matrix 

score 10.6999998092651 

seq VLMLLAVLIWTGA/EN 



<400> 564 

cttgtttcct gagttgtcct gtgctggagg tetgetcaga cgaaggtctc c atg gcg 57 

Met Ala 



tta 


gaa 


gtc 


ttg 


atg 


etc 


etc 


get 


gtc 


ttg 


att 


tgg 


acc 


ggt 


get 


gag 


105 


Leu 


Glu 


Val 


Leu 


Met 


Leu 


Leu 


Ala 


Val 


Leu 


He 


Trp 


Thr 


Gly 


Ala 


Glu 




-15 










-10 










-5 










1 




aac 


etc 


cat 


gtg 


aaa 


ata 


agt 


tgc 


tct 


ctg 


gac 


tgg 


ttg 


atg 


gtc 


tea 


153 


Asn 


Leu 


His 


Val 


Lys 


He 


Ser 


Cys 


Ser 


Leu 


Asp 


Trp 


Leu 


Met 


Val 


Ser 










5 










10 










15 








gtt 


ate 


cca 


gtt 


gca 


gaa 


age 


aga 


aat 


ctg 


tat 


ata 


ttt 


gcg 


gat 


gaa 


201 


Val 


He 


Pro 


Val 


Ala 


Glu 


Ser 


Arg 


Asn 


Leu 


Tyr 


He 


Phe 


Ala 


Asp 


Glu 








20 










25 










30 










tta 


cat 


ctg 


gga 


atg 


ggc 


tgc 


cct 


gca 


aat 


egg 


ata 


cat 


aca 


tat 


gta 


249 


Leu 


His 


Leu 


Gly 


Met 


Gly 


Cys 


Pro 


Ala 


Asn 


Arg 


He 


His 


Thr 


Tyr 


Val 






35 










40 










45 












tat 


gag 


ttt 


ata 


tat 


ctt 


gtt 


cgt 


gat 


tgt 


ggc 


ate 


agg 


aca 


agg 


gta 


297 


Tyr 


Glu 


Phe 


He 


Tyr 


Leu 


Val 


Arg 


Asp 


Cys 


Gly 


He 


Arg 


Thr 


Arg 


Val 




50 










55 










60 










65 




gtt 


tct 


gag 


gaa 


act 


etc 


ctt 


ttt 


caa 


acc 


gag 


ctg 


tac 


ttt 


acc 


cca 


345 


Val 


Ser 


Glu 


Glu 


Thr 


Leu 


Leu 


Phe 


Gin 


Thr 


Glu 


Leu 


Tyr 


Phe 


Thr 


Pro 












70 










75 










80 






agg 


aat 


ata 


gat 


cat 


gac 


cct 


cag 


gaa 


ate 


cat 


ttg 


gag 


tgt 


tec 


acc 


393 


Arg 


Asn 


He 


Asp 


His 


Asp 


Pro 


Gin 


Glu 


He 


His 


Leu 


Glu 


Cys 


Ser 


Thr 










85 










90 










95 








tct 


agg 


aaa 


tea 


gtg 


tgg 


ctt 


aca 


cca 


gtt 


tct 


act 


gag 


aat 


gaa 


ata 


441 


Ser 


Arg 


Lys 


Ser 


Val 


Trp 


Leu 


Thr 


Pro 


Val 


Ser 


Thr 


Glu 


Asn 


Glu 


He 








100 










105 










110 










aaa 


ttg 


gat 


cct 


agt 


cct 


ttt 


att 


get 


gac 


ttt 


cag 


aca 


aca 


gca 


gaa 


489 


Lys 


Leu 


Asp 


Pro 


Ser 


Pro 


Phe 


He 


Ala 


Asp 


Phe 


Gin 


Thr 


Thr 


Ala 


Glu 





115 120 125 

gag tta gg 497 
Glu Leu 
130 



PCT/IB01/00914 

90 

ctt aca cca gtt tct 446 
Leu Thr Pro Val Ser 
105 

ttt att get gac ttt 494 

Phe He Ala Asp Phe 

120 

500 



<210> 565 
<211> 342 



WO 02/083898 



PCT/IB01/00914 



304 



<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 118 . . 342 



<220> 

<221> sig_peptide 

<222> 118 . . 189 

<223> Von Heijne matrix 

score 10.6000003814697 
seq LLLLLLCPAHVGG/LW 



<400> 565 

tccccggccg ccgccgttgc gctcgccgcg ctcgcactga agcccgggcc ctcgcgcgcc 60 
gcggttcgcc ccgcagcctc gccccctgcc cacccgggcg gccgtagggc ggtcacg 117 
atg ctg ccg ccc tta ccc tec cgc etc ggg ctg ctg ctg ctg ctg etc 165 
Met Leu Pro Pro Leu Pro Ser Arg Leu Gly Leu Leu Leu Leu Leu Leu 

-20 -15 -10 

ctg tgc ccg gcg cac gte ggc gga ctg tgg tgg get gtg ggc age ccc 213 
Leu Cys Pro Ala His Val Gly Gly Leu Trp Trp Ala Val Gly Ser Pro 

-5 1 5 

ttg gtt atg gac cct acc age ate tgc agg aag gca egg egg ctg gcc 261 
Leu Val Met Asp Pro Thr Ser lie Cys Arg Lys Ala Arg Arg Leu Ala 

10 15 20 

ggg egg cag gcc gag ttg tgc cag get gag ccg gaa gtg gtg gca gag 309 
Gly Arg Gin Ala Glu Leu Cys Gin Ala Glu Pro Glu Val Val Ala Glu 
25 30 " 35 4 0 

ctg get egg ggc gcc egg etc ggg gtg cga rag 342 
Leu Ala Arg Gly Ala Arg Leu Gly Val Arg Xaa 
45 ~ 50 



<210> 566 
<211> 434 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 242. .433 



<220> 

<221> sig_peptide 

<222> 242. .307 

<223> Von Heijne matrix 

score 10.1999998092651 
seq LLALAALLQGAVS/LK 



<400> 566 

acatttgccc cagggaaggt cacagctgcc tgaactbbtc aaaactccca gacacgcact 60 
gcctgtgcag gatceggage ccagcagcac tgccagggcc ttgaagtgct tattcagaga 120 
cctttcttca tagactactt ttttttcttt aagcagcaaa aggagaaaat tgtcatcaaa 180 
ggatattcca gattcttgac agcattctcg tcatctctga ggacatcacc atcatctcag 240 
g atg agg ggc atg aag ctg ctg ggg gcg ctg ctg gca ctg gcg gcc eta 289 
Met Arg Gly Met Lys Leu Leu Gly Ala Leu Leu Ala Leu Ala Ala Leu 
-20 ' -15 -10 

ctg cag ggg gcc gtg tec ctg aag ate gca gcc ttc aac ate cag aca 337 
Leu Gin Gly Ala Val Ser Leu Lys lie Ala Ala Phe Asn lie Gin Thr 

-5 1 5 10 

ttt ggg gag acc aag atg tec aat gcc acc etc gte age tac att gtg 385 
Phe Gly Glu Thr Lys Met Ser Asn Ala Thr Leu Val Ser Tyr lie Val 
15 20 25 
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cag ate ctg age cgc tat gac ate gee ctg gtc cag gag gtc aga gac a 434 
Gin lie Leu Ser Arg Tyr Asp lie Ala Leu Val Gin Glu Val Arg Asp 
30 ~ 35 40 

<210> 567 
<211> 487 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 136. . 486 

<220> 

<221> sig_peptide 
<222> 136. .234 
<223> Von Heijne matrix 
score 8.5 

seq FFFVLGSLI FCFG/ IW 
<400> 567 

ctcttttggg gttcttcctt tctctctcag ctctccgtct ctctttctct ctcagcctct 60 
ttctttctcc ctgtctcccc cactgtcagc acctcttctg tgtggtgagt ggacegctta 120 
ccccactagg tgaag atg tea gee cag gag age tgc etc age etc ate aag 171 
Met Ser Ala Gin Glu Ser Cys Leu Ser Leu lie Lys 
-30 -25 



tac 


ttc 


etc 


ttc 


gtt 


ttc 


aac 


etc 


ttc 


ttc 


ttc 


gtc 


etc 


ggc 


age 


ctg 


219 


Tyr 


Phe 


Leu 


Phe 


Val 


Phe 


Asn 


Leu 


Phe 


Phe 


Phe 


Val 


Leu 


Gly 


Ser 


Leu 






-20 










-15 










-10 












ate 


ttc 


tgc 


ttc 


ggc 


ate 


tgg 


ate 


etc 


att 


gac 


aag 


acc 


age 


ttc 


gtg 


267 


He 


Phe 


Cys 


Phe 


Gly 


He 


Trp 


He 


Leu 


He 


Asp 


Lys 


Thr 


Ser 


Phe 


Val 




-5 










1 








5 










10 






tec 


ttt 


gtg 


ggc 


ttg 


gee 


ttc 


gtg 


cct 


ctg 


cag 


ate 


tgg 


tec 


aaa 


gtc 


315 


Ser 


Phe 


Val 


Gly 


Leu 


Ala 


Phe 


Val 


Pro 


Leu 


Gin 


He 


Trp 


Ser 


Lys 


Val 










15 










20 










25 








ctg 


gee 


ate 


tea 


gga 


ate 


ttc 


acc 


atg 


ggc 


ate 


gee 


etc 


ctg 


ggt 


tgt 


363 


Leu 


Ala 


He 


Ser 


Gly 


He 


Phe 


Thr 


Met 


Gly 


He 


Ala 


Leu 


Leu 


Gly 


Cys 








30 










35 










40 










gtg 


ggg 


gee 


tea 


agg 


age 


tec 


get 


gee 


tec 


tgg 


gee 


tgt 


att 


ttg 


gga 


411 


Val 


Gly 


Ala 


Ser 


Arg 


Ser 


Ser 


Ala 


Ala 


Ser 


Trp 


Ala 


Cys 


He 


Leu 


Gly 






45 










50 










55 












tgc 


tgc 


tgc 


tec 


tgt 


ttg 


cca 


cac 


aga 


tea 


ccc 


tgg 


gaa 


tec 


tea 


tct 


459 


Cys 


Cys 


Cys 


Ser 


Cys 


Leu 


Pro 


His 


Arg 


Ser 


Pro 


Trp 


Glu 


Ser 


Ser 


Ser 




60 










65 










70 










75 




cca 


etc 


age 


ggg 


cca 


get 


gra 


gcg 


aag 


c 














487 


Pro 


Leu 


Ser 


Gly 


Pro 


Ala 


Xaa 


Ala 


Lys 



















80 



<210> 568 
<211> 478 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 233. . 478 

<220> 

<221> sig_peptide 

<222> 233. .328 

<223> Von Heijne matrix 

score 8.19999980926514 
seq LVAAMVLLSVVFC/LY 
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<400> 568 

atgaatgtca gaataggccc attctaccag ctctggctga gcctgagctt ccaaaagtga 60 

gctgagctgt tcaaccttgg atcttaatta ctcctagcag ggataattag gtccctcttt 120 

ctcagattac aggtttgaga agttccaaat atccgatgcc ctagaggttt ttttccctta 180 

attttattgt tacaattggc cttcatctag ctctggatgg ttgaactgta gc atg gca 238 

Met Ala 



aag 


atg 


ttt 


gat 


etc 


agg 


acg 


aag 


ate 


atg 


ate 


ggc 


ate 


gga 


age 


age 


286 


Lys 


Met 


Phe 


Asp 


Leu 


Arg 


Thr 


Lys 


He 


Met 


He 


Gly 


He 


Gly 


Ser 


Ser 




-30 










-25 










-20 










-15 




tta 


ctg 


gtt 


gec 


gcg 


atg 


gtg 


etc 


eta 


agt 


gtt 


gtg 


ttc 


tgt 


ctt 


tac 


334 


Leu 


Leu 


Val 


Ala 


Ala 
-10 


Met 


Val 


Leu 


Leu 


Ser 
-5 


Val 


Val 


Phe 


Cys 


Leu 
1 


Tyr 




ttc 


aaa 


gta 


get 


aag 


gca 


eta 


aaa 


get 


gca 


aag 


gac 


cct 


gat 


get 


gtg 


382 


Phe 


Lys 


Val 
5 


Ala 


Lys 


Ala 


Leu 


Lys 
10 


Ala 


Ala 


Lys 


Asp 


Pro 
15 


Asp 


Ala 


Val 




get 


gta 


aaa 


aat 


cac 


aac 


cca 


gac 


aag 


gtg 


tgt 


tgg 


gee 


acg 


aac 


age 


430 


Ala 


Val 
20 


Lys 


Asn 


His 


Asn 


Pro 

25 


Asp 


Lys 


Val 


Cys 


Trp 
30 


Ala 


Thr 


Asn 


Ser 




cak 


gec 


aaa 


gec 


ace 


acc 


atg 


gag 


tct 


tgt 


cca 


tct 


etc 


cag 


tgc 


tgt 


478 


Xaa 


Ala 


Lys 


Ala 


Thr 


Thr 


Met 


Glu 


Ser 


Cys 


Pro 


Ser 


Leu 


Gin 


Cys 


Cys 




35 










40 










45 










50 





<210> 569 
<211> 469 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 290. .469 



<220> 

<221> sig_peptide 
<222> 290. .346 
<223> Von Heijne matrix 
score 7 . 5 

seq LAVFFLVSTQLFA/EE 



<400> 569 

agatgttatg 

agaggaggcg 

ctggaggctc 

tcggacctcc 

aaccagagaa 



ggcatcgacg 
gctaattaaa 
aaggaggctg 
ttcccggcgg 
actcagcacc 



agttaccgtc 
tattgagcag 
ggataaatac 
cagctaccga 
ccgcgggact 



teggaaaetc 
aaagtcgcgt 
cgcaagcact 
gagtgeggas 
gtccgtcgca 



etc 


gtg 


gee 


ttg 


gca 


gtc 


ttt 


ttt 


ctt 


gtc 


tec 


Leu 


Val 


Ala 


Leu 


Ala 


Val 


Phe 


Phe 


Leu 


Val 


Ser 




-15 










-10 










gaa 


gaa 


ata 


gga 


gee 


aat 


gat 


gat 


ctg 


aat 


tac 


Glu 


Glu 


He 


Gly Ala 


Asn 


Asp 


Asp 


Leu 


Asn 


Tyr 


1 








5 










10 




gac 


age 


gac 


cag 


ate 


aag 


gag 


gaa 


ctg 


ccg 


gag 


Asp 


Ser 


Asp 


Gin 


He 


Lys 


Glu 


Glu 


Leu 


Pro 


Glu 








20 










25 






ctg 


cag 


aga 


ate 


gec 


egg 


aga 


ccc 


aag 






Leu 


Gin 


Arg 


He 


Ala 


Arg 


Arg 


Pro 


Lys 







tcaatcacgc aagegaaagg 
ggggagaatg tcacgtgggt 
gagcaggega aagagegege 
gaecagegtg egcteggagg 
aaatccaac atg aaa ate 
Met Lys He 
act cag ctg ttt gca 
Thr Gin Leu Phe Ala 
-5 

tgg tec gac tgg tac 
Trp Ser Asp Trp Tyr 
15 

ccc ttt gag cat ctt 
Pro Phe Glu His Leu 
30 



60 
120 
180 
240 
298 

346 



394 



442 



469 



35 



40 



<210> 570 
<211> 489 
<212> DNA 
<213> Homo sapiens 
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<220> 
<221> CDS 
<222> 253. .489 



<220> 

<221> sig_peptide 

<222> 253. .336 

<223> Von Heijne matrix 

score 7.40000009536743 
seq CLLLNLAPAPLNA/DS 



<400> 570 

ctctttgctc taacagacag cagcgacttt aggctggata atagtcaaat tcttacctcg 60 
ctctttcact gctagtaaga tcagattgcg tttctttcag ttactcttca atcgccagtt 120 
tcttgatctg cttctaaaag aagaagtaga gaagataaat cctgtcttca atacctggaa 180 
ggaaaaacaa aataacctca actccgtttt gaaaaaaaca ttccaagaac tttcatcaga 240 
gattttactt ag atg att tac aca atg aag aaa gta cat gca ctt tgg get 291 
Met lie Tyr Thr Met Lys Lys Val His Ala Leu Trp Ala 
-25 -20 



tct 


gta 


tgc 


ctg 


ctg 


ctt 


aat 


ctt 


gec 


cct 


gec 


cct 


ctt 


aat 


get 


gat 


339 


Ser 


Val 


Cys 


Leu 


Leu 


Leu 


Asn 


Leu 


Ala 


Pro 


Ala 


Pro 


Leu 


Asn 


Ala 


Asp 




-15 










-10 










-5 










1 




tct 


gag 


gaa 


gat 


gaa 


gaa 


cac 


aca 


att 


ate 


aca 


gat 


acg 


gag 


ttg 


cca 


387 


Ser 


Glu 


Glu 


Asp 


Glu 


Glu 


His 


Thr 


He 


He 


Thr 


Asp 


Thr 


Glu 


Leu 


Pro 










5 










10 










15 








cca 


ctg 


aaa 


ctt 


atg 


cat 


tea 


ttt 


tgt 


gca 


ttc 


aag 


gcg 


gat 


gat 


ggc 


435 


Pro 


Leu 


Lys 


Leu 


Met 


His 


Ser 


Phe 


Cys 


Ala 


Phe 


Lys 


Ala 


Asp 


Asp 


Gly 








20 










25 










30 










cca 


tgt 


aaa 


gca 


ate 


atg 


aaa 


aga 


ttt 


tct 


tea 


ata 


ttt 


tea 


etc 


gac 


483 


Pro 


Cys 


Lys 


Ala 


He 


Met 


Lys 


Arg 


Phe 


Ser 


Ser 


He 


Phe 


Ser 


Leu 


Asp 





35 40 45 

agt gcg 489 
Ser Ala 
50 



<210> 571 
<211> 484 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 256. .483 



<220> 

<221> sig_peptide 

<222> 256. . 327 

<223> Von Heijne matrix 

score 7.09999990463257 
seq FLFLFNLIFFILG/AV 



<400> 571 

ggggegggge ctgccgagtc cgcggcgttc cccggctgca geegggaggg ggecgaggag 60 
tgactgagee ccgggctgtg cagtccgacg ccgactgagg cacgageggg tgacgctggg 120 
cctgcagcgc ggagcagaaa gcagaacccg cagagtcctc cctgctgctg tgtggacgac 180 
acgtgggcac aggcagaagt gggccctgtg accagctgca ctggtttcgt ggaaggaagc 240 
tccaggactg gcggg atg ggc tea gee tgt ate aaa gtc ace aaa tac ttt 291 
Met Gly Ser Ala Cys lie Lys Val Thr Lys Tyr Phe 
-20 -15 
etc ttc etc ttc aac ttg ate ttc ttt ate ctg ggc gca gtg ate ctg 339 
Leu Phe Leu Phe Asn Leu He Phe Phe He Leu Gly Ala Val He Leu 
-10 -5 1 
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ggc ttc ggg gtg tgg ate ctg gec gac aag age agt ttc ate tct gtc 387 

Gly Phe Gly Val Trp lie Leu Ala Asp Lys Ser Ser Phe lie Ser Val 

5 10 15 20 

ctg caa ace tec tec age teg ctt agg atg ggg gee tat gtc ttc ate 435 

Leu Gin Thr Ser Ser Ser Ser Leu Arg Met Gly Ala Tyr Val Phe lie 

25 30 35 

ggc gtg ggg gca gtc act atg etc atg ggc ttc ctg ggc tgc ate ggc g 484 

Gly Val Gly Ala Val Thr Met Leu Met Gly Phe Leu Gly Cys lie Gly 
40 45 50 



<210> 572 
<211> 482 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 29. .481 



<220> 

<221> sig_peptide 
<222> 29. . 82 
<223> Von Heijne matrix 
score 7 

seq VLGLVLLSVTVQG/KV 



<400> 572 

agcctagcac tctgacctag cagtcaac atg aag get etc att gtt ctg ggg 52 

Met Lys Ala Leu lie Val Leu Gly 
-15 



ctt 


gtc 


etc 


ctt 


tct 


gtt 


acg 


gtc 


cag 


ggc 


aag 


gtc 


ttt 


gaa 


agg 


tgt 


100 


Leu 


Val 


Leu 


Leu 


Ser 


Val 


Thr 


Val 


Gin 


Gly 


Lys 


Val 


Phe 


Glu 


Arg 


Cys 




-10 










-5 










1 








5 






gag 


ttg 


gee 


aga 


act 


ctg 


aaa 


aga 


ttg 


gga 


atg 


gat 


ggc 


tac 


agg 


gga 


148 


Glu 


Leu 


Ala 


Arg 


Thr 


Leu 


Lys 


Arg 


Leu 


Gly 


Met 


Asp 


Gly 


Tyr 


Arg 


Gly 










10 










15 










20 








ate 


age 


eta 


gca 


aac 


tgg 


atg 


tgt 


ttg 


gee 


aaa 


tgg 


gag 


agt 


ggt 


tac 


196 


He 


Ser 


Leu 


Ala 


Asn 


Trp 


Met 


Cys 


Leu 


Ala 


Lys 


Trp 


Glu 


Ser 


Gly 


Tyr 








25 










30 










35 










aac 


aca 


cga 


get 


aca 


aac 


tac 


aat 


get 


gga 


gac 


aga 


age 


act 


gat 


tat 


244 


Asn 


Thr 


Arg 


Ala 


Thr 


Asn 


Tyr 


Asn 


Ala 


Gly 


Asp 


Arg 


Ser 


Thr 


Asp 


Tyr 






40 










45 










50 












ggg 


ata 


ttt 


cag 


ate 


aat 


age 


cgc 


tac 


tgg 


tgt 


aat 


gat 


ggc 


aaa 


acc 


292 


Gly 


He 


Phe 


Gin 


He 


Asn 


Ser 


Arg 


Tyr 


Trp 


Cys 


Asn 


Asp 


Gly 


Lys 


Thr 




55 










60 










65 










70 




cca 


gga 


gca 


gtt 


aat 


gee 


tgt 


cat 


tta 


tec 


tgc 


agt 


get 


ttg 


ctg 


caa 


340 


Pro 


Gly 


Ala 


Val 


Asn 


Ala 


Cys 


His 


Leu 


Ser 


Cys 


Ser 


Ala 


Leu 


Leu 


Gin 












75 










80 










85 






gat 


aac 


ate 


get 


gat 


get 


gta 


get 


tgt 


gca 


aag 


agg 


gtt 


gtc 


cgt 


gat 


388 


Asp 


Asn 


He 


Ala 


Asp 


Ala 


Val 


Ala 


Cys 


Ala 


Lys 


Arg 


Val 


Val 


Arg 


Asp 










90 










95 










100 








cca 


ccc 


acc 


teg 


gee 


tec 


caa 


agt 


get 


ggg 


att 


aca 


ggc 


gtg 


asc 


aac 


436 


Pro 


Pro 


Thr 


Ser 


Ala 


Ser 


Gin 


Ser 


Ala 


Gly 


He 


Thr 


Gly 


Val 


Xaa 


Asn 








105 










110 










115 










tgc 


gee 


egg 


cca 


cat 


tea 


gtt 


ctt 


ate 


aaa 


gaa 


ata 


acc 


cag 


act 


t 


482 


Cys 


Ala 


Arg 


Pro 


His 


Ser 


Val 


Leu 


He 


Lys 


Glu 


He 


Thr 


Gin 


Thr 







120 125 130 



<210> 573 
<211> 541 
<212> DNA 
<213> Homo sapiens 



WO 02/083898 



PCT/IB01/00914 



309 

<220> 
<221> CDS 
<222> 294 . . 539 



<220> 

<221> sig_peptide 
<222> 294 . . 380 
<223> Von Heijne matrix 
score 7 

seq LLMLFHTMAQIMA/EQ 



<400> 573 

ataactatgt 

taactctcaa 

tgcgggaggc 

aacagatgcg 

tctctggcgg 



gacattactt 
aaacttctga 
cctcctcccc 
ttttcagaag 
cctctgcagc 



agggcataat 
ttttcttgca 
cgaatcagca 
acagtactcc 
agcacagccg 



gat etc 
Asp Leu 

ctg atg 
Leu Met 

gaa aat 
Glu Asn 
5 

gtg egg 
Val Arg 

ttt att 
Phe He 

aca ga 
Thr 



caa gga aga ggg gtc ccc 
Gin Gly Arg Gly Val Pro 
-25 

ttg ttc cat aca atg get 
Leu Phe His Thr Met Ala 
-10 -5 
etc tea ggc ctt tec act 
Leu Ser Gly Leu Ser Thr 
10 

gaa aat ggg acg acc tgt 
Glu Asn Gly Thr Thr Cys 
25 

gta cct tat gat gtg tgg 
Val Pro Tyr Asp Val Trp 
40 



tctaaaattt 
tccctttttg 
acaatttttg 
agtgataagc 
gcctcattcg 

age ate gac 
Ser He Asp 
-20 

caa ate atg 
Gin He Met 

aac cct gaa 
Asn Pro Glu 
15 

etc atg gca 
Leu Met Ala 
30 

gee age aac 
Ala Ser Asn 
45 



ctgtcctagg 
gcagagtggt 
ctccaccgtt 
ggcgactttg 
gggcactgcg 

aga ctt cga 
Arg Leu Arg 
-15 

gca gaa caa 
Ala Glu Gin 
1 

aaa gat ata 
Lys Asp He 

gag ttt gca 
Glu Phe Ala 

tac gta gat 
Tyr Val Asp 
50 



agacactaga 
cacagtataa 
ttctttcggg 
agggattccc 
agt atg 
Met 
gtt etc 
Val Leu 

gaa gtg 
Glu Val 

ttt gtg 
Phe Val 

20 
gee aaa 
Ala Lys 
35 

ctg ate 
Leu lie 



60 
120 
180 
240 
296 

344 



392 



440 



488 



536 



541 



<210> 574 
<211> 453 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 274 . .453 



<220> 

<221> sig_peptide 

<222> 274 . . 345 

<223> Von Heijne matrix 

score 6.59999990463257 
seq KMVHLLVLSGAWG/MQ 



<400> 574 

attattggtt gggggaaacc acgaggggac 
ggcgtgggag tgaggyacca gattcagccc 
ccgggtgctg eggattgagg tcccggttcc 
ctgtccccga gcgggctcac agggtctgaa 
agccacccgg tgehtcstte ccaggascgc 



geggecgagg 
atttggcccc 
taacgaatct 
ggccacgcay 
aag atg gag 
Met Glu 



gga ggc ctg att aag atg gtc cat eta ctg gtc 
Gly Gly Leu He Lys Met Val His Leu Leu Val 
-15 -10 



agggtegctg tccacccggg 
gacgcctctg ttctcggaat 
ctgctggatt ggccgtaacc 
gaggcaaagg yaaagyycyg 
gaa ggc ggg aac eta 
Glu Gly Gly Asn Leu 
-20 

ttg tea ggt gee tgg 
Leu Ser Gly Ala Trp 
-5 



60 
120 
180 
240 
294 



342 



WO 02/083898 

ggc atg caa atg 
Gly Met Gin Met 
1 

age ctt ccc cga 
Ser Leu Pro Arg 

ttc tac ttc cac 
Phe Tyr Phe His 
35 



tgg gtg acc ttc gtc 
Trp Val Thr Phe Val 
5 

cat acc ttc gga eta 

His Thr Phe Gly Leu 

20 

ate 

He 



310 

tea ggc ttc ctg ctt 
Ser Gly Phe Leu Leu 
10 

gtg cag age aaa etc 
Val Gin Ser Lys Leu 
25 



PCT/IB01/00914 

ttc cga 390 
Phe Arg 
15 

ttc ccc 438 

Phe Pro 

30 

453 



<210> 575 
<211> 497 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 70. .495 



<220> 

<221> sig_peptide 

<222> 70. . 147 

<223> Von Heijne matrix 

score 6.09999990463257 
seq IGLASLFTENVLA/HP 



<400> 575 

ataggatgaa accatggaag tcttsctsag atctggggag ggagcctggc ccaagagccc 60 
gtggagcta atg aaa aca gaa cag ctg aat aga ttt get gga ttt ggt att 111 
Met Lys Thr Glu Gin Leu Asn Arg Phe Ala Gly Phe Gly He 
-25 -20 -15 



gga 


ctt 


gca 


agt 


etc 


ttt 


aca 


gaa 


aat 


gta 


ttg 


gca 


cat 


cct 


tgc 


att 


159 


Gly 


Leu 


Ala 


Ser 


Leu 


Phe 


Thr 


Glu 


Asn 


Val 


Leu 


Ala 


His 


Pro 


Cys 


He 








-10 










-5 










1 










gtt 


eta 


cgc 


cgc 


caa 


tgt 


cag 


gtt 


aat 


tac 


cat 


get 


cag 


cat 


tac 


cat 


207 


Val 


Leu 


Arg 


Arg 


Gin 


Cys 


Gin 


Val 


Asn 


Tyr 


His 


Ala 


Gin 


His 


Tyr 


His 




5 










10 










15 










20 




etc 


act 


cca 


ttt 


aca 


gtc 


ate 


aat 


att 


atg 


tac 


agt 


ttc 


aac 


aaa 


act 


255 


Leu 


Thr 


Pro 


Phe 


Thr 


Val 


He 


Asn 


He 


Met 


Tyr 


Ser 


Phe 


Asn 


Lys 


Thr 












25 










30 










35 






cag 


gga 


cct 


aga 


gee 


ctg 


tgg 


aaa 


gga 


atg 


rga 


agt 


aca 


ttt 


att 


gtc 


303 


Gin 


Gly 


Pro 


Arg 


Ala 


Leu 


Trp 


Lys 


Gly 


Met 


Xaa 


Ser 


Thr 


Phe 


He 


Val 










40 










45 










50 








cag 


gga 


gtc 


aca 


ctt 


gga 


gca 


gaa 


ggc 


ata 


att 


agt 


gaa 


ttt 


aca 


cct 


351 


Gin 


Gly 


Val 


Thr 


Leu 


Gly 


Ala 


Glu 


Gly 


He 


He 


Ser 


Glu 


Phe 


Thr 


Pro 








55 










60 










65 










ttg 


cca 


agg 


gag 


gtt 


tta 


cat 


aaa 


tgg 


agt 


cct 


aaa 


caa 


ata 


gga 


gaa 


399 


Leu 


Pro 


Arg 


Glu 


Val 


Leu 


His 


Lys 


Trp 


Ser 


Pro 


Lys 


Gin 


He 


Gly 


Glu 






70 










75 










80 












cac 


ctt 


eta 


ctg 


aaa 


tec 


eta 


act 


tac 


gtg 


gtg 


gca 


atg 


ctt 


ttt 


tat 


447 


His 


Leu 


Leu 


Leu 


Lys 


Ser 


Leu 


Thr 


Tyr 


Val 


Val 


Ala 


Met 


Leu 


Phe 


Tyr 




85 










90 










95 










100 




tea 


gca 


agt 


ctg 


att 


gaa 


aca 


gtg 


cag 


agt 


gag 


ata 


att 


cga 


gat 


aat 


495 


Ser 


Ala 


Ser 


Leu 


He 


Glu 


Thr 


Val 


Gin 


Ser 


Glu 


He 


He 


Arg 


Asp 


Asn 





105 110 115 

497 



<210> 576 
<211> 467 
<212> DNA 
<213> Homo sapiens 



<220> 



WO 02/083898 PCT/IBO 1/009 14 

311 

<221> CDS 
<222> 88 . . 465 

<220> 

<221> sig_peptide 

<222> 88 . .219 

<223> Von Heijne matrix 

score 6.09999990463257 

seq ACCLVMVISCVFC/MG 

<400> 576 

tcttacacca acctgctcca aaccacaaga ggagttactt gttccagcct cctgtgtgga 60 
ctgctttcct atcaaagcac cttagac atg cac gag gaa gaa ata tac acc tct 114 

Met His Glu Glu Glu He Tyr Thr Ser 
-40 



ctt 


cag 


tgg 


gat 


age 


cca 


gca 


cca 


gac 


act 


tac 


cag 


aaa 


tgt 


ctg 


tct 


162 


Leu 


Gin 


Trp 


Asp 


Ser 


Pro 


Ala 


Pro 


Asp 


Thr 


Tyr 


Gin 


Lys 


Cys 


Leu 


Ser 




-35 










-30 










-25 










-20 




tec 


aac 


aaa 


tgt 


tea 


gga 


gca 


tgc 


tgt 


ctt 


gtg 


atg 


gtg 


att 


tea 


tgt 


210 


Ser 


Asn 


Lys 


Cys 


Ser 
-15 


Gly 


Ala 


Cys 


Cys 


Leu 
-10 


Val 


Met 


Val 


He 


Ser 
-5 


Cys 




gtt 


ttc 


tgc 


atg 


gga 


tta 


tta 


acg 


gca 


tec 


att 


ttc 


ttg 


ggc 


gtc 


aag 


258 


Val 


Phe 


Cys 


Met 
1 


Gly 


Leu 


Leu 


Thr 
5 


Ala 


Ser 


He 


Phe 


Leu 
10 


Gly 


Val 


Lys 




ttg 


ttg 


cag 


gtg 


tec 


acc 


att 


gcg 


atg 


cag 


cag 


caa 


gaa 


aaa 


etc 


ate 


306 


Leu 


Leu 
15 


Gin 


Val 


Ser 


Thr 


He 
20 


Ala 


Met 


Gin 


Gin 


Gin 
25 


Glu 


Lys 


Leu 


He 




caa 


caa 


gag 


agg 


gca 


ctg 


eta 


aac 


ttt 


aca 


gaa 


tgg 


aag 


aga 


age 


tgt 


354 


Gin 


Gin 


Glu 


Arg 


Ala 


Leu 


Leu 


Asn 


Phe 


Thr 


Glu 


Trp 


Lys 


Arg 


Ser 


Cys 




30 










35 










40 










45 




gec 


ctt 


cag 


atg 


aaa 


tat 


tgc 


caa 


gee 


ttc 


atg 


caa 


aac 


tea 


tta 


agt 


402 


Ala 


Leu 


Gin 


Met 


Lys 
50 


Tyr 


Cys 


Gin 


Ala 


Phe 
55 


Met 


Gin 


Asn 


Ser 


Leu 
60 


Ser 




tea 


gee 


cat 


aac 


age 


agt 


cct 


tgt 


cca 


aac 


aat 


tgg 


att 


cag 


aac 


aga 


450 


Ser 


Ala 


His 


Asn 
65 


Ser 


Ser 


Pro 


Cys 


Pro 
70 


Asn 


Asn 


Trp 


He 


Gin 
75 


Asn 


Arg 




gaa 


agt 


tgt 


tac 


tat 


gt 






















467 


Glu 


Ser 


Cys 


Tyr 


Tyr 



























80 



<210> 577 
<211> 485 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 328. .483 

<220> 

<221> sig_peptide 

<222> 328. .387 

<223> Von Heijne matrix 

score 5.90000009536743 
seq G WMAAG LM I GAGA / C Y 



<400> 577 

ctagagcaca agtcctgeta gtcgcctccg tctgggtacc agccccctat tactctgeag 60 

gcgtgtgaag aaagaaggaa actagctegg acegtgeagg tttgtaggtc tgttggcctg 120 

taggtttegg cacaagtttc agegagagaa ggagaaaact gccttggttg gaaccttgea 180 

gattcatcac aagagagcta caagagectg gaagaagctg aagactgeta ccctccatcc 240 

ttactcaccc tggacctgag agacctcttc aatcaggtgg agcaaggccc tctcctgtcc 300 

tgccccaagg ctggcacaga cttgagc atg ggc egg get egg gaa gtg ggt tgg 354 



WO 02/083898 PCT/IB01/00914 

312 

Met Gly Arg Ala Arg Glu Val Gly Trp 
-20 -15 
atg gcg gca gga ctg atg att ggg get ggt gec tgc tac tgc gtt tac 402 
Met Ala Ala Gly Leu Met lie Gly Ala Gly Ala Cys Tyr Cys Val Tyr 

-10 -5 15 

aaa ctg acc ata gga aga gat gac agt gag aag ctg gag gag gag ggg 450 
Lys Leu Thr lie Gly Arg Asp Asp Ser Glu Lys Leu Glu Glu Glu Gly 

10 15 20 

gaa gag gag tgg gac gat gac cag gag ctg gat ga 485 
Glu Glu Glu Trp Asp Asp Asp Gin Glu Leu Asp 
25 30 



<210> 578 
<211> 502 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 39. .500 



<220> 

<221> sig_peptide 

<222> 39. .83 

<223> Von Heijne matrix 

score 5.90000009536743 
seq ILMLTFI ICGLLT/RV 



<400> 578 

attcctcagg acacagagct tcctctctcc caggagee atg aat ate ctg atg ctg 56 

Met Asn lie Leu Met Leu 
-15 -10 



acc 


ttc 


att 


ate 


tgt 


ggg 


ttg 


eta 


act 


egg 


gtg 


acc 


aaa 


ggt 


age 


ttt 


104 


Thr 


Phe 


He 


He 


Cys 
-5 


Gly 


Leu 


Leu 


Thr 


Arg 
1 


Val 


Thr 


Lys 


Gly 
5 


Ser 


Phe 




gaa 


ccc 


caa 


aaa 


tgt 


tgg 


aag 


aat 


aat 


gta 


gga 


cat 


tgc 


aga 


aga 


cga 


152 


Glu 


Pro 


Gin 


Lys 


Cys 


Trp 


Lys 


Asn 


Asn 


Val 


Gly 


His 


Cys 


Arg 


Arg 


Arg 








10 










15 










20 










tgt 


tta 


gat 


act 


gaa 


agg 


tac 


ata 


ctt 


ctt 


tgt 


agg 


aac 


aag 


eta 


tea 


200 


Cys 


Leu 


Asp 


Thr 


Glu 


Arg 


Tyr 


He 


Leu 


Leu 


Cys 


Arg 


Asn 


Lys 


Leu 


Ser 






25 










30 










35 












tgc 


tgc 


att 


tct 


ata 


ata 


tea 


cat 


gaa 


tat 


act 


cga 


cga 


cca 


gca 


ttt 


248 


Cys 


Cys 


He 


Ser 


He 


He 


Ser 


His 


Glu 


Tyr 


Thr 


Arg 


Arg 


Pro 


Ala 


Phe 




40 










45 










50 










55 




cct 


gtg 


att 


cac 


eta 


gag 


gat 


ata 


aca 


ttg 


gat 


tat 


agt 


gat 


gtg 


gac 


296 


Pro 


Val 


He 


His 


Leu 


Glu 


Asp 


He 


Thr 


Leu 


Asp 


Tyr 


Ser 


Asp 


Val 


Asp 












60 










65 










70 






tct 


ttt 


act 


ggt 


tec 


cca 


gta 


tct 


atg 


ttg 


aat 


gat 


ctg 


ata 


aca 


ttt 


344 


Ser 


Phe 


Thr 


Gly 


Ser 


Pro 


Val 


Ser 


Met 


Leu 


Asn 


Asp 


Leu 


He 


Thr 


Phe 










75 










80 










85 








gac 


aca 


act 


aaa 


ttt 


gga 


gaa 


acc 


atg 


aca 


cct 


gag 


acc 


aat 


act 


cct 


392 


Asp 


Thr 


Thr 


Lys 


Phe 


Gly 


Glu 


Thr 


Met 


Thr 


Pro 


Glu 


Thr 


Asn 


Thr 


Pro 






90 










95 










100 










gag 


act 


act 


atg 


cca 


cca 


tec 


gag 


gee 


act 


act 


ccc 


gag 


act 


act 


atg 


440 


Glu 


Thr 


Thr 


Met 


Pro 


Pro 


Ser 


Glu 


Ala 


Thr 


Thr 


Pro 


Glu 


Thr 


Thr 


Met 






105 










110 










115 












cca 


cca 


tct 


gag 


act 


get 


act 


tec 


gag 


act 


atg 


cca 


cca 


cct 


tct 


cag 


488 


Pro 


Pro 


Ser 


Glu 


Thr 


Ala 


Thr 


Ser 


Glu 


Thr 


Met 


Pro 


Pro 


Pro 


Ser 


Gin 




120 










125 










130 










135 




aca 


get 


ctt 


act 


ca 
























502 


Thr 


Ala 


Leu 


Thr 





























<210> 579 



WO 02/083898 



PCT/IB01/00914 



313 

<211> 754 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 259. .753 



<220> 

<221> sig_peptide 

<222> 259. .375 

<223> Von Heijne matrix 

score 5.80000019073486 
seq FCVCVIAIGVVQA/LI 



<400> 579 

aagctcccgg 

gcccagcagt 

atgacctttc 

gggaaaacag 

ccctccctcc 



ccgggctgac tcaagcggag 
tcgacagaag tgtacagagg 
tcatcgtgac agtggcctgg 
acgacacact tgccctgctc 
aggcctgc atg atg acc 
Met Met Thr 



teg tta atg gtg 
Ser Leu Met Val 
-25 

tgt gtg tgt gtg 
Cys Val Cys Val 
-10 

tac gca ttc cac 
Tyr Ala Phe His 
5 

gec cac agg get 
Ala His Arg Ala 



acc ttc cct gat 
Thr Phe Pro Asp 



ggc ccg gec 
Gly Pro Ala 



ccc 
Pro 

age 
Ser 

ccc 

Pro 

85 

etc 

Leu 



ttg tct 
Leu Ser 
55 

aag gtc 
Lys Val 
70 

teg get 
Ser Ala 

age aag 
Ser Lys 



ctg 

Leu 

40 

tac 

Tyr 

acc 
Thr 

cac 
His 

gag 
Glu 



ate gec 
lie Ala 

ttc ccg 
Phe Pro 
10 

ctg tac 
Leu Tyr 

25 

tgc ttt 
Cys Phe 



gtg gec acg ctt 
Val Ala Thr Leu 
120 



ctg 
Leu 

ggc 
Gly 

cca 
Pro 

cgc 
Arg 
105 
etc 
Leu 



ctg 
Leu 

tgg 
Trp 

gtg 

Val 

90 

gtg 

Val 



ate ggg 
lie Gly 

-5 
cac ctg 
His Leu 

cga cga 
Arg Arg 

gca gcg 
Ala Ala 

atg gtg 
Met Val 
60 

tgc aga 
Cys Arg 
75 

gaa gtc 
Glu Val 

gaa gec 
Glu Ala 



gcgcgcggaa 
cttctggcaa 
gcagcacaca 
aacctggccg 
ate acc ttc 
lie Thr Phe 

-35 
gtg cct ctg 
Val Pro Leu 
-20 

gtc gtg cag 
Val Val Gin 



ctg age 
Leu Ser 

cac gtc 
His Val 

30 
gec ate 
Ala lie 
45 

act gtc 
Thr Val 



ccg 

Pro 

15 

ctg 

Leu 

ttc 
Phe 

ate 
He 



ate ctg gac 
He Leu Asp 



gac agg etc 
Asp Arg Leu 

ttc teg ttt 
Phe Ser Phe 
95 

ttc age gac 
Phe Ser Asp 

110 
ate tgg t 
He Trp 
125 



cagtcgccga ggcgattccc 
caeggattge cgtctacctg 
caaggttgtt ccaagttgtt 
catcatggct gtgatgeect 
ctg cct tac acg ttt 
Leu Pro Tyr Thr Phe 
-30 

ggc ate ttc ttg ttc 
Gly He Phe Leu Phe 
-15 

gca ctg att gtr ggg 
Ala Leu He Val Gly 
1 

cag ate cag cgc tct 
Gin He Gin Arg Ser 
20 

ggc ate gtc etc caa 
Gly He Val Leu Gin 
35 

tct etc ttc ttt gtc 
Ser Leu Phe Phe Val 
50 

etc etc ccc tat gtc 
Leu Leu Pro Tyr Val 
65 

ctg ggc cac agg gag 
Leu Gly His Arg Glu 
80 

gac etc cac gag cca 
Asp Leu His Glu Pro 
100 

gga gtc tac gec ate 
Gly Val Tyr Ala He 
115 



60 
120 
180 
240 
291 



339 



387 



435 



483 



531 



579 



627 



675 



723 



754 



<210> 580 
<211> 488 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 54 . . 488 



WO 02/083898 



PCT/IB01/00914 



314 

<220> 

<221> sig_peptide 

<222> 54 . .119 

<223> Von Heijne matrix 

score 5.80000019073486 

seq PVTLTSLLPMVTX/GP 

<220> 

<221> misc_f eature 
<222> 181,256,369 
<223> n-a, g, c or t 



<400> 580 

gtctttttaa tctctgcagt cccctgagtg tcacccaacc tgcaggcagt gtc atg 56 

Met 

104 



152 



ggc 


cgt 


tgg 


agt 


ctg 


act 


gca 


tct 


cct 


gtt 


acc 


etc 


aca 


tct 


etc 


tta 


Gly Arg 


Trp 


Ser 


Leu 


Thr 


Ala 


Ser 


Pro 


Val 


Thr 


Leu 


Thr 


Ser 


Leu 


Leu 




-20 










-15 










-10 










cca 


atg 


gta 


act 


kca 


gga 


cca 


get 


gca 




aag 


age 


agt 


tec 


tea 


acc 


Pro 


Met 


Val 


Thr 


Xaa 


Gly 


Pro 


Ala 


Ala 


Gly 


Lys 


Ser 


Ser 


Ser 


Ser 


Thr 


-5 










1 








5 










10 




teg 


tgg 


gat 


act 


gtg 


ctg 


act 


gee 


ate 


wnc 


tgt 


gee 


age 


act 


gtg 


cag 


Ser 


Trp 


Asp 


Thr 


Val 


Leu 


Thr 


Ala 


He 


Xaa 


Cys 


Ala 


Ser 


Thr 


Val 


Gin 








15 










20 










25 






ctg 


ate 


tec 


aca 


aca 


ctg 


gga 


gca 


tct 


gee 


tea 


ggt 


gec 


aga 


atg 


ccc 


Leu 


He 


Ser 


Thr 


Thr 


Leu 


Gly 


Ala 


Ser 


Ala 


Ser 


Gly 


Ala 


Arg 


Met 


Pro 






30 










35 










40 








act 


ace 


tnc 


tgc 


teg 


ggg 


acc 


act 


gtg 


ttc 


ctg 


act 


gec 


ctt 


cag 


gat 


Thr 


Thr 


Xaa 


Cys 


Ser 


Gly 


Thr 


Thr 


Val 


Phe 


Leu 


Thr 


Ala 


Leu 


Gin 


Asp 




45 










50 










55 










act 


atg 


cag 


aga 


gag 


gag 


ctt 


gta 


aaa 


aat 


gee 


act 


cct 


cct 


gca 


gaa 


Thr 


Met 


Gin 


Arg 


Glu 


Glu 


Leu 


Val 


Lys 


Asn 


Ala 


Thr 


Pro 


Pro 


Ala 


Glu 


60 










65 










70 










75 


cct 


gee 


agg 


gca 


gag 


gac 


ctt 


tct 


net 


get 


cct 


cat 


gtg 


aca 


cca 


acc 


Pro 


Ala 


Arg 


Ala 


Glu 


Asp 


Leu 


Ser 


Xaa 


Ala 


Pro 


His 


Val 


Thr 


Pro 


Thr 










80 










85 










90 




teg 


tgc 


tgt 


ccc 


aca 


ctg 


gca 


cct 


gca 


gca 


cca 


cct 


get 


tec 


ctg 


ggc 


Ser 


Cys 


Cys 


Pro 


Thr 


Leu 


Ala 


Pro 


Ala 


Ala 


Pro 


Pro 


Ala 


Ser 


Leu 


Gly 








95 










100 










105 






act 


ate 


ttg 


atg 


aca 


ate 


atg 


ttt 


gee 


age 


gta 


ggt 


ttt 


tec 


aat 


gaa 


Thr 


He 


Leu 


Met 


Thr 


He 


Met 


Phe 


Ala 


Ser 


Val 


Gly 


Phe 


Ser 


Asn 


Glu 






110 










115 










120 









200 



248 



296 



344 



392 



440 



488 



<210> 581 
<211> 343 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 17 . . 343 



<220> 

<221> sig_peptide 

<222> 17. . 106 

<223> Von Heijne matrix 

score 5.69999980926514 
seq LGVNLVVMPPTRA/RS 



<400> 581 

atagattaca aeggtg atg gcg ggc aag egg tec ggc tgg age egg gcg get 52 

Met Ala Gly Lys Arg Ser Gly Trp Ser Arg Ala Ala 
-30 " " -25 -20 
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etc 


etc 


cag 


etc 


ctt 


etc 


Leu 


Leu 


Gin 


Leu 


Leu 


Leu 








-15 






egg 


gee 


egg 


agt 


ctg 


cgc 


Arg 


Ala 


Arg 
1 


Ser 


Leu 


Arg 


tgg 


gaa 


ctg 


ggc 


cag 


gee 


Trp 


Glu 


Leu 


Gly 


Gin 


Ala 


15 










20 


acc 


tct 


tat 


cac 


egg 


caa 


Thr 


Ser 


Tyr 


His 


Arg 


Gin 










35 




act 


etc 


atg 


agt 


get 


gag 


Thr 


Leu 


Met 


Ser 


Ala 


Glu 








50 






ggg 


atk 


cag 


cgc 


ttc 


aac 


Gly 


Xaa 


Gin 


Arg 


Phe 


Asn 



65 



cac 
His 



315 

ggc gtg aac ctg gtg gtg 
Gly Val Asn Leu Val Val 
-10 

ttc gtt acc ttg gag ggg 
Phe Val Thr Leu Glu Gly 
5 10 
ctg egg cag cgc tat cac 
Leu Arg Gin Arg Tyr His 
25 

gag gtt tat gtg cga arg 
Glu Val Tyr Val Arg Xaa 
40 

gec aac ctg get gga stc 
Ala Asn Leu Ala Gly Xaa 
55 

ccg aac ate teg tgg cag 
Pro Asn He Ser Trp Gin 
70 



PCT/IB01/00914 

atg ccg ccc acc 100 
Met Pro Pro Thr 
-5 

atg eta cag cac 148 
Met Leu Gin His 

ggc ttc eta aac 196 
Gly Phe Leu Asn 
30 

tac ttt gac egg 244 
Tyr Phe Asp Arg 
4 5 

ttc cct ccc aac 292 
Phe Pro Pro Asn 
60 

cct att cct gtg 340 

Pro lie Pro Val 

75 

343 



<210> 582 
<211> 465 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 293. . 463 



<220> 

<221> sig_peptide 

<222> 293. . 388 

<223> Von Heijne matrix 

score 5.59999990463257 
seq WMGLAC FRSLAAS / S P 



<400> 582 

aaaagattga agaggaactc tttcaaacct ttcaagtgcc ccctcctttc cttaaagtct 60 

tttatagggg tccccttctt ggccatctcc atcctgtgag tcaggactga aagggcacag 120 

acaggtcact gecagcattg ttggggcaag cctgcaagca cgcatcactg gggatctgac 180 

atgacaatgg ccgcctgccc cctctgaggg ctacaggact taccccagtg ggaagcagct 240 

aagcaggtct gaccagccga cctggacctg gecaagggtc ctgtcatccc tc atg gee 298 

Met Ala 



acc 


ccg 


cca 


ttc 


egg 


ctg 


ata 


agg 


aag 


atg 


ttt 


tec 


ttc 


aag 


gtg 


age 


346 


Thr 


Pro 


Pro 


Phe 


Arg 


Leu 


He 


Arg 


Lys 


Met 


Phe 


Ser 


Phe 


Lys 


Val 


Ser 




-30 










-25 










-20 










-15 




aga 


tgg 


atg 


ggg 


ctt 


gee 


tgc 


ttc 


egg 


tec 


ctg 


gcg 


gca 


tec 


tct 


ccc 


394 


Arg 


Trp 


Met 


Gly 


Leu 
-10 


Ala 


Cys 


Phe 


Arg 


Ser 
-5 


Leu 


Ala 


Ala 


Ser 


Ser 
1 


Pro 




agt 


att 


cgc 


cag 


aag 


aaa 


eta 


atg 


cac 


aag 


ctg 


cag 


gag 


gaa 


aac 


get 


442 


Ser 


He 


Arg 
5 


Gin 


Lys 


Lys 


Leu 


Met 
10 


His 


Lys 


Leu 


Gin 


Glu 

15 


Glu 


Asn 


Ala 




ttt 


cgc 


gaa 


gag 


atg 


aaa 


att 


tt 


















465 


Phe 


Arg 
20 


Glu 


Glu 


Met 


Lys 


He 
25 























<210> 583 
<211> 539 
<212> DNA 
<213> Homo sapiens 



<220> 
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<221> CDS 
<222> 302. .538 



<220> 

<221> sig_jpeptide 

<222> 302. .358 

<223> Von Heijne matrix 

score 5.40000009536743 
seq LCCSGCVPSLCCS/SY 



<400> 583 

aaaaacttcc gccgccgcgt ccgccgcctc cggaactaaa cggggtgagg tcacattcgg 
ttatctctaa cgttggaaaa cgatggagct aacacccatt atggagatta accacttttc 
atcaggtttt taacttaagt cgtgaggaat acaacggtga acacaagatt cattttattt 
tcatcaccat gggacgtatc ctgttgttga gtwctctggg tcagacctct gaagacttct 
cagatggatc ctagtctctg ggcttgcctg aaattactcg ctgctcaggg agagagttga 
a atg gtt ggc ate etc cca etc tgt tgc tec ggc tgt gtc ccc teg etc 
Met Val Gly lie Leu Pro Leu Cys Cys Ser Gly Cys Val Pro Ser Leu 
-15 -10 -5 

tgt tgt tec age tat gtc ccc tct gtt get cca act gca get cat tct 
Cys Cys Ser Ser Tyr Val Pro Ser Val Ala Pro Thr Ala Ala His Ser 

1 5 10 

gtt aga gtt cct cat tea get ggt cac tgt ggc cag agg gtg ttg gee 
Val Arg Val Pro His Ser Ala Gly His Cys Gly Gin Arg Val Leu Ala 

15 20 25 

tgc tec ctt cct caa gta ttc tta aag cca tgg att ttt gtg gag cat 
Cys Ser Leu Pro Gin Val Phe Leu Lys Pro Trp lie Phe Val Glu His 
30 35 40 45 

ttt tct tec tgg etc tec ctt gag tta ttt tec ttt ctt cgc tat c 
Phe Ser Ser Trp Leu Ser Leu Glu Leu Phe Ser Phe Leu Arg Tyr 
50 55 60 



60 
120 
180 
240 
300 
349 



397 



445 



493 



539 



<210> 584 
<211> 526 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 85. .525 



<220> 

<221> sig_peptide 

<222> 85. . 144 

<223> Von Heijne matrix 

score 5.19999980926514 
seq SPVFLVFPPEITA/SE 



<400> 584 

cctartcgaa aaacaagaaa agaacatggt ctagactgaa gtaccaacta aatcatctcc 60 
tttcaaatta tcaccgacac catc atg gat tea age ace gca cac agt ccg 111 

Met Asp Ser Ser Thr Ala His Ser Pro 
-20 -15 
gtg ttt ctg gta ttt cct cca gaa ate act get tea gaa tat gag tec 159 
Val Phe Leu Val Phe Pro Pro Glu lie Thr Ala Ser Glu Tyr Glu Ser 

-10 -5 15 

aca gaa ctt tea gee acg ace ttt tea act caa age ccc ttg caa aaa 207 
Thr Glu Leu Ser Ala Thr Thr Phe Ser Thr Gin Ser Pro Leu Gin Lys 

10 15 20 

tta ttt get aga aaa atg aaa ate tta ggg act ate cag ate ctg ttt 255 
Leu Phe Ala Arg Lys Met Lys lie Leu Gly Thr lie Gin lie Leu Phe 

25 30 35 

gga att atg ace ttt tct ttt gga gtt ate ttc ctt ttc act ttg tta 303 
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Gly 


He 


Met 


Thr 


Phe 


Ser 


Phe 


Gly 


Val 


He 


Phe 


Leu 


Phe 


Thr 


Leu 


Leu 








40 










45 










50 










aaa 


cca 


tat 


cca 


agg 


ttt 


ccc 


ttt 


ata 


ttt 


ctt 


tea 


gga 


tat 


cca 


ttc 


351 


Lys 


Pro 


Tyr 


Pro 


Arg 


Phe 


Pro 


Phe 


He 


Phe 


Leu 


Ser 


Gly 


Tyr 


Pro 


Phe 






55 










60 










65 












tgg 


ggc 


tct 


gtt 


ttg 


ttc 


att 


aat 


tct 


gga 


gcc 


ttc 


eta 


att 


gca 


gtg 


399 


Trp 


Gly 


Ser 


Val 


Leu 


Phe 


He 


Asn 


Ser 


Gly 


Ala 


Phe 


Leu 


He 


Ala 


Val 




70 










75 










80 










85 




aaa 


aga 


aaa 


acc 


aca 


gaa 


act 


ctg 


ata 


ata 


ttg 


age 


cga 


ata 


atg 


aat 


447 


Lys 


Arg 


Lys 


Thr 


Thr 


Glu 


Thr 


Leu 


He 


He 


Leu 


Ser 


Arg 


He 


Met 


Asn 












90 










95 










100 






ttt 


ctt 


agt 


gcc 


ctg 


gga 


gca 


ata 


get 


gga 


ate 


att 


etc 


etc 


aca 


ttt 


495 


Phe 


Leu 


Ser 


Ala 


Leu 


Gly 


Ala 


He 


Ala 


Gly 


He 


He 


Leu 


Leu 


Thr 


Phe 










105 










110 










115 








ggg 


ttt 


cat 


cct 


aga 


tea 


aaa 


eta 


cat 


ttg 


t 












526 


Gly 


Phe 


His 


Pro 


Arg 


Ser 


Lys 


Leu 


His 


Leu 

















120 125 



<210> 585 
<211> 482 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 195. . 482 

<220> 

<221> sig_peptide 

<222> 195. .293 

<223> Von Heijne matrix 

score 5.09999990463257 

seq LYALVQLGQPCDC/LP 

<400> 585 

gcacagatga ccggaagcgt gccctggcgg cgctgccagc cccagctggg agctggtagg 60 

ggagtgcgag gaaggcggga egcagggegg agccgctgca ggggegggge ctgcggacag 120 

gagceggggt ttggggcggg aacccctcgt cccctgcaga ccccgcctgc tegggegegg 180 



gcggcggcgc ggee 


atg 


aag 


ctg 


aag 


ctg 


aag 


aac 


gtg 


ttt 


etc 


gcc 


tac 


230 










Met 


Lys 


Leu 


Lys 
-30 


Leu 


Lys 


Asn 


Val 


Phe 
-25 


Leu 


Ala 


Tyr 




ttc 


ctg 


gtg 


teg 


ate 


gcc 


ggc 


etc 


etc 


tac 


gcg 


ctg 


gta 


cag 


etc 


ggc 


278 


Phe 


Leu 
-20 


Val 


Ser 


He 


Ala 


Gly 
-15 


Leu 


Leu 


Tyr 


Ala 


Leu 
-10 


Val 


Gin 


Leu 


Gly 




cag 


cca 


tgt 


gac 


tgc 


ctt 


cct 


ccc 


ctg 


egg 


gca 


gca 


ges 


gag 


cag 


eta 


326 


Gin 


Pro 


Cys 


Asp 


Cys 


Leu 


Pro 


Pro 


Leu Arg 


Ala 


Ala 


Ala 


Glu 


Gin 


Leu 




-5 










1 








5 










10 






egg 


cag 


aag 


gat 


ctg 


agg 


att 


tec 


cag 


ctg 


caa 


gcg 


gaa 


etc 


cga 


egg 


374 


Arg 


Gin 


Lys 


Asp 
15 


Leu 


Arg 


He 


Ser 


Gin 
20 


Leu 


Gin 


Ala 


Glu 


Leu 

25 


Arg 


Arg 




cca 


ccc 


cct 


gcc 


cct 


gcc 


cag 


ccc 


cct 


gaa 


ccc 


gag 


gcc 


ctg 


cct 


act 


422 


Pro 


Pro 


Pro 

30 


Ala 


Pro 


Ala 


Gin 


Pro 
35 


Pro 


Glu 


Pro 


Glu 


Ala 
40 


Leu 


Pro 


Thr 




ate 


tat 


gtt 


gtt 


acc 


ccc 


acc 


tat 


gcc 


agg 


ctg 


gta 


cag 


aag 


gca 


gag 


470 


lie 


Tyr 
45 


Val 


Val 


Thr 


Pro 


Thr 
50 


Tyr 


Ala 


Arg 


Leu 


Val 
55 


Gin 


Lys 


Ala 


Glu 




ctg 


gta 


cga 


ctg 


























482 


Leu 


Val 


Arg 


Leu 





























60 



<210> 586 
<211> 543 
<212> DNA 
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<213> Homo sapiens 

<220> 
<221> CDS 
<222> 35. . 541 

<220> 

<221> sig_pepticie 
<222> 35. .82 
<223> Von Heijne matrix 
score 5 

seq CGLFLRTTAAARA/CR 
<400> 586 

agactgaggc tgaggctggg gaacatcggg cage atg age ggc tgc ggg etc ttc 55 

Met Ser Gly Cys Gly Leu Phe 
-15 -10 



ctg 


cgc 


ace 


acg 


get 


gcg 


get 


cgt 


gee 


tgc 


egg 


ggt 


ctg 


gtg 


gtc 


tct 


103 


Leu 


Arg 


Thr 


Thr 


Ala 


Ala 


Ala 


Arg 


Ala 


Cys 


Arg 


Gly 


Leu 


Val 


Val 


Ser 












-5 










1 








5 








ace 


gcg 


aac 


egg 


egg 


eta 


ctg 


cgc 


ace 


age 


ccg 


cct 


gta 


cga 


get 


ttc 


151 


Thr 


Ala 


Asn 


Arg 


Arg 


Leu 


Leu 


Arg 


Thr 


Ser 


Pro 


Pro 


Val 


Arg 


Ala 


Phe 








10 










15 










20 










gee 


aaa 


gag 


ctt 


ttc 


eta 


ggc 


aaa 


ate 


aag 


aag 


aaa 


gaa 


gtt 


ttc 


cca 


199 


Ala 


Lys 


Glu 


Leu 


Phe 


Leu 


Gly 


Lys 


He 


Lys 


Lys 


Lys 


Glu 


Val 


Phe 


Pro 






.25 










30 










35 














cca 


ga a 


gut 


age 


caa 


y a L 


gaa 




da t_ 


y da 


a +■ c 


^ ^ t- 


f Pi n 




f t A 


247 


Phe 


Pro 


Glu 


Val 


Ser 


Gin 


Asp 


Glu 


Leu 


Asn 


Glu 


He 


Asn 


Gin 


Phe 


Leu 




40 










45 










50 










55 




gga 


ccc 


gtg 


gaa 


aaa 


ttc 


ttc 


act 


gaa 


gag 


gtg 


gac 


tec 


cga 


aaa 


att 


295 


Gly 


Pro 


Val 


Glu 


Lys 


Phe 


Phe 


Thr 


Glu 


Glu 


Val 


Asp 


Ser 


Arg 


Lys 


He 












60 










65 










70 






gac 


cag 


gaa 


ggg 


aaa 


ate 


cca 


gat 


gaa 


act 


ttg 


gag 


aaa 


ttg 


aag 


age 


343 


Asp 


Gin 


Glu 


Gly 


Lys 


He 


Pro 


Asp 


Glu 


Thr 


Leu 


Glu 


Lys 


Leu 


Lys 


Ser 










75 










80 










85 








eta 


ggg 


ctt 


ttt 


ggg 


ctg 


caa 


g t c 


cca 


gaa 


gaa 


tat 


y y l 




ctg 


y y u 


391 


Leu 


Gly 


Leu 


Phe 


Gly 


Leu 


Gin 


Val 


Pro 


Glu 


Glu 


Tyr 


Gly 


Gly 


Leu 


Gly 








90 










95 










100 










ttc 


tec 


aac 


ace 


atg 


tac 


tea 


cga 


eta 


ggg 


gag 


ate 


ate 


age 


atg 


gat 


439 


Phe 


Ser 


Asn 


Thr 


Met 


Tyr 


Ser 


Arg 


Leu 


Gly 


Glu 


He 


He 


Ser 


Met 


Asp 






105 










110 










115 












ggg 


tec 


ate 


act 


gtg 


ace 


ctg 


gca 


gcg 


cac 


cag 


get 


att 


ggc 


tea 


agg 


487 


Gly 


Ser 


He 


Thr 


Val 


Thr 


Leu 


Ala 


Ala 


His 


Gin 


Ala 


He 


Gly 


Ser 


Arg 




120 










125 










130 










135 




gga 


tea 


tct 


tgg 


ctg 


gca 


ctg 


agg 


age 


aga 


aag 


cca 


aat 


aac 


ttg 


cct 


535 


Gly 


Ser 


Ser 


Trp 


Leu 


Ala 


Leu 


Arg 


Ser 


Arg 


Lys 


Pro 


Asn 


Asn 


Leu 


Pro 












140 










145 










150 






aaa 


ctg 


gc 




























543 


Lys 


Leu 
































<210> 5£ 


57 






























<211> 473 






























<212> DNA 






























<213> Homo : 


sapiens 


























<220> 
































<221> CDS 






























<222> 64 


I . .471 




























<220> 
































<221> sig peptide 


























<222> 64 


I . . 171 




























<223> Von Heijne matrix 
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score 4.80000019073486 
seq LLLLRLNDAALRA/LQ 

<400> 587 

aggaggtggc ggtggtggcc ctcgcctgtg gcccccgtgc tgcttgcact cgaactcgtc 60 



gcc 


atg 


gag 


gag 


etc 


cag 


gag 


cct 


ctg 


aga 


gga 


gag 


etc 


egg 


etc 


tgc 


108 




Met 


Glu 
-35 


Glu 


Leu 


Gin 


Glu 


Pro 
-30 


Leu 


Arg 


Gly 


Glu 


Leu 
-25 


Arg 


Leu 


Cys 




ttc 


acg 


caa 


get 


gcc 


egg 


act 


age 


etc 


tta 


ctg 


etc 


agg 


etc 


aac 


gac 


156 


Phe 


Thr 
-20 


Gin 


Ala 


Ala 


Arg 


Thr 
-15 


Ser 


Leu 


Leu 


Leu 


Leu 
-10 


Arg 


Leu 


Asn 


Asp 




get 


gcc 


ctg 


egg 


gcg 


ctg 


caa 


gag 


tgt 


cag 


egg 


caa 


cag 


gta 


egg 


ccg 


204 


Ala 


Ala 


Leu 


Arg 


Ala 


Leu 


Gin 


Glu 


Cys 


Gin 


Arg 


Gin 


Gin 


Val 


Arg 


Pro 




-5 










1 








5 










10 






gtg 


att 


get 


ttc 


caa 


ggc 


cac 


cga 


ggg 


tat 


ctg 


aga 


etc 


cca 


ggc 


cct 


252 


Val 


He 


Ala 


Phe 
15 


Gin 


Gly 


His 


Arg 


Gly 
20 


Tyr 


Leu 


Arg 


Leu 


Pro 
25 


Gly 


Pro 




ggt 


tgg 


tec 


tgc 


etc 


ttc 


tec 


ttc 


ata 


gtg 


tec 


cag 


tgt 


tgt 


cag 


gag 


300 


Gly 


Trp 


Ser 
30 


Cys 


Leu 


Phe 


Ser 


Phe 
35 


He 


Val 


Ser 


Gin 


Cys 
40 


Cys 


Gin 


Glu 




ggc 


get 


ggt 


ggt 


age 


ttg 


gac 


ctt 


gtg 


tgc 


caa 


cgc 


ttc 


etc 


agg 


tct 


348 


Gly Ala 


Gly 


Gly 


Ser 


Leu 


Asp 


Leu 


Val 


Cys 


Gin 


Arg 


Phe 


Leu 


Arg 


Ser 






45 










50 










55 












ggg 


cct 


aac 


age 


etc 


cac 


tgc 


ctg 


ggc 


tea 


etc 


agg 


gag 


cgc 


etc 


att 


396 


Gly 


Pro 


Asn 


Ser 


Leu 


His 


Cys 


Leu 


Gly 


Ser 


Leu 


Arg 


Glu 


Arg 


Leu 


He 




60 










65 










70 










75 




att 


tgg 


gca 


gcc 


atg 


gat 


tct 


ate 


cca 


gcc 


cca 


tea 


tea 


gtt 


cag 


gga 


444 


He 


Trp 


Ala 


Ala 


Met 
80 


Asp 


Ser 


He 


Pro 


Ala 

85 


Pro 


Ser 


Ser 


Val 


Gin 
90 


Gly 




cac 


aay 


ctg 


act 


gaa 


gat 


gcc 


aga 


cat 


cc 














473 


His 


Asn 


Leu 


Thr 

95 


Glu 


Asp 


Ala 


Arg 


His 
100 



















<210> 588 
<211> 509 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 183. .509 

<220> 

<221> sig_peptide 

<222> 183. .251 

<223> Von Heijne matrix 

score 4.59999990463257 

seq GH S LVGLALLC I A / AN 

<400> 588 

aggaccaatg aaagcaaagt acctcatccc ctcagtgact aagaatcgea gtatttaaga 60 
ggtagcagga atgggctgag agtggtgttt gctttctcca ccagaagggc acactttcat 120 
ctaatttggg gtatcactga gctgaagaca aagagaaggg ggagaaaacc tagcagacca 180 
cc atg tgc tat ggg aag tgt gca cga tgc ate gga cat tct ctg gtg 227 
Met Cys Tyr Gly Lys Cys Ala Arg Cys He Gly His Ser Leu Val 
-20 -15 -10 

ggg etc gcc etc ctg tgc ate gcg get aat att ttg ctt tac ttt ccc 275 
Gly Leu Ala Leu Leu Cys He Ala Ala Asn He Leu Leu Tyr Phe Pro 

-5 1 5 

aat ggg gaa aca aag tat gcc tec gaa aac cac etc age cgc ttc gtg 323 
Asn Gly Glu Thr Lys Tyr Ala Ser Glu Asn His Leu Ser Arg Phe Val 

10 15 20 

tgg ttc ttt tct ggc ate gta gga ggt ggc ctg ctg atg etc ctg cca 371 
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Trp 


Phe 


Phe 


Ser 


Gly 


He 


Val 


Gly 


Gly 


Gly 


Leu 


Leu 


Met 


Leu 


Leu 


Pro 


25 










30 










35 










40 


gca 


ttt 


gtc 


ttc 


att 


ggg 


ctg 


gaa 


cag 


gat 


gac 


tgc 


tgt 


ggc 


tgc 


tgt 


Ala 


Phe 


Val 


Phe 


He 
45 


Gly 


Leu 


Glu 


Gin 


Asp 
50 


Asp 


Cys 


Cys 


Gly 


Cys 
55 


Cys 


ggc 


cat 


gaa 


aac 


tgt 


ggc 


aaa 


cga 


tgt 


gcg 


atg 


ctt 


tct 


tct 


gta 


ttg 


Gly 


His 


Glu 


Asn 
60 


Cys 


Gly 


Lys 


Arg 


Cys 
65 


Ala 


Met 


Leu 


Ser 


Ser 
70 


Val 


Leu 


get 


get 


etc 


att 


gga 


att 


gca 


gga 


yet 


ggc 


tac 


tgt 


gtc 


att 






Ala 


Ala 


Leu 
75 


He 


Gly 


He 


Ala 


Gly 
80 


Xaa 


Gly 


Tyr 


Cys 


Val 
85 


He 







<210> 589 
<211> 523 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 302. .523 



<220> 

<221> sig_peptide 

<222> 302. .352 

<223> Von Heijne matrix 

score 4.59999990463257 
seq LLPLCQRIHCIIG/RF 



<400> 589 

agatgagtgt tcagctctca gcagagaggt tagctcctct ctgcagcttg tcctgttgtc 
tcctcaagtc tggctgagtc cggagttttt atgagectea gaggggagga agtgcatgct 
gattaatcca tgggcaggee tggaaaagtt cccactccag tetgegggae ccacagcctg 
gccctcaggc tcaggcttcc cwggcttgaa gattgggctt cacctgggac ctaccccttc 
tgectaggag catgtctgcc tcctgctgcc tttcatggtg cccagccaag gctaagtcga 
a atg tgg ccc aac ctt ctt ccc ctg tgc cag egg ate cat tgc ate att 
Met Trp Pro Asn Leu Leu Pro Leu Cys Gin Arg He His Cys lie He 
-15 -10 -5 

ggt cgc ttc egg tgc aat ggg ttt gag gac tgt ccc gat ggc age gat 
Gly Arg Phe Arg Cys Asn Gly Phe Glu Asp Cys Pro Asp Gly Ser Asp 

1 ' 5 10 15 

gaa gag aac tgc aca gca aac cct ctg ctt tgc tec ace gee cgc tac 
Glu Glu Asn Cys Thr Ala Asn Pro Leu Leu Cys Ser Thr Ala Arg Tyr 

20 25 30 

cac tgc aag aac ggc etc tgt att gac aag age ttc ate tgc gat gga 
His Cys Lys Asn Gly Leu Cys He Asp Lys Ser Phe He Cys Asp Gly 

35 40 45 

cag aat aac tgt caa gac aac agt gat gag 
Gin Asn Asn Cys Gin Asp Asn Ser Asp Glu 
50 55 



60 
120 
180 
240 
300 
349 



397 



445 



493 



523 



<210> 590 
<211> 513 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 103. .513 



<220> 

<221> sig_peptide 

<222> 103. .243 

<223> Von Heijne matrix 
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score 4.40000009536743 
seq MATMLQLVPLLKA/SI 

<400> 590 

aatcattaca gaaggatttg gcctctaagg gcctctggaa ttctttggaa gaaggcattg 60 
gctgtgtagc tgacactcta ctcactgtgg acatggacgt cc atg acc ctg cag 114 

Met Thr Leu Gin 
-45 



tgg 


cac 


ttg 


tat 


gag 


atg 


gca 


cgc 


aac 


ctg 


aag 


gtg 


cag 


gat 


atg 


ctg 


162 


Trp 


His 


Leu 


Tyr 


Glu 


Met 


Ala 


Arg 


Asn 


Leu 


Lys 


Val 


Gin 


Asp 


Met 


Leu 










-40 










-35 










-30 








egg 


gca 


gag 


gtc 


ttg 


get 


gcg 


egg 


cac 


cag 


gee 


cag 


gga 


gac 


atg 


gee 


210 


Arg 


Ala 


Glu 


Val 


Leu 


Ala 


Ala 


Arg 


His 


Gin 


Ala 


Gin 


Gly 


Asp 


Met 


Ala 








-25 










-20 










-15 










acg 


atg 


eta 


cag 


ctg 


gtc 


ccc 


etc 


etc 


aaa 


gee 


age 


ate 


aag 


gag 


aca 


258 


Thr 


Met 


Leu 


Gin 


Leu 


Val 


Pro 


Leu 


Leu 


Lys 


Ala 


Ser 


He 


Lys 


Glu 


Thr 






-10 










-5 










1 








5 




eta 


aga 


ctt 


cac 


ccc 


ate 


tec 


gtg 


acc 


ctg 


cag 


aga 


tat 


ctt 


gta 


aat 


306 


Leu 


Arg 


Leu 


His 


Pro 


He 


Ser 


Val 


Thr 


Leu 


Gin 


Arg 


Tyr 


Leu 


Val 


Asn 












10 










15 










20 






gac 


ttg 


gtt 


ctt 


cga 


gat 


tac 


atg 


att 


cct 


gee 


aag 


aca 


ctg 


gtg 


caa 


354 


Asp 


Leu 


Val 


Leu 


Arg 


Asp 


Tyr 


Met 


He 


Pro 


Ala 


Lys 


Thr 


Leu 


Val 


Gin 










25 










30 










35 








gtg 


gec 


ate 


tat 


get 


ctg 


ggc 


cga 


gag 


ccc 


acc 


ttc 


ttc 


ttc 


gac 


ccg 


402 


Val 


Ala 


He 


Tyr 


Ala 


Leu 


Gly 


Arg 


Glu 


Pro 


Thr 


Phe 


Phe 


Phe 


Asp 


Pro 








40 










45 










50 










gaa 


aat 


ttt 


gac 


cca 


acc 


cga 


tgg 


ctg 


age 


aaa 


gac 


aag 


aac 


ate 


acc 


450 


Glu 


Asn 


Phe 


Asp 


Pro 


Thr 


Arg 


Trp 


Leu 


Ser 


Lys 


Asp 


Lys 


Asn 


He 


Thr 






55 










60 










65 












tac 


ttc 


egg 


aac 


ttg 


ggc 


ttt 


ggc 


tgg 


ggt 


gtg 


egg 


cag 


tgt 


ctg 


gga 


498 


Tyr 


Phe 


Arg 


Asn 


Leu 


Gly 


Phe 


Gly 


Trp 


Gly 


Val 


Arg 


Gin 


Cys 


Leu 


Gly 




70 










75 










80 










85 




egg 


csg 


ate 


get 


gag 
























513 


Arg 


Xaa 


He 


Ala 


Glu 



























90 



<210> 591 
<211> 490 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 196. .489 



<220> 

<221> sig_peptide 

<222> 196. .255 

<223> Von Heijne matrix 

score 4.30000019073486 
seq LLDSSLMASGTAS/RS 



<400> 591 

agaegtaect cacggaagcc ggctttggcc ctgcggctgc taccgtcgcc gcggagaaat 60 
tgttggatct ggcagtctag gaatgaatct cctctcagcc tttaagctca cctggtcaga 120 
atccttggat gagcctgtgg gaccgttcct cctagcccgg tggtttggaa ccagtggctt 180 
tgggactgta agagg atg gac aaa gat tct cag ggg ctg eta gat tea tec 231 
Met Asp Lys Asp Ser Gin Gly Leu Leu Asp Ser Ser 
-20 -15 -10 

ctg atg gca tea ggc act gee age cgc tea gag gat gag gag tea ctg 279 
Leu Met Ala Ser Gly Thr Ala Ser Arg Ser Glu Asp Glu Glu Ser Leu 

-5 15 
gca ggg cag aag cga gee tec tec cag gee ttg ggc acc ate cct aaa 327 
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Ala 


Gly 
10 


Gin 


Lys 


Arg 


Ala 


Ser 
15 


Ser 


Gin 


Ala 


Leu 


Gly 
20 


Thr 


He 


Pro 


Lys 




egg 


aga 


age 


tec 


tec 


agg 


ttc 


ate 


aag 


agg 


aag 


aag 


ttc 


gat 


gat 


gag 


375 


Arg 


Arg 


Ser 


Ser 


Ser 


Arg 


Phe 


He 


Lys 


Arg 


Lys 


Lys 


Phe 


Asp 


Asp 


Glu 




25 










30 










35 










40 




ctg 


gtg 


gag 


age 


age 


ctg 


gca 


aaa 


tct 


tct 


acc 


egg 


gca 


aag 


ggg 


gee 


423 


Leu 


Val 


Glu 


Ser 


Ser 
45 


Leu 


Ala 


Lys 


Ser 


Ser 
50 


Thr 


Arg 


Ala 


Lys 


Gly 
55 


Ala 




agt 


ggg 


gtg 


gaa 


cag 


ggc 


get 


gtt 


egg 


gta 


gtg 


aac 


cct 


cct 


cca 


gtg 


471 


Ser 


Gly 


Val 


Glu 
60 


Gin 


Gly 


Ala 


Val 


Arg 
65 


Val 


Val 


Asn 


Pro 


Pro 
70 


Pro 


Val 




aga 


aga 


aga 


agg 


tat 


caa 


a 




















490 


Arg 


Arg 


Arg 
75 


Arg 


Tyr 


Gin 

























<210> 592 
<211> 537 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 197 . . 535 

<220> 

<221> sig_peptide 

<222> 197. .304 

<223> Von Heijne matrix 

score 3.79999995231628 

seq WLGCIVFPFS ALL /QQ 

<400> 592 

cctttttaaa caatatccat tctcttgatg tctacaatat cacctttctt atagattege 60 

atatacatgg ccaaaggaac aactccatgt tttctaaaag gectagagaa catacatcag 120 

ctcaccagga catgattata gcttctcaag cttatctaaa ataaaagata acatatatat 180 

caacattttt gatgaa atg atg act gaa aaa cat gag gat cac tgt etc aag 232 

Met Met Thr Glu Lys His Glu Asp His Cys Leu Lys 













-35 








-30 








-25 




age 


tgt 


agt 


ggt 


cac 


tea 


tat 


ata 


aga 


aag 


aat 


tgg 


ctt 


gga 


tgc 


att 


280 


Ser 


Cys 


Ser 


Gly 


His 


Ser 


Tyr 


He 


Arg 


Lys 


Asn 


Trp 


Leu 


Gly 


Cys 


He 












-20 










-15 










-10 






gtc 


ttc 


cct 


ttt 


tct 


get 


ctt 


ctg 


caa 


caa 


tct 


gag 


att 


agt 


gga 


aca 


328 


Val 


Phe 


Pro 


Phe 
-5 


Ser 


Ala 


Leu 


Leu 


Gin 
1 


Gin 


Ser 


Glu 


He 
5 


Ser 


Gly 


Thr 




ttt 


caa 


gta 


act 


att 


cca 


cca 


gtt 


ttg 


etc 


ggg 


tat 


act 


tgg 


agt 


aat 


376 


Phe 


Gin 


Val 


Thr 


He 


Pro 


Pro 


Val 


Leu 


Leu 


Gly 


Tyr 


Thr 


Trp 


Ser 


Asn 






10 










15 










20 












act 


tat 


gta 


ttt 


cct 


aaa 


gaa 


gat 


tec 


aat 


gag 


cag 


aat 


tta 


aaa 


gaa 


424 


Thr 


Tyr 


Val 


Phe 


Pro 


Lys 


Glu 


Asp 


Ser 


Asn 


Glu 


Gin 


Asn 


Leu 


Lys 


Glu 




25 










30 










35 










40 




tgt 


aca 


ttc 


tta 


aat 


att 


ttt 


get 


acc 


att 


gaa 


cct 


caa 


ata 


tea 


tat 


472 


Cys 


Thr 


Phe 


Leu 


Asn 


He 


Phe 


Ala 


Thr 


He 


Glu 


Pro 


Gin 


He 


Ser 


Tyr 










45 










50 










55 






gtc 


acc 


tgt 


aat 


cca 


aca 


eta 


gat 


aag 


ttt 


tta 


gat 


caa 


aca 


gag 


gtc 


520 


Val 


Thr 


Cys 


Asn 


Pro 


Thr 


Leu 


Asp 


Lys 


Phe 


Leu 


Asp 


Gin 


Thr 


Glu 


Val 










60 










65 










70 








ttg 


cag 


aga 


gca 


cag 


at 






















537 


Leu 


Gin 


Arg 


Ala 


Gin 



























75 



<210> 593 
<211> 488 
<212> DNA 



WO 02/083898 

<213> Homo sapiens 



323 



PCT/IB01/00914 



<220> 
<221> CDS 
<222> 79. .486 

<220> 

<221> sig_peptide 

<222> 79. .201 

<223> Von Heijne matrix 

score 3.59999990463257 

seq LHTSVTLFLLSYC/DC 

<400> 593 

agatgattcc ctgattctcc agagagatta cacacttcgt ttgtggctaa ggttactgtg 60 



acccaatgaa agaagaaa 


atg 


aaa 


gee 


ata 


aag 


aaa 


agt 


ctt 


aca 


gaa 


gaa 


111 












Met 


Lys 


Ala 


He 


Lys 


Lys 


Ser 


Leu 


Thr 


Glu 


Glu 
















-40 










-35 












gaa 


tac 


ctg 


tac 


ctg 


gac 


ttt 


tct 


cac 


caa 


aca 


gaa 


gga 


tgc 


ate 


ttt 


159 


Glu 


Tyr 


Leu 


Tyr 


Leu 


Asp 


Phe 


Ser 


His 


Gin 


Thr 


Glu 


Gly 


Cys 


He 


Phe 




-30 










-25 










-20 










-15 




cct 


ctt 


cat 


aca 


tct 


gta 


act 


tta 


ttt 


ctg 


tta 


tct 


tac 


tgt 


gac 


tgt 


207 


Pro 


Leu 


His 


Thr 


Ser 


Val 


Thr 


Leu 


Phe 


Leu 


Leu 


Ser 


Tyr 


Cys 


Asp 


Cys 












-10 










-5 










1 






aaa 


ate 


ttt 


aaa 


att 


tgc 


tta 


gtt 


gtc 


ace 


aaa 


gag 


gtg 


agt 


aga 


gat 


255 


Lys 


He 


Phe 


Lys 


He 


Cys 


Leu 


Val 


Val 


Thr 


Lys 


Glu 


Val 


Ser 


Arg 


Asp 








5 










10 










15 










agt 


tea 


eta 


eta 


aga 


gat 


gac 


ctg 


ate 


cag 


gat 


gtt 


gaa 


ata 


cag 


att 


303 


Ser 


Ser 


Leu 


Leu 


Arg 


Asp 


Asp 


Leu 


He 


Gin 


Asp 


Val 


Glu 


He 


Gin 


He 






20 










25 










30 












att 


tea 


agg 


cag 


gag 


etc 


cca 


cca 


ata 


gtc 


caa 


aat 


tgc 


tgt 


ttg 


cct 


351 


He 


Ser 


Arg 


Gin 


Glu 


Leu 


Pro 


Pro 


He 


Val 


Gin 


Asn 


Cys 


Cys 


Leu 


Pro 




35 










40 










45 










50 




gca 


gta 


gta 


gaa 


cga 


tea 


gac 


aat 


ttt 


tgt 


aga 


gca 


gga 


ctt 


get 


gtt 


399 


Ala 


Val 


Val 


Glu 


Arg 


Ser 


Asp 


Asn 


Phe 


Cys 


Arg 


Ala 


Gly 


Leu 


Ala 


Val 












55 










60 










65 






gta 


ttg 


aga 


cac 


ata 


ate 


cag 


aaa 


tec 


tat 


gaa 


gca 


gac 


ccc 


tta 


aag 


447 


Val 


Leu 


Arg 


His 


He 


He 


Gin 


Lys 


Ser 


Tyr 


Glu 


Ala 


Asp 


Pro 


Leu 


Lys 










70 










75 










80 








aag 


gaa 


ctt 


ttg 


gaa 


ctt 


ctg 


ggc 


ttw 


aaa 


aga 


ctt 


get 


tg 






488 


Lys 


Glu 


Leu 


Leu 


Glu 


Leu 


Leu 


Gly 


Xaa 


Lys 


Arg 


Leu 


Ala 











85 90 95 



<210> 594 
<211> 490 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 80. .490 

<220> 

<221> sig_peptide 

<222> 80. . 127 

<223> Von Heijne matrix 

score 3.59999990463257 

seq FSLLSISGPPISS/SA 

<400> 594 

gaatgtttat cctctggaca aaccagccag cctctccaga gcaggcgtgt gatctctgta 60 

cccccgcagt ggtcagaat atg gag aac ttc tea etc etc age ate tct gga 112 

Met Glu Asn Phe Ser Leu Leu Ser He Ser Gly 
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cct cca ate 
Pro Pro lie 
-5 

tct cgt gec 
Ser Arg Ala 



gag 
Glu 

att 
He 

ttg 

Leu 

60 

tac 

Tyr 



gca tec 
Ala Ser 

30 
gtc agg 
Val Arg 
45 

ggg gac 
Gly Asp 

egg ccc 
Arg Pro 



gag age cag 
Glu Ser Gin 

ggc aag tgg 
Gly Lys Trp 
110 



tct 
Ser 

acc 

Thr 

15 

age 

Ser 

caa 
Gin 

acc 
Thr 

tgg 
Trp 

ctg 

Leu 

95 

gac 

Asp 



tec tec 
Ser Ser 
1 

age ctg 
Ser Leu 

cca get 
Pro Ala 

ggg ata 
Gly He 

cag cac 
Gin His 
65 

ttc tec 
Phe Ser 
80 

gag gee 
Glu Ala 

aac tgc 
Asn Cys 



-15 
gee ctg 
Ala Leu 

cca gac 
Pro Asp 

cag gee 
Gin Ala 

35 
cag aac 
Gin Asn 
50 

gga gag 
Gly Glu 

ccc tac 
Pro Tyr 

tat gac 
Tyr Asp 

ctt tct 
Leu Ser 
115 



324 

agt get ttt 
Ser Ala Phe 
5 

att gca aag 
He Ala Lys 
20 

ctg cca ccc 
Leu Pro Pro 

aca gtg etc 
Thr Val Leu 



aag 
Lys 

age 
Ser 

ttc 
Phe 
100 
gag 
Glu 



ctg agg 
Leu Arg 

70 
tac ttc 
Tyr Phe 
85 

cca gag 
Pro Glu 

gac atg 
Asp Met 



-10 
ccc gac 
Pro Asp 

aca gca 
Thr Ala 

cag tac 
Gin Tyr 

40 
tea cca 
Ser Pro 
55 

egg aac 
Arg Asn 



att atg 
He Met 
10 

gta ccc 
Val Pro 
25 

caa age 
Gin Ser 

grc wgc 
Xaa Xaa 

tgc act 
Cys Thr 



gtg tgt gca gac 
Val Cys Ala Asp 
90 

gtg cag cag gat 
Val Gin Gin Asp 
105 

get gag aac 
Ala Glu Asn 
120 



ttc 
Phe 

act 
Thr 

ate 
He 

age 
Ser 

ate 

He 

75 

aaa 

Lys 

gag 
Glu 



160 



208 



256 



304 



352 



400 



448 



490 



<210> 595 
<211> 495 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 157 . .495 



<400> 595 

attctttcga caggagttga aaccagcacc ttccctttct 
cagaagggag ctcaaaagaa ctttgttgtt ttgectttta 
gaggtatgtg agatggtgaa atgatttget tetgee atg 

Met 
1 

gat cgc cct aaa etc teg gtg gee ccc tea gta 
Asp Arg Pro Lys Leu Ser Val Ala Pro Ser Val 

10 15 
caa gtc ctt gtc tct cca gca gtg gac ctg gaa 
Gin Val Leu Val Ser Pro Ala Val Asp Leu Glu 

25 30 
agg gac ttc act gag aaa ata atg aat gtc aaa 
Arg Asp Phe Thr Glu Lys He Met Asn Val Lys 



40 



45 



tea atg ctg gtt gtc tea act gtg ate att gtg 
Ser Met Leu Val Val Ser Thr Val He He Val 
55 60 65 

aac age aca gaa ggc tct ttc ttg tgg ata tat 
Asn Ser Thr Glu Gly Ser Phe Leu Trp He Tyr 

75 80 
gaa gtt gat gac age agt get cag aag ggc tgg 
Glu Val Asp Asp Ser Ser Ala Gin Lys Gly Trp 

90 95 
ttt aac aat ggg ate cac aat tat caa caa ggg 
Phe Asn Asn Gly He His Asn Tyr Gin Gin Gly 
105 110 



ctgagtcctg cctccttctg 
ctctggggtg aaagcggcag 
ctg ggg tea egg gtg 
Leu Gly Ser Arg Val 
5 

gtt ttg gaa gag gac 
Val Leu Glu Glu Asp 
20 

gma gga tgc egg etc 
Xaa Gly Cys Arg Leu 
35 

gga aaa gta att ctg 
Gly Lys Val He Leu 
50 

ttt tgg gaa ttt ate 
Phe Trp Glu Phe He 
70 

cac tea aaa aac cca 
His Ser Lys Asn Pro 
85 

tgg ttt ctg age tgg 
Trp Phe Leu Ser Trp 
100 



60 
120 
174 



222 



270 



318 



366 



414 



462 



495 
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<213> Homo sapiens 

<220> 
<221> CDS 
<222> 416. . 937 
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<400> 596 

aaaaggagta 

gaagatgaaa 

attggctggg 

agacccatca 

acaactggag 

tgggacctat 

ctgcgcacag 



gctattagcc 
gactagcgga 
aactgtatcc 
ataagaaatt 
gcaaagaggg 
atagcgtggc 
tcgcatttct 



aattcggcag 
agctctgcct 
acccaaatgt 
tctcagcctg 
ttgctcaacg 
ggaggcgggg 
ttccccgccc 



ggcccgcttt 
ctttccccag 
caccgatttc 
gccgaaaatg 
ccccgcctca 
cgatgattgt 
ctgagaccct 



gcg 


get 


999 


ctg 


ttt 


Ala 


Ala 


Gly 


Leu 
5 


Phe 


gcg 


acg 


cga 


999 


etc 


Ala 


Thr 


Arg 


Gly 


Leu 






20 






tec 


agg 


act 


gtg 


gtc 


Ser 


Arg 


Thr 


Val 


Val 




35 








gaa 


ccg 


ace 


aca 


ccg 


Glu 


Pro 


Thr 


Thr 


Pro 


50 










ttg 


tat 


gag 


aag 


aac 


Leu 


Tyr 


Glu 


Lys 


Asn 










70 


ttg 


gac 


gtc 


tgg 


aac 


Leu 


Asp 


Val 


Trp 


Asn 








85 




ate 


ctg 


gtc 


ctt 


ggc 


He 


Leu 


Val 


Leu 


Gly 






100 






atg 


aaa 


gag 


tgg 


tec 


Met 


Lys 


Glu 


Trp 


Ser 




115 








gag 


gee 


aat 


ggc 


ttc 


Glu 


Ala 


Asn 


Gly 


Phe 


130 










aga 


tec 


age 


tgc 


cag 


Arg 


Ser 


Ser 


Cys 


Gin 










150 


gaa 


gca 


ccg 


cct 


tec 


Glu 


Ala 


Pro 


Pro 


Ser 








165 





ggt ttg 
Gly Leu 

ccg gee 
Pro Ala 

gee ccg 
Ala Pro 
40 

tgg caa 
Trp Gin 
55 

cca gac 
Pro Asp 

atg cga 
Met Arg 

age acc 
Ser Thr 



age get cgc cgt 
Ser Ala Arg Arg 
10 

gee cgc 
Ala Arg 
25 

tec get gtg gcg 
Ser Ala Val Ala 



gtc cgc 
Val Arg 



gag gac cca 
Glu Asp Pro 



cgc 
Arg 

cca 
Pro 
135 
agg 
Arg 



cgc 
Arg 
120 
tea 
Ser 

atg 
Met 



tec 
Ser 

ctt 
Leu 

ttt 
Phe 
105 
gaa 
Glu 



cat 
His 



ggt 
Gly 
75 
gtc ttc 
Val Phe 
90 

gtg gee 
Val Ala 

get gag 
Ala Glu 



gaa 

Glu 

60 

tat 

Tyr 

ttc 
Phe 

tat 
Tyr 

agg 
Arg 



cca ccc 
Pro Pro 



tgg aat 
Trp Asn 

agt gac 
Ser Asp 

cct gee 
Pro Ala 
170 



cca act 
Pro Thr 
140 
cag ttg 
Gin Leu 
155 

tgc cat 
Cys His 



ttagaagctt gatttcyttt 
tggcgaggga acteggggeg 
ttcctatgea ggaaatgagc 
gttggcccca cgaagccacg 
ttggaaaacc aaatcagatc 
cgcgctcgca cccactgcag 
gcagcaccat ctgtc atg 
Met 
1 

ctt ttg gcg gca gcg 
Leu Leu Ala Ala Ala 
15 

tgg gaa tct age ttc 
Trp Glu Ser Ser Phe 
30 

gga aag egg ccc cca 
Gly Lys Arg Pro Pro 
45 

ccc gag gac gaa aac 
Pro Glu Asp Glu Asn 
65 

gac aag gac ccc gtt 
Asp Lys Asp Pro Val 
8 0 

ttt ggc gtc tec ate 
Phe Gly Val Ser He 
95 

ctg cct gac tac agg 
Leu Pro Asp Tyr Arg 
110 

ctt gtg aaa tac cga 
Leu Val Lys Tyr Arg 
125 

get teg acc cca gca 
Ala Ser Thr Pro Ala 
145 

eta agt ggg get caa 
Leu Ser Gly Ala Gin 
160 

tct gac ct 
Ser Asp 



60 
120 
180 
240 
300 
360 
418 



466 



514 



562 



610 



658 



706 



754 



802 



850 



898 



939 



<210> 597 
<211> 528 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 51. .527 



<400> 597 
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aagctggggg agctcgtcgc cgccgccggc ggctagcggg cgtccgcgcc atg gag 56 

Met Glu 
1 

cgc tac gcg gcc gcc ttg gag gag gtg gcg gac ggt gcc egg cag cag 104 
Arg Tyr Ala Ala Ala Leu Glu Glu Val Ala Asp Gly Ala Arg Gin Gin 

5 10 15 

gag cga cac tac cag ttg ctg teg gcg eta cag age ctg gtg aag gag 152 
Glu Arg His Tyr Gin Leu Leu Ser Ala Leu Gin Ser Leu Val Lys Glu 

20 " 25 30 

ttg ccc age tct ttc cag cag cgc ctg tec tac acc acg etc age gac 200 
Leu Pro Ser Ser Phe Gin Gin Arg Leu Ser Tyr Thr Thr Leu Ser Asp 
35 40 45 50 

ctg gcc ctg gcg ctt etc gac ggc act gtg ttc gaa ate gtg cag ggg 248 
Leu Ala Leu Ala Leu Leu Asp Gly Thr Val Phe Glu lie Val Gin Gly 

55 60 65 

eta ctg gag ate cag cac etc acc gaa aag age ctg tac aac cag cgc 296 
Leu Leu Glu lie Gin His Leu Thr Glu Lys Ser Leu Tyr Asn Gin Arg 

70 75 80 

ctg cgc eta cag aac gag cac cga gtg etc agg cag gcg ctg egg cag 344 
Leu Arg Leu Gin Asn Glu His Arg Val Leu Arg Gin Ala Leu Arg Gin 

85 90 95 

aag cac cag gaa gcc cag cag gcc tgc egg ccc cac aac ctg cct gtg 392 
Lys His Gin Glu Ala Gin Gin Ala Cys Arg Pro His Asn Leu Pro Val 

100 105 110 

gtt cag gcg get cag cag cga gaa eta gag gcc gtg gaa cac egg ate 440 
Val Gin Ala Ala Gin Gin Arg Glu Leu Glu Ala Val Glu His Arg lie 
115 120 125 130 

cgt gag gag cag egg gcg atg gac cat aag ate ate ctg gag ctg gac 488 
Arg Glu Glu Gin Arg Ala Met Asp His Lys lie lie Leu Glu Leu Asp 

135 140 145 

egg aag gtg get gac cag cag age aca ctg gag aag gcg g 528 
Arg Lys Val Ala Asp Gin Gin Ser Thr Leu Glu Lys Ala 
150 155 

<210> 598 
<211> 556 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 128 . . 556 

<400> 598 

cttctttggc cctgtgacac gtagcaaegg ggctggttca gggtctgaaa cagagtttgg 60 
gggttgtttg ggattagtga agctactgcc tttgccgcca gcgcasctca gagtttgatt 120 

atttgea atg tea ggc ttt gaa aac tta aac acg gat ttc tac cag aca 169 

Met Ser Gly Phe Glu Asn Leu Asn Thr Asp Phe Tyr Gin Thr 

15 10 

agt tac age ate gat gat cag tea cag cag tec tat gat tat gga gga 217 

Ser Tyr Ser lie Asp Asp Gin Ser Gin Gin Ser Tyr Asp Tyr Gly Gly 

15 20 25 30 

agt gga gga ccc tat age aaa cag tat get ggc tat gac tat teg cag 265 

Ser Gly Gly Pro Tyr Ser Lys Gin Tyr Ala Gly Tyr Asp Tyr Ser Gin 

35 4 0 4 5 

caa ggc aga ttt gtc cct cca gac atg atg cag cca caa cag cca tac 313 

Gin Gly Arg Phe Val Pro Pro Asp Met Met Gin Pro Gin Gin Pro Tyr 

50 55 60 

acc ggg cag att tac cag cca act cag gca tat act cca get tea cct 361 

Thr Gly Gin lie Tyr Gin Pro Thr Gin Ala Tyr Thr Pro Ala Ser Pro 

65 70 7 5 

cag cct ttc tat gga aac aac ttt gag gat gag cca cct tta tta gaa 409 

Gin Pro Phe Tyr Gly Asn Asn Phe Glu Asp Glu Pro Pro Leu Leu Glu 
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80 * 85 90 

gag tta ggt ate aat ttt gac cac ate tgg caa aaa aca eta aca gta 457 
Glu Leu Gly lie Asn Phe Asp His lie Trp Gin Lys Thr Leu Thr Val 
95 100 105 110 

tta cat ccg tta aaa gta gca gat ggc age ate atg aat gaa act gat 505 
Leu His Pro Leu Lys Val Ala Asp Gly Ser lie Met Asn Glu Thr Asp 

115 120 125 

ttg gca ggt cca atg gtt ttt tgc ctt get ttt tgg age aca ttg eta 553 
Leu Ala Gly Pro Met Val Phe Cys Leu Ala Phe Trp Ser Thr Leu Leu 

130 135 140 

ctg 556 
Leu 



<210> 599 
<211> 564 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 151. .564 



<400> 599 

ataacaaaac aatccaacta tactgaataa tacagaatca aactgggatc caaaaacaat 60 
ttaaaacaaa gttggaaact caaccgaaac aagaacaaat cgaatccgac atgaaacaag 120 
tctcacaagt gttcttaagc tgacagaaag atg gtc aaa acc aaa ctt aaa ace 174 

Met Val Lys Thr Lys Leu Lys Thr 

1 5 



gta 


gta 


ate 


gac 


ata 


acg 


gtc 


ata 


cca 


ttc 


gee 


gac 


gat 


ttt 


ttc 


tgc 


222 


Val 


Val 


He 


Asp 


He 


Thr 


Val 


He 


Pro 


Phe 


Ala 


Asp 


Asp 


Phe 


Phe 


Cys 






10 










15 










20 












cct 


gga 


get 


cca 


gee 


ggc 


gta 


gag 


ata 


tea 


cca 


gcg 


get 


gat 


gga 


cca 


270 


Pro 


Gly 


Ala 


Pro 


Ala 


Gly 


Val 


Glu 


He 


Ser 


Pro 


Ala 


Ala 


Asp 


Gly 


Pro 




25 










30 










35 










40 




tct 


cct 


gaa 


gga 


ccc 


ate 


gee 


gag 


cct 


gac 


gga 


gaa 


tct 


gac 


ggt 


gaa 


318 


Ser 


Pro 


Glu 


Gly 


Pro 


He 


Ala 


Glu 


Pro 


Asp 


Gly 


Glu 


Ser 


Asp 


Gly 


Glu 












45 










50 










55 






gaa 


gaa 


gta 


gag 


cct 


ccc 


gga 


gee 


aac 


gat 


gga 


gaa 


acg 


gtg 


ggt 


gag 


366 


Glu 


Glu 


Val 


Glu 


Pro 


Pro 


Gly 


Ala 


Asn 


Asp 


Gly 


Glu 


Thr 


Val 


Gly 


Glu 










60 










65 










70 








ttt 


gaa 


aca 


ggg 


gat 


gaa 


gat 


tta 


ggt 


gee 


att 


gat 


cca 


ggt 


ggt 


gaa 


414 


Phe 


Glu 


Thr 


Gly 


Asp 


Glu 


Asp 


Leu 


Gly 


Ala 


He 


Asp 


Pro 


Gly 


Gly 


Glu 








75 










80 










85 










gtc 


ate 


gga 


gca 


gaa 


gac 


gga 


gga 


ata 


gtt 


gaa 


gat 


gat 


tta 


ggt 


gee 


4 62 


Val 


He 


Gly 


Ala 


Glu 


Asp 


Gly 


Gly 


He 


Val 


Glu 


Asp 


Asp 


Leu 


Gly 


Ala 






90 










95 










100 












att 


ggt 


gee 


ggc 


ggt 


gaa 


gtc 


ate 


gga 


gca 


gag 


gat 


ggg 


gaa 


aca 


ggg 


510 


He 


Gly 


Ala 


Gly 


Gly 


Glu 


Val 


He 


Gly 


Ala 


Glu 


Asp 


Gly 


Glu 


Thr 


Gly 




105 










110 










115 










120 




gac 


gtg 


gat 


tta 


ggt 


ggt 


gee 


ggt 


ggt 


gaa 


gtc 


gtc 


gga 


gaa 


aca 


ggg 


558 


Asp 


Val 


Asp 


Leu 


Gly 


Gly 


Ala 


Gly 


Gly 


Glu 


Val 


Val 


Gly 


Glu 


Thr 


Gly 





125 130 135 



gag etc 564 
Glu Leu 



<210> 600 
<211> 421 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 198 . . 419 
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<220> 

<221> misc_f eature 
<222> 275. .278 
<223> n=a, g, c or t 



<400> 600 

cttttacttc 

ttcagtcatt 

aaaacctgaa 

gatctccaca 



aggcgaacct gaactcagtt aatgcaattt tgaagcatgc 
atcaagtcaa actaaaaacg gtgaaaggtt gcgactatta 
gacataagaa ctacacatga ggaatatgtc atttagcact 
gaagaca atg aga agt cat acc ata aca atg acg 

Met Arg Ser His Thr lie Thr Met Thr 

1 5 



tea gtc age age 
Ser Val Ser Ser 
15 

hat cat age gac 
Xaa His Ser Asp 
30 

act acc tgt ccc 
Thr Thr Cys Pro 
4 5 

tec tac tct gtt 
Ser Tyr Ser Val 
60 



tgg cct tac tec tec cac aga atg cgc ttt 
Trp Pro Tyr Ser Ser His Arg Met Arg Phe 



caa 
Gin 

atg 
Met 

att 



ccg 
Pro 

gat 



cca caa 
Pro Gin 

35 
gaa aaa 



20 
aac 
Asn 



25 
cca 
Pro 



ttc tea gca aca 
Phe Ser Ala Thr 
40 

ttg khs act act gtg tta 
Asp Glu Lys Leu Xaa Thr Thr Val Leu 

50 ^ 55 
ttc ate gtg gga ctg gtt ggg aac ata 
lie Phe lie Val Gly Leu Val Gly Asn He 
65 70 



cctgaataga 
ccaaatagga 
ttcacttttt 
aca act 
Thr Thr 
10 

awn nnn 
Xaa Xaa 

aat gtt 
Asn Val 

acc aca 
Thr Thr 



ate gc 
He 



60 
120 
180 
230 



278 



326 



374 



421 



<210> 601 
<211> 529 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 124 . .528 



<400> 601 

ggctgcgcct agtgggccgt tgccttacag ttgetgagag gaggegagag gegggggege 60 
tagggecgag atcatgtctg actgggagag gtttccttgg cagcagagga cgctaggttt 120 
ggg atg aaa gaa get ggg cag atg saa aat ctg gag age gcg agg gee 168 
Met Lys Glu Ala Gly Gin Met Xaa Asn Leu Glu Ser Ala Arg Ala 
15 10 15 

ggg egg tea gtc age acc cag act ggc age atg acc ggt cag ata cca 216 
Gly Arg Ser Val Ser Thr Gin Thr Gly Ser Met Thr Gly Gin He Pro 

20 25 30 

agg ctt tct aaa gtc aac ctt ttc act ctg etc age etc tgg atg gag 264 
Arg Leu Ser Lys Val Asn Leu Phe Thr Leu Leu Ser Leu Trp Met Glu 

35 4 0 4 5 

etc ttt cca gca gaa gee cag egg caa aaa tct cag aaa aat gaa grg 312 
Leu Phe Pro Ala Glu Ala Gin Arg Gin Lys Ser Gin Lys Asn Glu Xaa 

50 55 60 

gga aag cat gga ccc tta gga gat aat gaa gag agg acc aga gta tct 360 
Gly Lys His Gly Pro Leu Gly Asp Asn Glu Glu Arg Thr Arg Val Ser 

65 " 70 75 

act gac aaa aga cag gta aag aga act ggt ctt gtg gtg gtg aaa aac 408 
Thr Asp Lys Arg Gin Val Lys Arg Thr Gly Leu Val Val Val Lys Asn 
80 85 90 95 

atg aaa att gtt ggt etc cac tgt tct agt gaa gat tta cat gee ggg 456 
Met Lys He Val Gly Leu His Cys Ser Ser Glu Asp Leu His Ala Gly 

100 105 110 

cag att get ctt att aaa cat ggg tea agg ctg aaa aac tgt gat cct 504 
Gin He Ala Leu He Lys His Gly Ser Arg Leu Lys Asn Cys Asp Pro 

115 120 125 

tat ttt tec aga aaa cmt gtt ctg c 529 
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Tyr Phe Ser Arg Lys Xaa Val Leu 
130 " 135 



<210> 602 
<211> 208 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 24 . . 206 



<400> 602 

ccttccctgc ggccgcgcag age atg ccg gga gec gec ggc cgc egg tgt ccg 53 

Met Pro Gly Ala Ala Gly Arg Arg Cys Pro 
1 5 10 



gac 


cgc 


teg 


ccc 


ccg 


ttt 


gga 


ccc 


gac 


ttc 


ggt 


tct 


tct 


ggg 


gtg 


ttg 


101 


Asp 


Arg 


Ser 


Pro 


Pro 


Phe 


Gly 


Pro 


Asp 


Phe 


Gly 


Ser 


Ser 


Gly 


Val 


Leu 












15 










20 










25 






atg 


etc 


eta 


aag 


ccc 


gag 


age 


acg 


tgt 


cca 


gac 


cct 


age 


ctg 


tac 


gac 


149 


Met 


Leu 


Leu 


Lys 


Pro 


Glu 


Ser 


Thr 


Cys 


Pro 


Asp 


Pro 


Ser 


Leu 


Tyr 


Asp 










30 










35 










40 








gcd 


baa 


etc 


tgc 


ccg 


gtc 


cca 


gaa 


cca 


aag 


cca 


tgc 


egg 


ggt 


gtg 


gee 


197 


Ala 


Xaa 


Leu 


Cys 


Pro 


Val 


Pro 


Glu 


Pro 


Lys 


Pro 


Cys 


Arg 


Gly 


Val 


Ala 








45 










50 










55 










tct 


gac 


ccc 


ac 


























208 


Ser 


Asp 


Pro 































60 



<210> 603 
<211> 587 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 198. .587 



<400> 603 

cttttacttc aggegaaect gaactcagtt aatgcaattt tgaagcatgc cctgaataga 60 
ttcagtcatt atcaagtcaa actaaaaacg gtgaaaggtt gcgactatta ccaaatagga 120 
aaaacctgaa gacataagaa ctacacatga ggaatatgtc atttagcact ttcacttttt 180 
gatctccaca gaagaca atg aga agt cat ace ata aca atg acg aca act 230 

Met Arg Ser His Thr lie Thr Met Thr Thr Thr 
15 10 



tea 


gtc 


age 


age 


tgg 


cct 


tac 


tec 


tec 


cac 


aga 


atg 


cgc 


ttt 


ata 


ace 


278 


Ser 


Val 


Ser 


Ser 


Trp 


Pro 


Tyr 


Ser 


Ser 


His 


Arg 


Met 


Arg 


Phe 


He 


Thr 










15 










20 










25 








aat 


cat 


age 


gac 


caa 


ccg 


cca 


caa 


aac 


ttc 


tea 


gca 


aca 


cca 


aat 


gtt 


326 


Asn 


His 


Ser 


Asp 


Gin 


Pro 


Pro 


Gin 


Asn 


Phe 


Ser 


Ala 


Thr 


Pro 


Asn 


Val 








30 










35 










40 










act 


ace 


tgt 


ccc 


atg 


gat 


gaa 


aaa 


ttg 


eta 


tct 


act 


gtg 


tta 


ace 


aca 


374 


Thr 


Thr 


Cys 


Pro 


Met 


Asp 


Glu 


Lys 


Leu 


Leu 


Ser 


Thr 


Val 


Leu 


Thr 


Thr 






45 










50 










55 












tec 


tac 


tct 


gtt 


att 


ttc 


ate 


gtg 


gga 


ctg 


gtt 


ggg 


aac 


ata 


ate 


gee 


422 


Ser 


Tyr 


Ser 


Val 


lie 


Phe 


He 


Val 


Gly 


Leu 


Val 


Gly 


Asn 


He 


He 


Ala 




60 










65 










70 










75 




etc 


tat 


gta 


ttt 


ctg 


ggt 


att 


cac 


cgt 


aaa 


aga 


aat 


tec 


att 


caa 


att 


470 


Leu 


Tyr 


Val 


Phe 


Leu 


Gly 


He 


His 


Arg 


Lys 


Arg 


Asn 


Ser 


He 


Gin 


He 












80 










85 










90 






tat 


eta 


ctt 


aac 


gta 


gee 


att 


gca 


gac 


etc 


eta 


etc 


ate 


ttc 


tgc 


etc 


518 


Tyr 


Leu 


Leu 


Asn 


Val 


Ala 


He 


Ala 


Asp 


Leu 


Leu 


Leu 


lie 


Phe 


Cys 


Leu 





95 100 105 



WO 02/083898 

cct ttc cga ata atg tat 

Pro Phe Arg lie Met Tyr 
110 

gtg att ytg tgc aag gtt 

Val lie Leu Cys Lys Val 
125 



330 

cat att aac caa aac aag 
His lie Asn Gin Asn Lys 
115 

gtg 
Val 
130 



PCT/IB01/00914 

tgg aca eta ggt 566 

Trp Thr Leu Gly 

120 

587 



<210> 604 
<211> 528 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 327. .527 



<400> 604 

acaagaaaag aacatggtct agactgaagt accaactaaa tcatctcctt tcaaattatc 60 

accgacacca tcatggattc aagcaccgca cacagtccgg tgtttctggt atttcctcca 120 

gaaatcactg cttcagaata tgagtccaca gaactttcag ccacgacctt ttcaactcaa 180 

agccccttgc aaaaattatt tgctagaaaa atgaaaatct taggggatat ccattctggg 240 

gctctgtttt gttcattaat tetggagect tectaattge agtgaaaaga aaaaccacag 300 

aaactctgat aatattgagc cgaata atg aat ttt ctt agt gec ctg gga gca 353 

Met Asn Phe Leu Ser Ala Leu Gly Ala 

1 5 



ata 


get 


gga 


ate 


att 


etc 


etc 


aca 


ttt 


ggt 


ttc 


ate 


eta 


gat 


caa 


aac 


401 


He 


Ala 


Gly 


He 


He 


Leu 


Leu 


Thr 


Phe 


Gly 


Phe 


He 


Leu 


Asp 


Gin 


Asn 




10 










15 










20 










25 




tac 


att 


tgt 


ggt 


tat 


tct 


cac 


caa 


aat 


agt 


cag 


tgt 


aag 


get 


gtt 


act 


449 


Tyr 


He 


Cys 


Gly 


Tyr 
30 


Ser 


His 


Gin 


Asn 


Ser 
35 


Gin 


Cys 


Lys 


Ala 


Val 
40 


Thr 




gtc 


ctg 


ttc 


ttg 


gga 


att 


ttg 


att 


aca 


ttg 


atg 


act 


ttc 


age 


att 


att 


497 


Val 


Leu 


Phe 


Leu 
45 


Gly 


He 


Leu 


He 


Thr 
50 


Leu 


Met 


Thr 


Phe 


Ser 
55 


He 


He 




gaa 


tta 


ttc 


att 


tct 


ctg 


ctt 


tct 


caa 


ttt 


t 












528 


Glu 


Leu 


Phe 


He 


Ser 


Leu 


Leu 


Ser 


Gin 


Phe 

















60 65 



<210> 605 
<211> 227 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 62 . .226 



<400> 605 

atttceggtt ccggtgtcag ttcgaggege cgccgccgcc gccgcagccg ccggagccgc 60 
a atg cct aaa gga gga aga aag gga ggc cac aaa ggc egg gcg agg cag 109 
Met Pro Lys Gly Gly Arg Lys Gly Gly His Lys Gly Arg Ala Arg Gin 
1 5 10 15 

tat aca age cct gag gag ate gac gcg cas tgc agg ctg aga age aga 157 
Tyr Thr Ser Pro Glu Glu He Asp Ala Xaa Cys Arg Leu Arg Ser Arg 

20 25 30 

agg cca ggg aag aag agg age aaa aaa ggt gga gat ggg get gca ggt 205 
Arg Pro Gly Lys Lys Arg Ser Lys Lys Gly Gly Asp Gly Ala Ala Gly 

35 4 0 4 5 

gac ccc aaa aaa aaa aaa aaa a 227 
Asp Pro Lys Lys Lys Lys Lys 
50 55 



<210> 606 



WO 02/083898 

<211> 228 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 63. .227 

<400> 606 

agaacatcct aatcrwaaaa gttcgaggcg ccgccgccgc cgccgcagcc gccggagccg 60 
ca atg cct aaa gga gga aga aag gga ggc cac aaa ggc egg gcg agg 107 
Met Pro Lys Gly Gly Arg Lys Gly Gly His Lys Gly Arg Ala Arg 
1 5 10 15 

cag tat aca age cct gag gag ate gac gcg cas tgc agg ctg aga age 155 
Gin Tyr Thr Ser Pro Glu Glu lie Asp Ala Xaa Cys Arg Leu Arg Ser 

20 25 30 

aga agg cca ggg aag aag agg age aaa aaa ggt gga gat ggg get gca 203 
Arg Arg Pro Gly Lys Lys Arg Ser Lys Lys Gly Gly Asp Gly Ala Ala 

35 40 45 

ggt gac ccc aaa aaa aaa aaa aaa a 228 
Gly Asp Pro Lys Lys Lys Lys Lys 
50 55 

<210> 607 
<211> 378 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 119. .376 

<220> 

<221> misc_feature 
<222> 143, 318, 332 
<223> n=a, g, c or t 

<400> 607 

gaaagacgga gagaggaaaa gggacaaggg caaggcaaga gagaggegtc gtcgtttgat 60 
ctgctgccgg agtgcgaaga ggaagaggaa gaggaagagg agegceggag ttgtaaag 118 



atg 


tts 


ctg 


gtg 


gac 


tgg 


ttc 


tat 


ngg 


gtg 


ctg 


tea 


teg 


ctt 


ggg 


ctg 


166 


Met 


Xaa 


Leu 


Val 


Asp 


Trp 


Phe 


Tyr 


Xaa 


Val 


Leu 


Ser 


Ser 


Leu 


Gly 


Leu 




1 








5 










10 










15 






tgg 


cag 


aag 


gag 


get 


aag 


ate 


etc 


ttc 


ctt 


ggc 


etc 


gac 


aac 


gee 


ggc 


214 


Trp 


Gin 


Lys 


Glu 
20 


Ala 


Lys 


He 


Leu 


Phe 
25 


Leu 


Gly 


Leu 


Asp 


Asn 
30 


Ala 


Gly 




aag 


ace 


ace 


etc 


etc 


cac 


atg 


ctg 


aag 


gac 


gag 


egg 


etc 


gta 


cag 


cac 


262 


Lys 


Thr 


Thr 
35 


Leu 


Leu 


His 


Met 


Leu 
40 


Lys 


Asp 


Glu 


Arg 


Leu 
45 


Val 


Gin 


His 




cag 


cca 


acg 


cag 


tac 


ccc 


acg 


tea 


gaa 


gag 


ttg 


age 


ate 


ggc 


agg 


ate 


310 


Gin 


Pro 
50 


Thr 


Gin 


Tyr 


Pro 


Thr 
55 


Ser 


Glu 


Glu 


Leu 


Ser 
60 


He 


Gly 


Arg 


He 




aag 


ttc 


ang 


gcg 


ttc 


gac 


ctt 


ngg 


ggc 


cac 


cag 


ate 


gec 


cgc 


cgc 


gtc 


358 


Lys 


Phe 


Xaa 


Ala 


Phe 


Asp 


Leu 


Xaa 


Gly 


His 


Gin 


He 


Ala 


Arg 


Arg 


Val 




65 










70 










75 










80 




tgg 


aag 


gac 


tac 


tmc 


gee 


aa 




















378 


Trp 


Lys 


Asp 


Tyr 


Xaa 


Ala 

























85 



<210> 608 
<211> 455 
<212> DNA 

<213> Homo sapiens 



PCT/IB01/00914 

331 
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<220> 
<221> CDS 
<222> 63. .455 



<400> 608 

attsccacct gtgatgctca gagagaaacc atgagccctg aaaaacacca tattgttttg 60 

tg atg tea cct ctg gaa tgt tct gag tgt ttt ggt gac caa ctt ctg 107 

Met Ser Pro Leu Glu Cys Ser Glu Cys Phe Gly Asp Gin Leu Leu 
15 10 15 



cat 


agg 


ace 


tat 


ace 


tgg 


caa 


etc 


aca 


ttg 


cac 


tea 


agg 


cca 


aat 


tat 


155 


His 


Arg 


Thr 


Tyr 


Thr 


Trp 


Gin 


Leu 


Thr 


Leu 


His 


Ser 


Arg 


Pro 


Asn 


Tyr 












20 










25 










30 






aca 


aga 


aaa 


aga 


gat 


acc 


aga 


tct 


gaa 


age 


eta 


gaa 


att 


cca 


ate 


agt 


203 


Thr 


Arg 


Lys 


Arg 


Asp 


Thr 


Arg 


Ser 


Glu 


Ser 


Leu 


Glu 


He 


Pro 


He 


Ser 










35 










40 










45 








gtg 


gtt 


eta 


cct 


cag 


agg 


ggc 


aca 


gee 


gaa 


ccc 


ttc 


ccg 


agg 


etc 


cac 


251 


Val 


Val 


Leu 


Pro 


Gin 


Arg 


Gly 


Thr 


Ala 


Glu 


Pro 


Phe 


Pro 


Arg 


Leu 


His 








50 










55 










60 










aac 


ttg 


tac 


age 


acc 


cct 


cgc 


tgc 


gcg 


cag 


cag 


gee 


gee 


ctg 


ccc 


egg 


299 


Asn 


Leu 


Tyr 


Ser 


Thr 


Pro 


Arg 


Cys 


Ala 


Gin 


Gin 


Ala 


Ala 


Leu 


Pro 


Arg 






65 










70 










75 












ctg 


age 


cgc 


agg 


atg 


gcg 


age 


cag 


cac 


tec 


tat 


cca 


ctg 


aac 


cgc 


ttc 


347 


Leu 


Ser 


Arg 


Arg 


Met 


Ala 


Ser 


Gin 


His 


Ser 


Tyr 


Pro 


Leu 


Asn 


Arg 


Phe 




80 










85 










90 










95 




tec 


tec 


gtg 


cct 


tta 


gac 


ccc 


atg 


gag 


cgc 


ccc 


atg 


tec 


cag 


gee 


gac 


395 


Ser 


Ser 


Val 


Pro 


Leu 


Asp 


Pro 


Met 


Glu 


Arg 


Pro 


Met 


Ser 


Gin 


Ala 


Asp 












100 










105 










110 






ctg 


gag 


ctg 


gac 


tac 


aac 


ccg 


ccg 


egg 


gtg 


cag 


etc 


age 


gac 


gag 


atg 


443 


Leu 


Glu 


Leu 


Asp 


Tyr 


Asn 


Pro 


Pro 


Arg 


Val 


Gin 


Leu 


Ser 


Asp 


Glu 


Met 





115 120 125 



ttc ktg ttc car 455 
Phe Xaa Phe Gin 
130 



<210> 609 
<211> 366 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 68. .364 



<400> 609 

gtacagaaag attggtggac aggagcagcg gccggtgggm gagggegetc ggcggcggcc 60 



tgeggee 


atg 


gee 


acc 


gtg 


atg 


gca 


gcg 


acg 


gcg 


gcg 


gag 


egg 


gcg 


gtg 


109 






Met 


Ala 


Thr 


Val 


Met 


Ala 


Ala 


Thr 


Ala 


Ala 


Glu 


Arg 


Ala 


Val 








1 








5 










10 












ctg 


gag 


gag 


gag 


ttc 


cgc 


tgg 


ctg 


ctg 


cac 


gac 


gak 


ktg 


cac 


get 


gtg 


157 


Leu 


Glu 


Glu 


Glu 


Phe 


Arg 


Trp 


Leu 


Leu 


His 


Asp 


Xaa 


Xaa 


His 


Ala 


Val 




15 










20 










25 










30 




ttg 


wwg 


cag 


ctg 


cag 


gac 


ate 


etc 


aag 


gag 


gee 


tct 


ctg 


cgc 


ttc 


act 


205 


Leu 


Xaa 


Gin 


Leu 


Gin 


Asp 


He 


Leu 


Lys 


Glu 


Ala 


Ser 


Leu 


Arg 


Phe 


Thr 












35 










40 










45 






ctg 


ccg 


ggc 


tec 


ggc 


act 


gag 


ggg 


ccc 


gee 


aag 


caa 


gag 


aac 


ttc 


ate 


253 


Leu 


Pro 


Gly 


Ser 


Gly 


Thr 


Glu 


Gly 


Pro 


Ala 


Lys 


Gin 


Glu 


Asn 


Phe 


He 










50 










55 










60 








eta 


ggc 


age 


tgt 


ggc 


aca 


gac 


cak 


gtg 


aag 


ggt 


gtg 


ctg 


act 


ctg 


cak 


301 


Leu 


Gly 


Ser 


Cys 


Gly 


Thr 


Asp 


Xaa 


Val 


Lys 


Gly 


Val 


Leu 


Thr 


Leu 


Xaa 








65 










70 










75 










ggg 


gat 


gee 


etc 


age 


cag 


gcg 


gat 


gtg 


aac 


ctg 


aag 


atg 


ccc 


egg 


aac 


349 


Gly Asp 


Ala 


Leu 


Ser 


Gin 


Ala 


Asp 


Val 


Asn 


Leu 


Lys 


Met 


Pro 


Arg 


Asn 
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333 

80 85 90 

aac cag ctg ctg cac tt 366 
Asn Gin Leu Leu His 
95 



<210> 610 
<211> 498 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 209. .496 



<400> 610 

atgatgccag tgaatgcgta gtcccagcct cagcacaggg gagccacctt gaagctctca 
aatatcactg ttgtgaatac agagagggaa aaccaactgt aacgtgccac ccaaataaag 
ctcattacta gtcctgtcca gcaacgtgcc tctcctggcc ctagagttct tggaaatagc 
ccaggccaaa gagaaggcct ttctcccc atg gtc age cac acg ttc cac atg 

Met Val Ser His Thr Phe His Met 
1 5 
cgc aca gag gag tct gat gec tea cag gag ggc gat gac eta ccc aag 
Arg Thr Glu Glu Ser Asp Ala Ser Gin Glu Gly Asp Asp Leu Pro Lys 

10 15 20 

tec tea gca aac ace age cat ccc aag cag gat gac age ccc aag tec 
Ser Ser Ala Asn Thr Ser His Pro Lys Gin Asp Asp Ser Pro Lys Ser 
25 30 35 40 

tea gaa gaa acc ate cag ccc aag gag ggt gac ate ccc aag gee cca 
Ser Glu Glu Thr lie Gin Pro Lys Glu Gly Asp lie Pro Lys Ala Pro 

45 50 55 

gaa gaa acc ate caa tec aag aag gag gac etc ccc aag tec tea gaa 
Glu Glu Thr lie Gin Ser Lys Lys Glu Asp Leu Pro Lys Ser Ser Glu 

60 65 70 

aaa gee ate cag ccc aaa gag agt aac ate ccc aag tec tea gca aaa 
Lys Ala lie Gin Pro Lys Glu Ser Asn lie Pro Lys Ser Ser Ala Lys 

75 " 80 85 

ccc ate cag ccc aag ctg gca ata tt 
Pro lie Gin Pro Lys Leu Ala lie 
90 95 



60 
120 
180 
232 



280 



328 



376 



424 



472 



498 



<210> 611 
<211> 504 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 214. .504 



<400> 611 

aagacagaca eggaagtget gggaggegee gggagcccgt teggttgegg gtgtctctgg 60 
ccctgcggtc agccctggga acgtcccgga gagctagatt cctagaggcc cgattccgct 120 
agcccggaac agacaaagee agcgctcccg cccgctcccc gacttaggat ccgatgccgg 180 
cagcgtcctg gggcccccgt agegggctgg acc atg age ctg ctg gac ggc etc 234 

Met Ser Leu Leu Asp Gly Leu 

1 5 

get tec teg ccg egg get ccg ctg cag tec age aag gee agg atg aaa 282 
Ala Ser Ser Pro Arg Ala Pro Leu Gin Ser Ser Lys Ala Arg Met Lys 

10 15 20 

aag etc ccg aag aag age cag aat gag aag tac egg ctg aag tac ctg 330 
Lys Leu Pro Lys Lys Ser Gin Asn Glu Lys Tyr Arg Leu Lys Tyr Leu 

25 30 35 

egg ctg cgc aaa gcg gee aag gee acg gtg ttt gaa aat get get att 378 



WO 02/083898 PCT/IB01/00914 

334 



Arg 


Leu 


Arg 


Lys 


Ala 


Ala Lys 


Ala 


Thr 


Val 


Phe 


Glu 


Asn 


Ala 


Ala 


He 




40 










45 








50 










55 




tgt 


gat 


gaa 


att 


get 


cgt ctt 


gag 


gaa 


aaa 


ttt 


ctt 


aaa 


gca 


aaa 


gaa 


426 


Cys 


Asp 


Glu 


He 


Ala 
60 


Arg Leu 


Glu 


Glu 


Lys 
65 


Phe 


Leu 


Lys 


Ala 


Lys 
70 


Glu 




gaa 


aga 


agg 


tac 


ttg 


eta aag 


aag 


etc 


etc 


cag 


ctt 


cag 


get 


eta 


act 


474 


Glu 


Arg 


Arg 


Tyr 
75 


Leu 


Leu Lys 


Lys 


Leu 
80 


Leu 


Gin 


Leu 


Gin 


Ala 
85 


Leu 


Thr 




gaa 


ggg 


gta 


agt 


aca 


ggc mtg 


cag 


etc 


ctt 














504 


Glu 


Gly 


Val 
90 


Ser 


Thr 


Gly Xaa 


Gin 

95 


Leu 


Leu 

















<210> 612 
<211> 483 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 161. .481 



<400> 612 

gtgggcggtt cggcggctct gggcgctgtt gtttggtctt taggectgeg gaggggegtt 60 
atctggaggg ccgcgggtgc aggcegcagt gaeagggecg ctcgccccgc tagtcctgcc 120 
tgtctcccgg tgcagctgtg ttcgcggcct gcaggccaac 

teg gag tac ctg age cag aac ccg egg gtg gca 
Ser Glu Tyr Leu Ser Gin Asn Pro Arg Val Ala 

10 15 
ctg cgc tgc gac ggc gag act gac aaa cac tgg 
Leu Arg Cys Asp Gly Glu Thr Asp Lys His Trp 

25 30 
ttt ttg ctt cgc aac gee ggg gac ctg gee ccc 
Phe Leu Leu Arg Asn Ala Gly Asp Leu Ala Pro 

40 45 
tec get age acg gat gaa get gee gac gee gag 
Ser Ala Ser Thr Asp Glu Ala Ala Asp Ala Glu 

55 60 
egg cag ctg cag cag etc ate tec ttt tec atg 
Arg Gin Leu Gin Gin Leu He Ser Phe Ser Met 
70 75 80 

gtc ttc etc ggg tgc cga tac cct caa aaa gtt 
Val Phe Leu Gly Cys Arg Tyr Pro Gin Lys Val 

90 95 
agt atg get gaa ggc ate aa 483 
Ser Met Ala Glu Gly He 
105 



atg 


gcg 


cag 


gag 


gtg 


175 


Met 


Ala 


Gin 


Glu 


Val 




1 








5 




gee 


tgg 


gtg 


gag 


gcg 


223 


Ala 


Trp 


Val 


Glu 

20 


Ala 




cgc 


cac 


cgc 


egg 


gat 


271 


Arg 


His 


Arg 
35 


Arg 


Asp 




get 


ggc 


ggc 


get 


gee 


319 


Ala 


Gly 
50 


Gly 


Ala 


Ala 




age 


ggg 


ace 


cga 


aac 


367 


Ser 


Gly 


Thr 


Arg 


Asn 




65 












gee 


tgg 


gcg 


aac 


cac 


415 


Ala 


Trp 


Ala 


Asn 


His 
85 




atg 


gat 


aaa 


ata 


ctt 


463 


Met 


Asp 


Lys 


He 
100 


Leu 





<210> 613 
<211> 473 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 277. .471 



<400> 613 

aaactgtcct tgccaccaca attatttaat ggacccaaca gaaagtaacc ceggaaatta 60 

ggacacctca tcccaaaaga cctttaaata ggggaagtcc acttgtgcac ggctgctcct 120 

tgctatagaa gacctgggac agaggactgc tgtctgccct ctctggtcac cctgcctagc 180 

tagaggatct gtgaccccag ccatgaggac cctcgccatc ettgetgesa ttctcctggt 240 
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ggccctgcag gccagagcca ctccaggcaa gagctg atg agg ttg ctg cag ccc 294 

Met Arg Leu Leu Gin Pro 
1 5 



egg 


age 


aga 


ttg 


cag 


egg 


aca 


tec 


cag 


aag 


tgg 


ttg 


ttt 


esc ttg 


cat 


342 


Arg 


Ser 


Arg 


Leu 


Gin 


Arg 


Thr 


Ser 


Gin 


Lys 


Trp 


Leu 


Phe 


Xaa Leu 


His 










10 










15 










20 






ggc 


gac 


gaa 


age 


ttg 


get 


cca 


aag 


cat 


cca 


ggc 


tea 


agg 


aaa aac 


atg 


390 


Gly 


Asp 


Glu 


Ser 


Leu 


Ala 


Pro 


Lys 


His 


Pro 


Gly 


Ser 


Arg 


Lys Asn 


Met 








25 










30 










35 








gac 


tgc 


tat 


tgg 


cag 


aat 


ace 


age 


gtg 


cat 


tgc 


agg 


aga 


acg teg 


eta 


438 


Asp 


Cys 


Tyr 


Trp 


Gin 


Asn 


Thr 


Ser 


Val 


His 


Cys 


Arg 


Arg 


Thr Ser 


Leu 






40 










45 










50 










tgg 


aac 


ctg 


cat 


eta 


cca 


ggg 


aag 


act 


ctg 


ggc 


at 








473 


Trp 


Asn 


Leu 


His 


Leu 


Pro 


Gly 


Lys 


Thr 


Leu 


Gly 












55 










60 










65 













<210> 614 
<211> 522 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 279. .521 



<400> 614 

attctyaatt gsgecmaget tccggcctgg ccacccgcca gcaggtctca cccggttcgc 60 
atcagtcccg ggcccccggc gcccctcagt agtagcaaat tatcgcatcc ctgagcttag 120 
gatccccctt tagtttcaca tatgccccca cccccgtttc cactttgatg gcggcttcca 180 
aggagcatcc cttagtaaga accgtcgcaa atategtgat ttggatccac caaggcttgg 240 
ggcgagccgt cgaatgtgtg tggtagggga gcgggctg atg tta gca cgt ggt ctt 296 

Met Leu Ala Arg Gly Leu 
1 5 

gtt arg scy tgr aat kgg ggr aat agg ags cck gtg ttc acg att agt 344 
Val Xaa Xaa Xaa Asn Xaa Gly Asn Arg Xaa Pro Val Phe Thr lie Ser 

10 ~ 15 20 

ttc cac cac eta cca act tgg aac ctt tac ate tct cca age ctg agt 392 
Phe His His Leu Pro Thr Trp Asn Leu Tyr lie Ser Pro Ser Leu Ser 

25 30 35 

ttc ctt ate tgt gag aca get gcg aat ace ggt ttc cag aga gta gtt 440 
Phe Leu lie Cys Glu Thr Ala Ala Asn Thr Gly Phe Gin Arg Val Val 

40 45 50 

aag aac cct tgt gaa cac ctt ctt cag ctt cct teg acg caa aac aga 488 
Lys Asn Pro Cys Glu His Leu Leu Gin Leu Pro Ser Thr Gin Asn Arg 
55 60 65 , 70 

ctt ttt cat tgg agg aga aga agg gat ggc aga g 522 
Leu Phe His Trp Arg Arg Arg Arg Asp Gly Arg 
75 80 



<210> 615 
<211> 322 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 171. .320 



<400> 615 

agtggttcgg ccgcggcgac cccactccgg cggcgcgtcc cccgagcttg gtaeggctea 60 
gcccgtctcc cccgaagccg cgcgcccgcg cccgcgcccc teagteggtg gagcccgcag 120 
ccccccttgt ggcccgcggc agctccccgc ccgctcggcc cgcgcccgcc atg gtc 176 

Met Val 
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cgt ccg cgc cgt gcc ccg 
Arg Pro Arg Arg Ala Pro 
5 

ggt cgc ggc cgy cct ccg 
Gly Arg Gly Arg Pro Pro 
20 

cgc tec tgg cct gcc age 
Arg Ser Trp Pro Ala Ser 
35 40 

gg 



336 

tac cgc tec ggc gcc ggg 
Tyr Arg Ser Gly Ala Gly 
10 

egg ccc etc gtg gtg cgc 
Arg Pro Leu Val Val Arg 
25 30 
ccy ccg agg ccc gca gcc 
Pro Pro Arg Pro Ala Ala 
45 



1 

ggc ccc ctm ggg 224 

Gly Pro Leu Gly 

15 

gcc gtc cgc teg 272 
Ala Val Arg Ser 

tyc gcg gat ccg 320 
Xaa Ala Asp Pro 
50 

322 



<210> 616 
<211> 524 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 269. .523 



<400> 616 

attttatact aatggagaga gagagaaaga catagaagtc tcactgactt gcacagttta 60 

cttcccactg actagatctt ggataaggac ttttttgtct tattcataaa aggegattgt 120 

gttcaattat gttatgetea gaggtggaat atataatact gacttctatt gcttaaaaaa 180 

ggtgattatg aatggaggaa gaagtctgtg ggacaggaaa tttaaaataa aattttagag 240 

ttagatagat tatgagaaga aaaaatca atg cca agt ttt gtt gac cgc ate 292 

Met Pro Ser Phe Val Asp Arg lie 

1 5 

ttt ggt ggt gaa eta act agt atg ate atg tat gat caa tgc aga act 340 

Phe Gly Gly Glu Leu Thr Ser Met lie Met Tyr Asp Gin Cys Arg Thr 

10 15 20 

gtc tec ttg gtt cat gaa tct ttc ctt gat ttg tec etc cca gtt tta 388 

Val Ser Leu Val His Glu Ser Phe Leu Asp Leu Ser Leu Pro Val Leu 

25 30 35 40 

gat gat cag agt ggt aag aaa agt gta aat gat aaa aat ctg aaa aag 436 

Asp Asp Gin Ser Gly Lys Lys Ser Val Asn Asp Lys Asn Leu Lys Lys 

4 5 50 " 55 

aca gtg gag gat gaa gat caa gat agt gag gaa gaa aaa gat aac gac 484 

Thr Val Glu Asp Glu Asp Gin Asp Ser Glu Glu Glu Lys Asp Asn Asp 

60 65 70 

agt tac ata aaa gag aga agt gat att cct tct gga aca a 524 

Ser Tyr lie Lys Glu Arg Ser Asp lie Pro Ser Gly Thr 

75 80 85 



<210> 617 
<211> 274 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 75. .272 



<400> 617 

atacacttct tggctgtgtg cgctcagcag gacgtgggag gctccggctt caagacactc 60 

atccaagagg aagg atg gcc agt ate ttt tct aag ttg eta act ggc cgc 110 

Met Ala Ser lie Phe Ser Lys Leu Leu Thr Gly Arg 

1 5 10 

aat get tct ctg ctg ttt get acc atg ggc ace agt gtc ctg ace ace 158 

Asn Ala Ser Leu Leu Phe Ala Thr Met Gly Thr Ser Val Leu Thr Thr 

15 20 25 

ggg tac ctg ctg aac egg cag aaa gtg tgt gcc gag gtc egg gag cag 206 
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Gly 


Tyr 
30 


Leu 


Leu 


Asn 


Arg 


Gin 

35 


Lys 


Val 


Cys 


Ala 


Glu 
40 


Val 


Arg 


Glu 


Gin 




cct 


agg 


eta 


ttt 


cct 


cca 


age 


gca 


gac 


tac 


cca 


gac 


ctg 


cgc 


aas 


aca 


254 


Pro 


Arg 


Leu 


Phe 


Pro 


Pro 


Ser 


Ala 


Asp 


Tyr 


Pro 


Asp 


Leu 


Arg 


Xaa 


Thr 




45 










50 










55 










60 




aca 


act 


gca 


tgg 


ccg 


agt 


gc 




















274 


Thr 


Thr 


Ala 


Trp 


Pro 
65 


Ser 

























<210> 618 
<211> 243 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 21. .242 

<400> 618 

agggaagagg ctgtcgggaa atg gcg gec gcg gec ggg ctg ggg ctt cag egg 53 

Met Ala Ala Ala Ala Gly Leu Gly Leu Gin Arg 











1 








5 










10 




gag gca 


gca 


gag 


ggg 


aag 


tgg 


tea 


gcg 


tgg 


cga 


atg 


acg 


gaa 


gaa 


act 


101 


Glu Ala 


Ala 


Glu 


Gly 


Lys 


Trp 


Ser 


Ala 


Trp 


Arg 


Met 


Thr 


Glu 


Glu 


Thr 








15 










20 










25 








cgc att 


gtc 


tac 


tgg 


ate 


aag 


gac 


aga 


cag 


etc 


acc 


aac 


cgt 


gac 


age 


149 


Arg lie 


Val 


Tyr 


Trp 


He 


Lys 


Asp 


Arg 


Gin 


Leu 


Thr 


Asn 


Arg 


Asp 


Ser 






30 










35 










40 










ace ata 


ctg 


gaa 


ctt 


caa 


aaa 


gtt 


ctg 


aaa 


aca 


tgt 


tgg 


ttc 


gtt 


tgt 


197 


Thr lie 


Leu 


Glu 


Leu 


Gin 


Lys 


Val 


Leu 


Lys 


Thr 


Cys 


Trp 


Phe 


Val 


Cys 




45 










50 










55 












tgg agt 


att 


ttt 


caa 


ctt 


aaa 


ttc 


ctt 


ttc 


ctt 


ttt 


ttt 


ttt 


ttt 


k 


243 


Trp Ser 


He 


Phe 


Gin 


Leu 


Lys 


Phe 


Leu 


Phe 


Leu 


Phe 


Phe 


Phe 


Phe 






60 








65 










70 














<210> 619 






























<211> 499 






























<212> DNA 






























<213> Homo 


sapiens 


























<220> 
































<221> CDS 






























<222> 295. . 


498 




























<400> 619 






























aggaagttga 


tcaatcccga ggctgetgag agacggtggc 


gcgattggga 


cagtcgccag 


60 


ggatggctga 


gcgtgaagat geagegggtg tccgggctgc 


tctcctggac 


gctgagcaga 


120 


gtcctgtggc 


tctccggcct ctctgagccg ggagctgccc 


ggcagccccg 


gatcatggaa 


180 


gagaaagect 


agaggtttat gatttgatta 


t gaactatccg 


ggacccagaa 


aageccaata 


240 


ctttagaaga 


actggaagtg gtctcggaaa 


i gttgtgtgga 


agttcaggag 


ataa 


atg 


297 






























Met 
1 




aag aag 


aat 


ate 


tgg 


tta 


tta 


tea 


ggt 


tea 


cgc 


caa 


cag 


tac 


etc 


att 


345 


Lys Lys 


Asn 


He 


Trp 


Leu 


Leu 


Ser 


Gly 


Ser 


Arg 


Gin 


Gin 


Tyr 


Leu 


He 








5 










10 










15 








get ctt 


tgg 


cga 


etc 


tta 


ttg 


ttg 


gaa 


ate 


tac 


att 


tct 


gaa 


gga 


acc 


393 


Ala Leu 


Trp 


Arg 


Leu 


Leu 


Leu 


Leu 


Glu 


He 


Tyr 


He 


Ser 


Glu 


Gly 


Thr 






20 










25 










30 










cac tea 


aca 


gaa 


gaa 


gac 


ate 


aat 


aag 


cag 


ata 


aat 


gac 


aaa 


gag 


cga 


441 


His Ser 


Thr 


Glu 


Glu 


Asp 


He 


Asn 


Lys 


Gin 


He 


Asn 


Asp 


Lys 


Glu 


Arg 




35 










40 










45 












gtg gca 


get 


gca 


atg 


gaa 


aac 


ccc 


aac 


tta 


egg 


gaa 


att 


gtg 


gac 


agt 


489 


Val Ala 


Ala 


Ala 


Met 


Glu 


Asn 


Pro 


Asn 


Leu 


Arg 


Glu 


He 


Val 


Asp 


Ser 
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50 55 60 65 

gtg tec ttg a 499 

Val Ser Leu 



<210> 620 
<211> 771 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 613. .771 



<400> 620 

caggttcagg 

caagaggact 

ctctaatcaa 

gatctcccaa 

tctccaaggt 

ttaagtctgt 

accaaacaca 

gattcacaca 

tggtctgctg 

ccgctggatg 

ccacttctca 



gga tgc age 
Gly Cys Ser 
15 

aag ggt cca 
Lys Gly Pro 
30 

tct gtg get 
Ser Val Ala 



gcacagaacc caggcttgta ccatggtggt gggagaaaat gaccactggc 
gctgacctgt gcaccaggct agtacttatg actacaaatt ettactgett 
ctctgaggga agagggcatc tgatcattac aaaagggagg gcttataagt 
gaaggcagtg atetgetagt gectttgget ctgtacctct gctgggcatc 
ctaaggtaac atattaaatg tttttgtcag etaatgeagg ctcagtgact 
aagttaccca ggaagaagga ttataggaaa aataactcag taagtttaaa 
tttccattta gtgacaggaa tttaagcaag gacctgaagt agaatcaact 
gtagtaaata caaagtagaa caatgatctt ggcttcgctg tctggttcag 
gaatgeaata cacaagttaa gtcacactgc agactgtttt ctagctgtgg 
ccacttctag catagtagaa ctatgttagg aggaatggga aaagtgagca 
cc atg ttc ccc cct cct get gee agt etc tgc tec cat gtt 
Met Phe Pro Pro Pro Ala Ala Ser Leu Cys Ser His Val 
15 10 
aga gat cac cca cca gtt ggc cca gga cag acc aat agg 
Arg Asp His Pro Pro Val Gly Pro Gly Gin Thr Asn Arg 

20 25 
ate act eta act aca gee gaa etc acc tec aca aca gtc 
lie Thr Leu Thr Thr Ala Glu Leu Thr Ser Thr Thr Val 

35 40 45 

eta gee tgg act cct 
Leu Ala Trp Thr Pro 
50 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
651 



699 



747 



771 



<210> 621 
<211> 658 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 440. . 658 



<400> 621 

gctttgtaat tggtgcagct ctgagaacac 
cteggegggg ccgtgtctgg tetgeaagge 
cccacgctgc ctggcgactc gggccaccga 
cgcacacgct gatttgaacc ctgcttcgac 
atctgtcaac ctgtcttggg cagaaacagc 
tggcgctgtc tgcgcsggcg atccgccttt 
gagcagcagc tacctggcaa ctgaacccat 
ggtttctgcc acctctaag atg tgc cct 

Met Cys Pro 

1 

act ttt atg gec cgc ttc tec egg 
Thr Phe Met Ala Arg Phe Ser Arg 
15 

act cga ggg acc ctg gag gag atg 
Thr Arg Gly Thr Leu Glu Glu Met 
30 35 



aegcatgeat tecaggeggt gctgctgggg 
gcgcgggctc gtggggggtt ggctggggag 
atgtgagacc gagtcccttt atgtcaccag 
gtgtgtgtca tggcttaaaa atagctgeta 
ggeggegaca gagcaggagc gtcatggccg 
eggactgagg cccagcgcag cgcttgcaaa 
catcaccaca gccactcctg cagctgccac 
ggt aac tgg ctt tgg get tct atg 
Gly Asn Trp Leu Trp Ala Ser Met 
5 10 
agt age tea agg tct cct gtt cga 
Ser Ser Ser Arg Ser Pro Val Arg 
20 25 
cca acc gtt caa cat cct ttc etc 
Pro Thr Val Gin His Pro Phe Leu 
40 



60 
120 
180 
240 
300 
360 
420 
472 



520 



568 
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aat gtc ttc gag ttg 
Asn Val Phe Glu Leu 
45 

aac cat gcc gac gag 
Asn His Ala Asp Glu 
60 

<210> 622 
<211> 460 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 32. .460 

<400> 622 

aatttttttg tgacttcttc agaccacttt c atg act tct gga age aaa tgt 52 

Met Thr Ser Gly Ser Lys Cys 

1 5 



cct 


agt 


aca 


gac 


tea 


gga 


aaa 


gaa 


gaa 


tat 


att 


gcc 


acg 


ttc 


aaa 


gga 


100 


Pro 


Ser 


Thr 


Asp 


Ser 


Gly 


Lys 


Glu 


Glu 


Tyr 


He 


Ala 


Thr 


Phe 


Lys 


Gly 








10 










15 










20 










tct 


gaa 


tac 


ttc 


tgc 


tac 


gac 


ttg 


tct 


caa 


aac 


ccc 


att 


caa 


age 


age 


148 


Ser 


Glu 


Tyr 


Phe 


Cys 


Tyr 


Asp 


Leu 


Ser 


Gin 


Asn 


Pro 


He 


Gin 


Ser 


Ser 






25 










30 










35 












agt 


gat 


gaa 


ata 


act 


ctg 


tea 


ttt 


aaa 


acc 


ctt 


cag 


agg 


aat 


gga 


ctg 


196 


Ser 


Asp 


Glu 


He 


Thr 


Leu 


Ser 


Phe 


Lys 


Thr 


Leu 


Gin 


Arg 


Asn 


Gly 


Leu 




40 










45 










50 










55 




atg 


ctt 


cac 


act 


ggg 


aaa 


teg 


get 


gat 


tat 


gtc 


aat 


ctt 


gcc 


ctg 


aaa 


244 


Met 


Leu 


His 


Thr 


Gly 


Lys 


Ser 


Ala 


Asp 


Tyr 


Val 


Asn 


Leu 


Ala 


Leu 


Lys 












60 










65 










70 






aat 


gga 


get 


gtc 


tct 


ctg 


gtc 


att 


aat 


ttg 


gga 


tea 


ggg 


gcc 


ttt 


gaa 


292 


Asn 


Gly 


Ala 


Val 


Ser 


Leu 


Val 


He 


Asn 


Leu 


Gly 


Ser 


Gly 


Ala 


Phe 


Glu 










75 










80 










85 








gca 


eta 


gtg 


gag 


cct 


gtg 


aat 


gga 


aag 


ttt 


aat 


gat 


aat 


gcc 


tgg 


cat 


340 


Ala 


Leu 


Val 


Glu 


Pro 


Val 


Asn 


Gly 


Lys 


Phe 


Asn 


Asp 


Asn 


Ala 


Trp 


His 








90 










95 










100 










gat 


gtg 


aaa 


gtc 


acc 


arg 


aat 


ctg 


cgt 


cag 


gtg 


aca 


ata 


tea 


gtg 


gat 


388 


Asp 


Val 


Lys 


Val 


Thr 


Xaa 


Asn 


Leu 


Arg 


Gin 


Val 


Thr 


He 


Ser 


Val 


Asp 






105 










110 










115 












ggg 


att 


ctt 


acc 


aca 


acg 


ggc 


tac 


acg 


caa 


gaa 


gat 


tat 


acc 


atg 


ctg 


436 


Gly 


He 


Leu 


Thr 


Thr 


Thr 


Gly 


Tyr 


Thr 


Gin 


Glu 


Asp 


Tyr 


Thr 


Met 


Leu 




120 










125 










130 










135 




ggg 


tct 


gat 


gac 


ttt 


ttc 


tat 


gtt 


















460 


Gly 


Ser 


Asp 


Asp 


Phe 


Phe 


Tyr 


Val 





















140 



<210> 623 
<211> 481 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 145. . 480 

<400> 623 

tttttcacct tcttctcctg gaeagceggg gccaccattc cactggcttc attcttgetg 
etcagctgaa tacaaggect cagtggtttg ctgggctttg gtgagatctt aaaggcaaac 
ctctcttggc gcaaggaagg agee atg cag ttg tat ctg tac teg atg tec 

Met Gin Leu Tyr Leu Tyr Ser Met Ser 

1 5 



339 

gag egg etc etc tac aca ggc 
Glu Arg Leu Leu Tyr Thr Gly 

50 55 
gtc tgg cca ggc ttc tat etc 
Val Trp Pro Gly Phe Tyr Leu 
65 70 



PCT/IB01/00914 

aag aca gcc tgt 616 
Lys Thr Ala Cys 

gga gac 658 
Gly Asp 



60 
120 
171 
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340 



aca 


gcc 


ate 


atg 


ggg 


eta 


gat 


gaa 


cag 


get 


aga 


acc 


ctg 


gaa 


agg 


gaa 


219 


Thr 


Ala 


He 


Met 


Gly 


Leu 


Asp 


Glu 


Gin 


Ala 


Arg 


Thr 


Leu 


Glu 


Arg 


Glu 




10 










15 










20 










25 




gag 


aag 


aaa 


gaa 


aac 


agt 


gga 


gca 


ate 


aga 


tea 


gac 


aga 


tgt 


gta 


aag 


267 


Glu 


Lys 


Lys 


Glu 


Asn 
30 


Ser 


Gly 


Ala 


He 


Arg 

35 


Ser 


Asp 


Arg 


Cys 


Val 
40 


Lys 




gca 


tea 


ttc 


cag 


cag 


gga 


ctg 


gca 


aac 


aat 


att 


tec 


ttt 


tat 


eta 


tgc 


315 


Ala 


Ser 


Phe 


Gin 
45 


Gin 


Gly 


Leu 


Ala 


Asn 
50 


Asn 


He 


Ser 


Phe 


Tyr 
55 


Leu 


Cys 




cca 


cct 


ggc 


tac 


agt 


cca 


cgt 


atg 


etc 


aga 


ttc 


tct 


tct 


gcc 


tct 


gcc 


363 


Pro 


Pro 


Gly 
60 


Tyr 


Ser 


Pro 


Arg 


Met 
65 


Leu 


Arg 


Phe 


Ser 


Ser 
70 


Ala 


Ser 


Ala 




gcc 


ctg 


aga 


cag 


caa 


gac 


caa 


ccc 


ctt 


etc 


etc 


etc 


ttc 


etc 


etc 


etc 


411 


Ala 


Leu 
75 


Arg 


Gin 


Gin 


Asp 


Gin 
80 


Pro 


Leu 


Leu 


Leu 


Leu 
85 


Phe 


Leu 


Leu 


Leu 




age 


eta 


etc 


aac 


gtg 


aag 


agg 


atg 


aga 


atg 


aag 


acc 


ttt 


atg 


atg 


acc 


459 


Ser 


Leu 


Leu 


Asn 


Val 


Lys 


Arg 


Met 


Arg 


Met 


Lys 


Thr 


Phe 


Met 


Met 


Thr 




90 










95 










100 










105 




cgc 


ate 


cac 


tta 


atg 


aat 


agt 


a 


















481 


Arg 


He 


His 


Leu 


Met 


Asn 


Ser 























110 



<210> 624 
<211> 456 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 295. .456 



<400> 624 

aggeggtget 

gtgtttccca 

tegtcattte 

accagcacag 

cagctgaaac 



gcctgtgtcg tcgtccctgt 
gaeegtaacg ttccaaaggt 
ctatatatgt gttcaatggt 
ggecttgact cctaacaaca 
ttcacttgag accttctcat 



ttgttcttct 
agaaacacag 
ttggaatagt 
atcacacgaa 
tggecatcat 



ata cag 
He Gin 

gat get 
Asp Ala 

aat get 
Asn Ala 
35 

ttc tea 
Phe Ser 
50 



aga aac 
Arg Asn 
5 

gac atg 
Asp Met 
20 

aga tgc 
Arg Cys 

act cac 
Thr His 



ttg agt aat 
Leu Ser Asn 

gat get gtc 
Asp Ala Val 

gat cca gcc 
Asp Pro Ala 
4 0 

agt 
Ser 



gat gac aat age 
Asp Asp Asn Ser 
10 

tec cat tct tac 
Ser His Ser Tyr 
25 

agt ctg tac teg 
Ser Leu Tyr Ser 



tcaaacctca ggatttctcc 
atgacagatt tttggtcttt 
tcaccagtga agtcatctca 
cttggaagag gatccttccc 
cgacatctgg attc atg 
Met 
1 

agt cca gcc tgc atg 
Ser Pro Ala Cys Met 
15 

cac tgg atg tea aga 
His Trp Met Ser Arg 
30 

act etc agt tec ate 
Thr Leu Ser Ser He 
45 



60 
120 
180 
240 
297 



345 



393 



441 



456 



<210> 625 
<211> 551 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 137. .550 



<400> 625 

aaaaaaaacc agctgagtct ttgtgccagg aagactgcgt gcagaaggtg actgtctcag 



60 
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tggagctggg tcatctcagg ccttggctcc ttgaactttt ggccgccatg tgcttcccga 120 
agtcctctct gatgac atg aag aag ctg aag gcc cga atg cac cag gcc ata 172 

Met Lys Lys Leu Lys Ala Arg Met His Gin Ala lie 
15 10 



gaa 


aga 


ttt 


tat 


gat 


aaa 


atg 


caa 


aat 


gca 


gaa 


tea 


gga 


cgt 


gga 


cag 


220 


Glu 


Arg 


Phe 


Tyr 


Asp 


Lys 


Met 


Gin 


Asn 


Ala 


Glu 


Ser 


Gly 


Arg 


Gly 


Gin 








15 










20 










25 










gtg 


atg 


teg 


age 


ctg 


gca 


gag 


ctg 


gag 


gac 


gac 


ttc 


aaa 


gag 


ggc 


tac 


268 


Val 


Met 


Ser 


Ser 


Leu 


Ala 


Glu 


Leu 


Glu 


Asp 


Asp 


Phe 


Lys 


Glu 


Gly 


Tyr 






30 










35 










40 












ctg 


gag 


aca 


gtg 


gcg 


get 


tat 


tat 


gag 


gag 


cag 


cac 


cca 


gag 


etc 


act 


316 


Leu 


Glu 


Thr 


Val 


Ala 


Ala 


Tyr 


Tyr 


Glu 


Glu 


Gin 


His 


Pro 


Glu 


Leu 


Thr 




45 










50 










55 










60 




cct 


eta 


ctt 


gaa 


aaa 


gaa 


aga 


gat 


gga 


tta 


egg 


tgc 


cga 


ggc 


aac 


aga 


364 


Pro 


Leu 


Leu 


Glu 


Lys 


Glu 


Arg 


Asp 


Gly 


Leu 


Arg 


Cys 


Arg 


Gly 


Asn 


Arg 












65 










70 










75 






tec 


cct 


gtc 


ccg 


gat 


gtt 


gag 


gat 


ccc 


gca 


acc 


gag 


gag 


cct 


ggg 


gag 


412 


Ser 


Pro 


Val 


Pro 


Asp 


Val 


Glu 


Asp 


Pro 


Ala 


Thr 


Glu 


Glu 


Pro 


Gly 


Glu 










80 










85 










90 








age 


ttt 


tgt 


gac 


aag 


gtc 


atg 


aga 


tgg 


ttc 


cag 


gcc 


atg 


ctg 


cag 


egg 


460 


Ser 


Phe 


Cys 


Asp 


Lys 


Val 


Met 


Arg 


Trp 


Phe 


Gin 


Ala 


Met 


Leu 


Gin 


Arg 








95 










100 










105 










ctg 


cag 


ace 


tgg 


tgg 


cac 


ggg 


gtt 


ctg 


gcc 


tgg 


gtg 


aag 


gag 


aag 


gtg 


508 


Leu 


Gin 


Thr 


Trp 


Trp 


His 


Gly 


Val 


Leu 


Ala 


Trp 


Val 


Lys 


Glu 


Lys 


Val 






110 










115 










120 












gtg 


gcc 


ctg 


gtc 


cat 


gca 


gtg 


cag 


gcc 


etc 


tgg 


aaa 


cag 


ttc 


c 




551 


Val 


Ala 


Leu 


Val 


His 


Ala 


Val 


Gin 


Ala 


Leu 


Trp 


Lys 


Gin 


Phe 









125 130 135 



<210> 626 
<211> 494 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 283. .492 



<400> 626 

gaetttgega 

gttaattaaa 

tttcgttttc 

aegtatagge 

tgtagttagc 



aggcatggcg 
atggaaagaa 
atttaegctg 
ttgagtacgt 
atcgtcccta 



gggacactgt 
gacagaaggg 
eggaaactga 
gtccggccgc 
ageggaaega 



gaatgtcagc 
aaggtagaca 
cgtttttgcc 
atgtgtagtg 
ttttccgtga 



ttt eta cga gcc gta cag tat acg gag aag ctg 

Phe Leu Arg Ala Val Gin Tyr Thr Glu Lys Leu 
5 10 ' 15 

aag cga ttg ctt ttg cca tac ate gtg ctt aac 

Lys Arg Leu Leu Leu Pro Tyr lie Val Leu Asn 

25 ' 30 

act gag ccc agt ttg aga tgt ggg ctt caa tat 

Thr Glu Pro Ser Leu Arg Cys Gly Leu Gin Tyr 

40 45 

cga cct aga tgt att ctt gga gtc acc cag aaa 

Arg Pro Arg Cys lie Leu Gly Val Thr Gin Lys 

55 60 
gga ccg ag 
Gly Pro 
70 



ccagaaggtg ateagagect 
tcaggttctc cctggagact 
taacacccca tgtaatgtaa 
aaccctaaag ctttcctaat 
ac atg att tgt act 

Met lie Cys Thr 

1 

cac agg tec teg gca 
His Arg Ser Ser Ala 
20 

aaa gcg tgc ttg aag 
Lys Ala Cys Leu Lys 
35 

caa aag aaa acg ctg 
Gin Lys Lys Thr Leu 
50 

acc ate tgg acg cag 
Thr lie Trp Thr Gin 
65 



60 
120 
180 
240 
294 



342 



390 



438 



486 



494 



<210> 627 



WO 02/083898 



PCT/IB01/00914 



342 

<211> 585 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 128 . . 583 



<400> 627 

attttccggg aagatggcgg cgcacaagtc aggtccggca catgtttccg cggagcccca 60 
gcaatgacgg atgatatcac ctcttcttct ctggtgagag tctgaggata gagacttttt 120 

tctcacc atg aat gtc acc cca gag gtc aag agt cgt ggg atg aag ttt 169 

Met Asn Val Thr Pro Glu Val Lys Ser Arg Gly Met Lys Phe 

15 10 

get gag gag cag ctg eta aag cat gga tgg act caa ggc aaa ggc etc 217 

Ala Glu Glu Gin Leu Leu Lys His Gly Trp Thr Gin Gly Lys Gly Leu 

15 20 25 30 

ggc egg aag gag aat ggt ate act cag get etc agg gtg aca ctg aag 265 

Gly Arg Lys Glu Asn Gly lie Thr Gin Ala Leu Arg Val Thr Leu Lys 

35 40 45 

caa gac act cat ggg gta gga cat gac cct gee aag gag ttc aca aac 313 

Gin Asp Thr His Gly Val Gly His Asp Pro Ala Lys Glu Phe Thr Asn 

50 55 60 

cac tgg tgg aat gag etc ttc aac aag act gcg gee aac ttg gta gtg 361 

His Trp Trp Asn Glu Leu Phe Asn Lys Thr Ala Ala Asn Leu Val Val 

65 70 75 

gaa act ggg cag gat gga gta cag ata agg age ctt tct aag gag acc 409 

Glu Thr Gly Gin Asp Gly Val Gin lie Arg Ser Leu Ser Lys Glu Thr 

80 85 90 

acc cgt tat aat cat ccc aag ccc aac ttg ctg tat cag aag ttt gtg 457 

Thr Arg Tyr Asn His Pro Lys Pro Asn Leu Leu Tyr Gin Lys Phe Val 

95 100 105 110 

aag atg get aca ttg act tea ggt gga gag aag cca aac aaa gac ttg 505 

Lys Met Ala Thr Leu Thr Ser Gly Gly Glu Lys Pro Asn Lys Asp Leu 

115 120 125 

gag age tgc agt gat gac gac aac cag ggg tea agt ccc caa aga ttc 553 

Glu Ser Cys Ser Asp Asp Asp Asn Gin Gly Ser Ser Pro Gin Arg Phe 

130 135 140 

tta ctg atg aga tgc tgc tec aag cct gtg ar 585 
Leu Leu Met Arg Cys Cys Ser Lys Pro Val 

145 150 



<210> 628 
<211> 560 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 201. .560 



<400> 628 

atcytawtcg aaaagctctt tccttcttcc tcttgttcct cctcctgcct ctcttcgctt 60 
cgcctgcaaa cgcggtgggg gctgctcggc ggtcaggagc agcaagagac agagegacat 120 
gagagattgg accgcgggct gcactggaga atttactggt aggataattc atccctaaag 180 
agattgaagt gagcttcaga atg gca aaa gag gag ccc cag agt ate tea agg 233 

Met Ala Lys Glu Glu Pro Gin Ser lie Ser Arg 
15 10 
gac ttg cag gaa ctg cag aag aag ctg tct ctg ctg ata gac tec ttc 281 
Asp Leu Gin Glu Leu Gin Lys Lys Leu Ser Leu Leu lie Asp Ser Phe 

15 20 25 

cag aat aac tea aag gtg gtg gee ttt atg aag tct cca gtg ggt cag 329 
Gin Asn Asn Ser Lys Val Val Ala Phe Met Lys Ser Pro Val Gly Gin 



tac 
Tyr 

gtc 
Val 
60 
acc 
Thr 

ate 
He 

ttc 
Phe 
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30 

ttg gac age cat 
Leu Asp Ser His 
45 

atg teg gee gtt 
Met Ser Ala Val 
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343 



acc ctg 
Thr Leu 

tct gtg 
Ser Val 

gta tea 
Val Ser 
110 



get 
Ala 

ggt 

Gly 

95 

etc 

Leu 



get 

Ala 

80 

ggc 

Gly 

gee 
Ala 



ccg 
Pro 

cct 

Pro 

65 

ctg 

Leu 

ttc 
Phe 

atg 
Met 



ttt 

Phe 

50 

gtt 

Val 

ctg 
Leu 

tea 
Ser 

teg 
Ser 



35 

ctg gee 
Leu Ala 

gga ttc 
Gly Phe 

ggg gtc 
Gly Val 

ctg etc 
Leu Leu 
100 
ggg atg 
Gly Met 
115 



ttc 
Phe 

ttc 
Phe 

ata 

He 

85 

tgc 

Cys 

atg 
Met 



acc 
Thr 

ctg 

Leu 

70 

ata 

He 

ate 
He 

ata 
He 



ttg 

Leu 

55 

etc 

Leu 

ttg 
Leu 

etc 
Leu 

gca 
Ala 



40 

ctg gtg ttc att 
Leu Val Phe He 



ate gtg 
He Val 

gaa gga 
Glu Gly 

tgt ggk 
Cys Gly 
105 

tct 
Ser 
120 



gtg ctt 
Val Leu 

75 
ttg gtc 
Leu Val 
90 

ttg age 
Leu Ser 



377 



425 



473 



521 



560 



<210> 629 
<211> 728 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 420. 



728 



<400> 629 

tttttggagg gctcttagca acggccctgg ttgagccccc 
tcaatgtgcc attcttccag gegegaggea geageggctg 
cttcctccct gectgetaat tacctgctct tcccgatctc 
gtgctactga gaagsgggaa gaaacgtccg ccacccatcc 
agacttgatt agatggctaa caggggcccg agctatggct 
aagatcgagc agaagtatga tgcggacctg gagaacaagc 
cagtgcgccg aggacataga gcacccgccc cccggcaggc 
atg gac ggg acg gtc ctg tgc aag ctg ata aat 
Met Asp Gly Thr Val Leu Cys Lys Leu He Asn 
15 10 
gga caa gag ccc ata ccc aag ate tea gag tea 
Gly Gin Glu Pro He Pro Lys lie Ser Glu Ser 

20 25 
cag atg gag caa ate tec cag ttc eta aaa get 
Gin Met Glu Gin He Ser Gin Phe Leu Lys Ala 

35 40 
gtc aga acc acc gac ate ttt cag acg gtg gat 
Val Arg Thr Thr Asp He Phe Gin Thr Val Asp 

50 ~ 55 

gac atg gca get gtg cag agg acc ctg atg get 
Asp Met Ala Ala Val Gin Arg Thr Leu Met Ala 
65 70 75 

gtc acc aag gat gat ggc tgc tat egg gga gag 
Val Thr Lys Asp Asp Gly Cys Tyr Arg Gly Glu 

85 90 
agg aaa gee cag cag aat egg 
Arg Lys Ala Gin Gin Asn Arg 
100 



teagecatga gaaaatcaaa 
cagttcaaca tgaaaggagg 
atcgtttctg ectttgeaaa 
cccttgctgc ctgggggttc 
taagecgaga ggtgcaggag 
tggtggactg gatcatcctg 
ccattttcag aaatggtta 
agt tta tac cca cca 
Ser Leu Tyr Pro Pro 
15 

aag atg get ttt aag 
Lys Met Ala Phe Lys 
30 

gcg gag acc tat ggt 
Ala Glu Thr Tyr Gly 
45 

eta tgg gaa ggg aag 
Leu Trp Glu Gly Lys 
60 

tta ggc age gtt gca 
Leu Gly Ser Val Ala 
80 

cca tec tgg ttt cac 
Pro Ser Trp Phe His 
95 



60 
120 
180 
240 
300 
360 
419 
467 



515 



563 



611 



659 



707 



728 



<210> 630 
<211> 496 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
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<222> 261 . .494 
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<400> 630 

ttcgcggtta gggattccat agttcacggg ttacggcaca cacgcgttac gtatatatcg 
aggttaatct gcgcaastct tcacttttgt ctgaatgtat tttagcggtg aattactcct 
tctggctaag agcttcacat cttcagtgtt aatggtggtg gttttcgcat gtcttgcaca 
catttctaga tcttgcactg tcgaaaagtc agctccgaaa ttgctgcaat gataagtttg 
ctgaactgca tctctgccca atg cca ctt cct cgc aaa gac aac cca cag tgt 

Met Pro Leu Pro Arg Lys Asp Asn Pro Gin Cys 
1 5 10 

agt cca ctg ctg cct cct ttg gcc tct gtt ggt aag aaa atg get aag 
Ser Pro Leu Leu Pro Pro Leu Ala Ser Val Gly Lys Lys Met Ala Lys 

15 20 25 

aaa ate get get get cct cca cct cca cct cct cct cca tea gaa egg 
Lys lie Ala Ala Ala Pro Pro Pro Pro Pro Pro Pro Pro Ser Glu Arg 

30 35 4 0 

cga ggc aaa cgc gca gaa cag aaa ggg get gtg ggg cag egg cca gat 
Arg Gly Lys Arg Ala Glu Gin Lys Gly Ala Val Gly Gin Arg Pro Asp 

45 50 55 

tgg att ttc tct ccg cag ccc caa rgg atg ttt eta atg acc caa gga 
Trp lie Phe Ser Pro Gin Pro Gin Xaa Met Phe Leu Met Thr Gin Gly 
60 65 70 75 

aaw tta gtg gt 
Xaa Leu Val 



60 
120 
180 
240 
293 



341 



389 



437 



485 



496 



<210> 631 
<211> 500 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 91. .498 



<400> 631 

wgaacatcct aatcgaaaaa agattctaag acaagacttc taataeggea tctcagttta 60 
ttgttctttg gggtttgttt tcagatcctc atg tea gaa cca ggc aaa etc age 114 

Met Ser Glu Pro Gly Lys Leu Ser 



cag 


aaa 


ate 


aaa 


gtt 


tgg 


ctt 


cag 


1 

gag 


tac 


tgg 


aac 


5 
ate 


aca 


gat 


etc 


162 


Gin 


Lys 


He 


Lys 


Val 


Trp 


Leu 


Gin 


Glu 


Tyr 


Trp 


Asn 


He 


Thr 


Asp 


Leu 






10 










15 










20 












gtg 


gcc 


att 


tec 


aca 


ttc 


atg 


att 


gga 


gca 


att 


ctt 


cgc 


eta 


cag 


aac 


210 


Val 


Ala 


He 


Ser 


Thr 


Phe 


Met 


He 


Gly 


Ala 


He 


Leu 


Arg 


Leu 


Gin 


Asn 




25 










30 










35 










40 




cag 


ccc 


tac 


atg 


ggc 


tat 


ggc 


egg 


gtg 


ate 


tac 


tgt 


gtg 


gat 


ate 


ate 


258 


Gin 


Pro 


Tyr 


Met 


Gly 


Tyr 


Gly 


Arg 


Val 


He 


Tyr 


Cys 


Val 


Asp 


He 


He 












45 










50 










55 






ttc 


tgg 


tac 


ate 


cgt 


gtc 


ctg 


gac 


ate 


ttt 


ggt 


gtc 


aac 


aag 


tat 


ctg 


306 


Phe 


Trp 


Tyr 


He 


Arg 


Val 


Leu 


Asp 


He 


Phe 


Gly 


Val 


Asn 


Lys 


Tyr 


Leu 










60 










65 










70 








ggg 


cca 


tac 


gtg 


atg 


atg 


att 


gga 


aag 


atg 


atg 


ate 


gac 


atg 


ctg 


tac 


354 


Gly 


Pro 


Tyr 


Val 


Met 


Met 


He 


Gly 


Lys 


Met 


Met 


He 


Asp 


Met 


Leu 


Tyr 








75 










80 










85 










ttt 


gtg 


gtc 


ate 


atg 


ctg 


gtc 


gtg 


etc 


atg 


agt 


ttc 


gga 


gta 


gcc 


cgt 


402 


Phe 


Val 


Val 


He 


Met 


Leu 


Val 


Val 


Leu 


Met 


Ser 


Phe 


Gly 


Val 


Ala 


Arg 






90 










95 










100 












caa 


gcc 


att 


ctg 


cat 


cca 


gag 


gag 


aag 


ccc 


tct 


tgg 


aaa 


ctg 


gcc 


cga 


450 


Gin 


Ala 


He 


Leu 


His 


Pro 


Glu 


Glu 


Lys 


Pro 


Ser 


Trp 


Lys 


Leu 


Ala 


Arg 




105 










110 










115 










120 




aac 


ate 


ttc 


tac 


atg 


ccc 


tac 


tgg 


atg 


ate 


tat 


gga 


gag 


gtg 


ttg 


cag 


498 


Asn 


lie 


Phe 


Tyr 


Met 


Pro 


Tyr 


Trp 


Met 


He 


Tyr 


Gly 


Glu 


Val 


Leu 


Gin 





125 130 135 
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500 



<210> 632 
<211> 524 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 170. . 523 



<400> 632 

tttttccggt cttactcacg ttgcggcctt cctcgcgtca cagccgggat gaagccgatc 60 
ctactgcagg gccatgagcg gtccattacg cagattaagt ataaccgcga aggagacctc 120 
ctctttactg tggccaagga ccctatcgtc aatgtatggt actctgtga atg gtg aga 178 

Met Val Arg 
1 

ggc tgg gca cct aca tgg gcc ata ccg gag ctg tgt ggt gtg tgg acg 226 
Gly Trp Ala Pro Thr Trp Ala lie Pro Glu Leu Cys Gly Val Trp Thr 

5 10 15 

ctg act ggg aca cca age atg tec tea ctg get cag ctg aca aca get 274 
Leu Thr Gly Thr Pro Ser Met Ser Ser Leu Ala Gin Leu Thr Thr Ala 
20 25 30 35 

gtc gtc tct ggg act gtg aaa cag tct gga gag gtg ttg gtg aat gtt 322 
Val Val Ser Gly Thr Val Lys Gin Ser Gly Glu Val Leu Val Asn Val 

40 45 50 

aag gag cac tec egg cag ate aac gac ate cag tta tec agg gac atg 370 
Lys Glu His Ser Arg Gin lie Asn Asp lie Gin Leu Ser Arg Asp Met 

55 60 65 

acc atg ttt gtg acc gcg tec aag gac aac aca gcc aag ctt ttt gac 418 
Thr Met Phe Val Thr Ala Ser Lys Asp Asn Thr Ala Lys Leu Phe Asp 

70 75 80 

tec aca act ctt gaa cat cag aag act ttc egg aca gaa cgt cct gtc 466 
Ser Thr Thr Leu Glu His Gin Lys Thr Phe Arg Thr Glu Arg Pro Val 

85 90 95 

aac tea get gcc etc tec ccc aac tat gac cat gtg gtc ctg ggc ggt 514 
Asn Ser Ala Ala Leu Ser Pro Asn Tyr Asp His Val Val Leu Gly Gly 
100 105 110 115 

ggt cag gaa g 524 
Gly Gin Glu 



<210> 633 
<211> 508 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 141. .506 



<400> 633 

attggcttct ctcccagttt ccaccacact ccccatctgg gtaacctaag caggttactc 60 
gcccctgtga ggaggttctc ttctctgtac aatggaaatg gttatccaca cctatctctc 120 
agggaggctc tgatatcatc atg aga ccc age tgt get cct gga tgg ttt tac 173 

Met Arg Pro Ser Cys Ala Pro Gly Trp Phe Tyr 
1 5 10 

cac aag tec aat tgc tat ggt tac ttc agg aag ctg agg aac tgg tct 221 
His Lys Ser Asn Cys Tyr Gly Tyr Phe Arg Lys Leu Arg Asn Trp Ser 

15 20 25 

gat gcc gag etc gag tgt cag tct tac gga aac gga gcc cac ctg gca 269 
Asp Ala Glu Leu Glu Cys Gin Ser Tyr Gly Asn Gly Ala His Leu Ala 

30 35 4 0 

tct ate ctg agt tta aag gaa gcc age acc ata gca gag tac ata agt 317 
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Ser 


He 
45 


Leu 


Ser 


Leu 


Lys 


Glu 
50 


Ala 


Ser 


Thr 


He 


Ala 
55 


Glu 


Tyr 


He 


Ser 




ggc 


tat 


cag 


aga 


age 


cag 


ccg 


ata 


tgg 


att 


ggc 


ctg 


cac 


gac 


cca 


cag 


365 


Gly 


Tyr 


Gin 


Arg 


Ser 


Gin 


Pro 


He 


Trp 


He 


Gly 


Leu 


His 


Asp 


Pro 


Gin 




60 










65 










70 










75 




aag 


agg 


cag 


cag 


tgg 


cag 


tgg 


att 


gat 


ggg 


gee 


atg 


tat 


ctg 


tac 


aga 


413 


Lys 


Arg 


Gin 


Gin 


Trp 
80 


Gin 


Trp 


He 


Asp 


Gly 
85 


Ala 


Met 


Tyr 


Leu 


Tyr 
90 


Arg 




tec 


tgg 


tct 


ggc 


aag 


tec 


atg 


ggt 


ggg 


aac 


aag 


cac 


tgt 


get 


gag 


atg 


461 


Ser 


Trp 


Ser 


Gly 
95 


Lys 


Ser 


Met 


Gly 


Gly 
100 


Asn 


Lys 


His 


Cys 


Ala 
105 


Glu 


Met 




age 


tec 


aat 


aac 


aac 


ttt 


tta 


act 


tgg 


age 


age 


aac 


gaa 


tgc 


aac 


aa 


508 


Ser 


Ser 


Asn 
110 


Asn 


Asn 


Phe 


Leu 


Thr 
115 


Trp 


Ser 


Ser 


Asn 


Glu 
120 


Cys 


Asn 







<210> 634 
<211> 464 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 92. .412 

<220> 

<221> sig_peptide 
<222> 92. .160 
<223> Von Heijne matrix 
score 12 

seq RLLLLSLVSGALG/LG 

<220> 

<221> misc_feature 
<222> 325 

<223> n=a, g, c or t 



<400> 634 

tgctgccttc tctgatggct tctccagcgg cctcagtacc accttagegg tcttctgcca 60 
tgagctgccc cactaactgg gtgactttgc c atg ctg etc cag tea ggg ctg 112 

Met Leu Leu Gin Ser Gly Leu 
-20 



tec 


ttt 


egg 


egg 


ctg 


ctg 


ctg 


ctg 


age 


etc 


gtg 


tct 


gga 


gee 


ctg 


gga 


160 


Ser 


Phe 


Arg 


Arg 


Leu 


Leu 


Leu 


Leu 


Ser 


Leu 


Val 


Ser 


Gly 


Ala 


Leu 


Gly 






-15 










-10 










-5 












ttg 


ggg 


ggt 


gca 


gtc 


ctg 


ggg 


gtg 


ggg 


etc 


age 


ctg 


ggc 


cct 


gtc 


ccc 


208 


Leu 


Gly 


Gly 


Ala 


Val 


Leu 


Gly 


Val 


Gly 


Leu 


Ser 


Leu 


Gly 


Pro 


Val 


Pro 




1 








5 










10 










15 






etc 


act 


ccc 


tgg 


gtg 


ttt 


ggg 


gtc 


act 


get 


ggg 


gtc 


ttc 


etc 


tat 


gtg 


256 


Leu 


Thr 


Pro 


Trp 


Val 


Phe 


Gly 


Val 


Thr 


Ala 


Gly 


Val 


Phe 


Leu 


Tyr 


Val 










20 










25 










30 








gee 


ctt 


gtg 


gac 


atg 


eta 


cca 


gee 


ctg 


ctt 


cgt 


cct 


ccg 


gag 


ccc 


ctg 


304 


Ala 


Leu 


Val 


Asp 


Met 


Leu 


Pro 


Ala 


Leu 


Leu 


Arg 


Pro 


Pro 


Glu 


Pro 


Leu 








35 










40 










45 










cct 


acg 


ccc 


cat 


gtg 


etc 


ctn 


cag 


ggg 


ctg 


ggg 


ctg 


ctg 


ctg 


ggg 


ggc 


352 


Pro 


Thr 


Pro 


His 


Val 


Leu 


Leu 


Gin 


Gly 


Leu 


Gly 


Leu 


Leu 


Leu 


Gly 


Gly 






50 










55 










60 












ggc 


etc 


atg 


ctt 


gee 


ata 


acc 


ctg 


ctg 


gag 


gag 


egg 


eta 


ctg 


ccc 


gtg 


400 


Gly 


Leu 


Met 


Leu 


Ala 


He 


Thr 


Leu 


Leu 


Glu 


Glu 


Arg 


Leu 


Leu 


Pro 


Val 




65 










70 










75 










80 





acc act gag ggc tgatggggcc aattggaaag gggtcgggtt gcccttcctt 452 
Thr Thr Glu Gly 

ccccccaacc ac 4 64 
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<210> 635 
<211> 506 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 193. .468 



<220> 

<221> sig_peptide 

<222> 193. .264 

<223> Von Heijne matrix 

score 11.6999998092651 
seq FVLGLGLTPPTLA/QD 



<400> 635 

acacacactc acacaaggac gcsaacccca cctagatgca aagcaggatt caaaagaaca 60 
tctttgcgtt ttctaccggc tccccatcat cgtactaggg aggaagaagc gggtgagaaa 120 
caaaacttct ttccattgtc ctgcccgttt ctgcggactt gttctgaggc cgaggagcct 180 
gtgttggaag ag atg gtg atg ggc ctg ggc gtt ttg ttg ttg gtc ttc gtg 231 
Met Val Met Gly Leu Gly Val Leu Leu Leu Val Phe Val 
-20 -15 

279 



327 



ctg 


ggt 


ctg 


ggt 


ctg 


acc 


cca 


ccg 


acc 


ctg 


get 


cag 


gat 


aac 


tec 


agg 


Leu 


Gly 


Leu 


Gly 


Leu 


Thr 


Pro 


Pro 


Thr 


Leu 


Ala 


Gin 


Asp 


Asn 


Ser 


Arg 




-10 










-5 










1 








5 


tac 


aca 


cac 


ttc 


ctg 


acc 


cag 


cac 


tat 


gat 


gee 


aaa 


cca 


cag 


ggc 


egg 


Tyr 


Thr 


His 


Phe 


Leu 


Thr 


Gin 


His 


Tyr 


Asp 


Ala 


Lys 


Pro 


Gin 


Gly 


Arg 










10 










15 










20 




gat 


gac 


aga 


tac 


tgt 


gaa 


age 


ate 


atg 


agg 


aga 


egg 


ggc 


ctg 


acc 


tea 


Asp 


Asp 


Arg 


Tyr 


Cys 


Glu 


Ser 


He 


Met 


Arg 


Arg 


Arg 


Gly 


Leu 


Thr 


Ser 








25 










30 










35 






ccc 


tgc 


aaa 


gac 


ate 


aac 


aca 


ttt 


att 


cat 


ggc 


aac 


aag 


cgc 


asa 


tea 


Pro 


Cys 
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468 



taagaataag caagtcttct ttccaggtca ccacttgc 506 



<210> 636 
<211> 566 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 268. .507 



<220> 

<221> sig_peptide 

<222> 268 . .363 

<223> Von Heijne matrix 

score 8.89999961853027 
seq LGLFFGFLLVIFA/IE 



<400> 636 

actttagctc 

agtgccagag 

actgtgcttg 

accaagagca 

tatattctga 



gectaggatt tgagctgggt gtatgtctgc tcccagctca agtccctccg 60 

aggaaggcag ggagaagegg asacccctct ttgggccaag gecaaggagg 120 

ccgggattgc tgtccttgcc attggactat ggctccgatt cgactctcag 180 

tcttcgagca agaaactaat aataataatt ccagcttcta cacaggagtc 240 

teggagcegg cgccctc atg atg ctg gtg ggc ttc ctg ggc tgc 294 

Met Met Leu Val Gly Phe Leu Gly Cys 
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45 



aacagtttat ctcagacatc tgeccaaga 566 



<210> 637 
<211> 568 
<212> DNA 
<2 13> Homo sapiens 

<220> 
<221> CDS 
<222> 30. .527 



<220> 

<221> sig_peptide 

<222> 30. .74 

<223> Von Heijne matrix 

score 8.60000038146973 
seq PLLI ICLLPAIEG/KN 



<400> 637 
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Thr Ser Leu Trp Ala Val Ser Leu Ser Ser Ala Leu Leu Leu Ala lie 

125 130 135 

get gga gat gtb wet ttt act ggc aaa gga aga agg agg cag 527 
Ala Gly Asp Val Xaa Phe Thr Gly Lys Gly Arg Arg Arg Gin 

140 145 150 

taaagcagga acagggcagc ccgcatgtct tccagaagtg a 568 



<210> 638 
<211> 420 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 56. .217 



<220> 

<221> sig__peptide 

<222> 56. . 133 

<223> Von Heijne matrix 

score 8.39999961853027 
seq AAILLLTVPVAEG/VT 



<400> 638 

aaaactgaaa acagaacaaa cacaaacaca aacagagaat caacgaagaa gaaaa atg 58 

Met 

gtg aag gtg atg tgg gtt tec gtt tta get ctg gcg gcg gcg att etc 106 

Val Lys Val Met Trp Val Ser Val Leu Ala Leu Ala Ala Ala lie Leu 

-25 ~ *" -20 -15 -10 

ctt ttg acg gtc ccg gtg gca gaa ggg gtg acg tgc teg cct atg cag 154 

Leu Leu Thr Val Pro Val Ala Glu Gly Val Thr Cys Ser Pro Met Gin 

-5 15 
ctg get tea tgt gcg gcg gcg atg acg tea tct teg ccg cca teg gag 202 
Leu Ala Ser Cys Ala Ala Ala Met Thr Ser Ser Ser Pro Pro Ser Glu 

10 " 15 20 

get gtt gca caa age tgagagagca gcagccatgc ctttgtgggt acatgaggaa 257 
Ala Val Ala Gin Ser 
25 

ccctaccctc cgccaatacg ttagctcccc taaegcaagg aaagtctcca acagttgcaa 317 
gattccttcc ccaagctgtt aaggaaatgt taatcatgat taattagtga caagtttege 377 
tgattatagt ggttaatgct ggtcttatct tegtcagtae tac 420 



<210> 639 
<211> 526 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 276. .440 



<220> 

<221> sig_peptide 

<222> 276. .353 

<223> Von Heijne matrix 

score 8.30000019073486 
seq SVLLLAIVLISWG/CI 



<400> 639 

tatctcacgc gcggcagttc aagaacatca gacacgctca acaaacactt gttgettgae 60 

tgatacatta tggegggegg gcaaggcccc gatcacgtga taccgccccc caactccggg 120 

eggaagagge cagcgacagc ttgcacgaag cgagtgggcg ggcgctcgtg cccggctaaa 180 

agatggctgc tggcgcctgg cagccaccgc ctgggaggtt actgtaaggc ccgcagctcc 240 
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cgccagctcc cgcggactgc tgccgcctcc ttacc atg aag cca gta agt cgt 293 

Met Lys Pro Val Ser Arg 
-25 

cgc acg ctg gac tgg att tat tea gtg ttg ctg ctt gec ate gtt tta 341 

Arg Thr Leu Asp Trp lie Tyr Ser Val Leu Leu Leu Ala lie Val Leu 

-20 * -15 "* -10 -5 

ate tec tgg ggc tgc ate ate tat get teg atg gtg tct gca aga cga 389 

lie Ser Trp Gly Cys lie lie Tyr Ala Ser Met Val Ser Ala Arg Arg 

15 10 
cag eta agg aag aaa tac cca gac aaa ate ttt ggg acg aat gaa aat 437 
Gin Leu Arg Lys Lys Tyr Pro Asp Lys lie Phe Gly Thr Asn Glu Asn 

15 20 25 

ttg taactcttct ggatttaatt atctgaaaat acagttcttt ccctcatgct 490 
Leu 

tatgtagata taaaaataaa attcataatg caaagt 526 

<210> 640 
<211> 543 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 152. .454 

<220> 

<221> sig_peptide 

<222> 152. .217 

<223> Von Heijne matrix 

score 7.90000009536743 

seq ILLVALQARLSHS/RQ 

<400> 640 

aggaaattag gacacctcat cccaaaagac ctttaaatag gggaagtcca cttgtgcacg 60 
gctgctcctt gctatagaag acctgggaca gaggactget gtctgccctc tctggtcacc 120 
ctgcctagct agaggatctg tgaccccagc c atg agg acc etc gee ate ctt 172 

Met Arg Thr Leu Ala lie Leu 
-20 

get gee att etc ctg gtg gee ctg cag gec agg ctg age cac tec agg 220 

Ala Ala lie Leu Leu Val Ala Leu Gin Ala Arg Leu Ser His Ser Arg 

-15 -10 -5 1 

caa gag ctg atg agg ttg ctg cag ccc egg age aga ttg cag egg aca 268 

Gin Glu Leu Met Arg Leu Leu Gin Pro Arg Ser Arg Leu Gin Arg Thr 

5 10 15 

tec cag aag tgg ttg ttt ccc ttg cat ggg acg aaa get tgg etc caa 316 
Ser Gin Lys Trp Leu Phe Pro Leu His Gly Thr Lys Ala Trp Leu Gin 

20 25 30 

age ate cag get caa gga aaa aca tgg cct get att gca gaa tac cag 364 
Ser lie Gin Ala Gin Gly Lys Thr Trp Pro Ala lie Ala Glu Tyr Gin 

35 40 45 

cgt gca ttg cag gag aac gtc get atg gaa cct gca tct acc agg gaa 412 
Arg Ala Leu Gin Glu Asn Val Ala Met Glu Pro Ala Ser Thr Arg Glu 
50 55 60 65 

gac tct ggg cat tct get get gag ctt gca gaa aaa gaa aaa 454 
Asp Ser Gly His Ser Ala Ala Glu Leu Ala Glu Lys Glu Lys 

70 75 
tgagctcaaa atttgetttg agagctacag ggaattgeta ttactcmtgt accttctgct 514 
caatttcctt tcctcatctc aaataaatg 543 

<210> 641 
<211> 336 
<212> DNA 
<213> Homo sapiens 



WO 02/083898 



PCT/IB01/00914 



351 

<220> 
<221> CDS 
<222> 77 . .247 

<220> 

<221> sig_peptide 

<222> 77. . 139 

<223> Von Heijne matrix 

score 7.80000019073486 

seq SVLIFCLLNLAIS/DR 

<220> 

<221> misc_feature 
<222> 301 

<223> n=a, g, c or t 
<400> 641 

atctgagcta gggaagaatg tgtattctgt tgttggatga agtagtctac agatgtcagt 60 
tatatccagt tgagta atg gtg ttg ttg agt tea aca atg tct gta ctg att 112 

Met Val Leu Leu Ser Ser Thr Met Ser Val Leu lie 
-20 -15 -10 

ttc tgc ctg eta aat ctg gec att tct gat aga agg gta atg aag tct 160 
Phe Cys Leu Leu Asn Leu Ala lie Ser Asp Arg Arg Val Met Lys Ser 

-5 15 
cca act ata ata gtg att cat eta ttt ttt cct cac agt tct ace agt 208 
Pro Thr lie lie Val lie His Leu Phe Phe Pro His Ser Ser Thr Ser 

10 15 20 

att tgc att gag aat ttg gtc act ctg tta tta ggc aca taagaatcct 257 
lie Cys lie Glu Asn Leu Val Thr Leu Leu Leu Gly Thr 

25 30 35 

cagctgeatt ttgeattatt tccagagttc ttggttgtaa tcantggata tacaggctgt 317 
ggaagactta ccccatttc 336 

<210> 642 
<211> 507 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 110. .496 

<220> 

<221> sig_peptide 

<222> 110. . 187 

<223> Von Heijne matrix 

score 6.59999990463257 

seq LFWLASGWTPAFA/YS 

<400> 642 

aagatgcgga ggagggtgac gcactagctc tccagttcgc ccgttcctgg cctgaccccc 60 
accaaggccc atacegcagt aggctcctcg ggctgcccct eggttgaca atg gtc tec 118 

Met Val Ser 
-25 

agg atg gtc tct ace atg eta tct ggc eta ctg ttt tgg ctg gca tct 166 
Arg Met Val Ser Thr Met Leu Ser Gly Leu Leu Phe Trp Leu Ala Ser 

-20 -15 -10 

gga tgg act cca gca ttt get tac age ccc egg acc cct gac egg gtc 214 
Gly Trp Thr Pro Ala Phe Ala Tyr Ser Pro Arg Thr Pro Asp Arg Val 

-5 1 5 

tea gaa gca gat ate cag agg ctg ctt cat ggt gtt atg gag caa ttg 262 
Ser Glu Ala Asp lie Gin Arg Leu Leu His Gly Val Met Glu Gin Leu 



WO 02/083898 

10 

ggc att gcc agg ccc 
Gly lie Ala Arg Pro 
30 

ctt gtg ggc ccc cag 
Leu Val Gly Pro Gin 
4 5 

cat ttg ggt cct ttt 
His Leu Gly Pro Phe 
60 

gga gac aac att cct 
Gly Asp Asn lie Pro 
75 

etc caa ate cct gtc 
Leu Gin He Pro Val 
90 

tagaaaacac c 

<210> 643 
<211> 466 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 14 . . 310 

<220> 

<221> sig_peptide 
<222> 14 . . 136 
<223> Von Heijne matrix 

score 5.40000009536743 

seq WLCPGLRLPGSLA/GR 

<400> 643 

gtaaacagac aac atg gcg gcc gcg gtg kss gcg gca cct ggg gcc ttg 49 
Met Ala Ala Ala Val Xaa Ala Ala Pro Gly Ala Leu 
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Glu 
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Glu 
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35 
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340 
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55 



ataaggaatt gtggtgcttg gaataaacag agcttatggc aaaaattcac tcagtaaaaa 400 
tcttataaaa atgctatcaa aagctgtgga tgctttgaaa tctgataaga aagtaeggae 460 
cataat 466 



PCT/IB01/00914 

352 

15 20 25 

cga gtg gaa tat cca get cac cag gcc atg aat 310 
Arg Val Glu Tyr Pro Ala His Gin Ala Met Asn 

35 40 
age att gaa ggt gga get cat gaa gga ctt cag 358 
Ser He Glu Gly Gly Ala His Glu Gly Leu Gin 

50 55 
ggc aac ate ccc aac ate gtg gca gag ttg act 406 
Gly Asn lie Pro Asn lie Val Ala Glu Leu Thr 

65 70 
aag gac ttt agt gag gat cag ggg tac cag ace 454 
Lys Asp Phe Ser Glu Asp Gin Gly Tyr Gin Thr 

80 85 
ctg ttg gaa aaa cag atg atg gat gtc 496 
Leu Leu Glu Lys Gin Met Met Asp Val 
95 100 

507 



<210> 644 
<211> 502 
<212> DNA 
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<213> Homo sapiens 
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<220> 
<221> CDS 
<222> 19. .342 



<220> 

<221> sig_peptide 

<222> 19. .75 

<223> Von Heijne matrix 

score 4.19999980926514 
seq AVAASAASGQAEG/KK 



<400> 644 

gcgcgctgct cttctaag atg get gec get acc ggt gcg gtg gca gec teg 51 

Met Ala Ala Ala Thr Gly Ala Val Ala Ala Ser 
-15 -10 

ate acc gat ctg egg gtc 99 
lie Thr Asp Leu Arg Val 
5 

aac tta gac ate acc gga 147 
Asn Leu Asp lie Thr Gly 
20 

cag get att gaa gag gaa 195 
Gin Ala lie Glu Glu Glu 
35 40 
gtt tea act gat act cca 243 
Val Ser Thr Asp Thr Pro 
55 

aaa cat gaa gca gat gag 291 
Lys His Glu Ala Asp Glu 
70 

get ttt ate aag gta aag 339 
Ala Phe lie Lys Val Lys 
85 

tgt taattactca ggttaagatg cttattttaa gectagtatt ttgcatcatg 392 
Cys 

aattttgtgc tttttgttga acaatggtca ctgaagtatg taggttaaag akdaagacct 452 
atgtaatatt gacaacgttt aatattgtca atctcattaa ctctcataat 502 
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<210> 645 
<211> 493 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 264 . .443 



<220> 

<221> sig_peptide 

<222> 264 . .386 

<223> Von Heijne matrix 

score 4.19999980926514 
seq VFWMS FYLSRVNC / TT 



<400> 645 

ttcgtgaaat ttgttggttt tgaggactgt aaaagtgatt tcatactctg aatataaaac 60 

tggataatag ggtaatgttt taaaatttat tatgetatta ttcagaatgc caaagtatta 120 

ttttttttcc caaaatcagt ctggacattt actacttttt agactttttg aegttgaact 180 

ttctgtataa aaattggctg ggttttgagc ttttggtaag aaataaaagc cgattaagca 240 

ctggccgccc cgcgtggtac cca atg ccc gag tea ctg tgg cag cat teg cac 293 

Met Pro Glu Ser Leu Trp Gin His Ser His 
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-40 -35 
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cgt 


gtg 


aac 


tgc 


act 


389 


Leu 


Glu 


Val 


Phe 


Trp 


Met 


Ser 


Phe 


Tyr 


Leu 


Ser 


Arg 


Val 


Asn 


Cys 


Thr 




-15 










-10 










-5 










1 




aca 


aaa 


gag 


acc 


age 


cca 


ctt 


ccc 


aag 


cca 


gec 


aga 


cac 


ctg 


ggt 


ctt 


437 


Thr 


Lys 


Glu 


Thr 


Ser 


Pro 


Leu 


Pro 


Lys 


Pro 


Ala 


Arg 


His 


Leu 


Gly 


Leu 





5 10 15 

gag cca taaactggcg tagttaagct ttgeagctte cagtgtattt tatttattct 493 
Glu Pro 



<210> 646 
<211> 540 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 175. . 354 



<220> 

<221> sig_peptide 
<222> 175. . 309 
<223> Von Heijne matrix 
score 4 

seq ENFLSLLSKSCSA/DP 



<400> 646 

attttaaaat ggaaattggg gcgaggggtt ggcggctggg 
ggagaatgtc actttacccg agaaattcac tacgatgggc 
gcaggcgcac tgcacgagtg gaggtgtggc tagtggctgt 



cat age ctt 
His Ser Leu 

age ttt agg 
Ser Phe Arg 

aat ttc ctt 
Asn Phe Leu 
-10 

tea act ttc 
Ser Thr Phe 
5 

tctgagcagc 
aagagctctt 
catcttggtt 



ttc att gcg age ttg 
Phe lie Ala Ser Leu 
-40 

ttt aat tgg ttc gac 
Phe Asn Trp Phe Asp 
-25 

age ctt etc age aaa 
Ser Leu Leu Ser Lys 
-5 

atg agg gac att gag 
Met Arg Asp lie Glu 
10 

tacaaaaaga agaccagtct 
gcagtatgaa gaccacatgg 
teggacaegt gaagaattca 



aaa gtt ctt 
Lys Val Leu 

-35 
tgc ctt etc 
Cys Leu Leu 
-20 

agt tgt tct 
Ser Cys Ser 

aca aac aaa 
Thr Asn Lys 
15 

atectgetgg 
aaagagaggc 
tctcagactg 



cgaagggagc attcaaaagc 60 
cattggtttc tgctcggctc 120 
gatgagataa atcc atg 177 

Met 

-45 

ttt tat tac agt ttt 225 
Phe Tyr Tyr Ser Phe 
-30 

cac aat ttg ggc gag 273 
His Asn Leu Gly Glu 
-15 

gcg gac ccg tct ggg 321 
Ala Asp Pro Ser Gly 
1 

tgaaatatgg gttaaagtac 374 



agacagtggc caegtgaaga 434 
cacatggaac caacagtcag 494 
tgtatc 540 



<210> 647 
<211> 505 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 100. .498 



<220> 

<221> sig_peptide 
<222> 100. . 138 



WO 02/083898 PCT/IB01/00914 
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<223> Von Heijne matrix 

score 3.79999995231628 
seq MAHCVTLVQLSIS/CD 



<400> 647 

attttgtgaa gcggcgaagg aggtggtggc tgcgttgggc tccgggaagc cgttcgggct 60 
ggggctgtcg gccgcggggc ggaggcactc gcgcggggg atg gcc cac tgc gtg 114 

Met Ala His Cys Val 
-10 



acc 


ttg 


gtt 


cag 


ctg 


tec 


att 


tec 


tgt 


gac 


cat 


etc 


att 


gac 


aag 


gac 


162 


Thr 


Leu 


Val 


Gin 
-5 


Leu 


Ser 


He 


Ser 


Cys 
1 


Asp 


His 


Leu 


He 

5 


Asp 


Lys 


Asp 




ate 


ggc 


tec 


aag 


tct 


gac 


cca 


etc 


tgc 


gtc 


ctt 


tta 


cag 


gat 


gtg 


gga 


210 


He 


Gly 


Ser 


Lys 


Ser 


Asp 


Pro 


Leu 


Cys 


Val 


Leu 


Leu 


Gin 


Asp 


Val 


Gly 






10 










15 










20 












ggg 


ggc 


age 


tgg 


get 


gag 


ctt 


ggc 


egg 


act 


gaa 


egg 


gtg 


egg 


aac 


tgc 


258 


Gly 


Gly 


Ser 


Trp 


Ala 


Glu 


Leu 


Gly Arg 


Thr 


Glu 


Arg 


Val 


Arg 


Asn 


Cys 




25 










30 










35 










40 




tea 


age 


cct 


gag 


ttc 


tec 


aag 


act 


eta 


cag 


ctt 


gag 


tac 


cgc 


ttt 


gag 


306 


Ser 


Ser 


Pro 


Glu 


Phe 


Ser 


Lys 


Thr 


Leu 


Gin 


Leu 


Glu 


Tyr 


Arg 


Phe 


Glu 












45 










50 










55 






aca 


gtc 


cag 


aag 


eta 


cgc 


ttt 


gga 


ate 


tat 


gac 


ata 


gac 


aac 


aag 


acg 


354 


Thr 


Val 


Gin 


Lys 


Leu 


Arg 


Phe 


Gly 


He 


Tyr 


Asp 


He 


Asp 


Asn 


Lys 


Thr 










60 










65 










70 








cca 


gag 


ctg 


agg 


gat 


gat 


gac 


ttc 


eta 


ggg 


ggt 


get 


gag 


tgt 


tec 


eta 


402 


Pro 


Glu 


Leu 


Arg 


Asp 


Asp 


Asp 


Phe 


Leu 


Gly 


Gly 


Ala 


Glu 


Cys 


Ser 


Leu 








75 










80 










85 










gga 


cag 


att 


gtg 


tec 


age 


cag 


gta 


ctg 


act 


etc 


ccc 


ttg 


atg 


ctg 


aag 


450 


Gly 


Gin 


He 


Val 


Ser 


Ser 


Gin 


Val 


Leu 


Thr 


Leu 


Pro 


Leu 


Met 


Leu 


Lys 






90 










95 










100 












cct 


gga 


aaa 


cct 


get 


ggg 


egg 


ggg 


aca 


tea 


egg 


tct 


cag 


etc 


agg 


aat 


498 


Pro 


Gly 


Lys 


Pro 


Ala 


Gly 


Arg 


Gly 


Thr 


Ser 


Arg 


Ser 


Gin 


Leu 


Arg 


Asn 




105 










110 










115 










120 





taaagga 505 



<210> 648 
<211> 472 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 78 . .281 



<220> 

<221> sig_peptide 

<222> 78 . .218 

<223> Von Heijne matrix 

score 3.70000004768372 
seq PLLSLHSRGGSSS/ES 



<400> 648 

attttgegaa eggegagcag cggcgcggcg eggagagacs ageggaggtt ttcctggttt 60 
cggaccccag eggcegg atg gtg aaa tec tec ctg cag egg ate etc aat 110 

Met Val Lys Ser Ser Leu Gin Arg He Leu Asn 
-45 -40 
age cac tgc ttc gcc aga gag aag gaa ggg gat aaa ccc age gcc acc 158 
Ser His Cys Phe Ala Arg Glu Lys Glu Gly Asp Lys Pro Ser Ala Thr 

-35 -30 -25 

ate cac gcc age cgc acc atg ccg etc eta age ctg cac age cgc ggc 206 
He His Ala Ser Arg Thr Met Pro Leu Leu Ser Leu His Ser Arg Gly 
-20 -15 -10 -5 

ggc age age agt gag agt tec agg gtc tec etc cac tgc tgt agt aac 254 



WO 02/083898 PCT/IB01/00914 
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Gly Ser Ser Ser Glu Ser Ser Arg Val Ser Leu His Cys Cys Ser Asn 

15 10 
ccg ggt ccg ggg cct egg tgg tgc tec tgatgcccct cacccacccc 301 
Pro Gly Pro Gly Pro Arg Trp Cys Ser 

15 20 
tgaagatccc aggtgggcga gggaatagtc agagggatca caatctttca gctttttggg 361 
ctttgagatt gtgagacegg ggcatcccct tatccccaag agacccgacg ettgettcat 421 
ggcctacacg ttcgagagag agtcttcggg agaggaggag gagtagggee g 472 



<210> 649 
<211> 458 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 173. .373 



<220> 

<221> sig_peptide 

<222> 173. .265 

<223> Von Heijne matrix 

score 3.70000004768372 
seq ASWALGRWGSCRA/LD 



<400> 649 

agaatctgac attccaggac tccactaggc ctgttttcct ctgtgggaac tagagecaag 60 
gegagagace cgtgccagcc ccgaggctcc cggggcccat gggcaggegg ggctggtaca 120 
gggccccggg gctctctcag cccatctgtc acccctcccc cccaacaccc ag atg tec 178 

Met Ser 
-30 



cgt 


ctt 


ggc 


agt 


get 


gtc 


ccg 


gac 


tgc 


cac 


att 


aat 


get 


cca 


gaa 


ccc 


226 


Arg 


Leu 


Gly 


Ser 


Ala 


Val 


Pro 


Asp 


Cys 


His 


He 


Asn 


Ala 


Pro 


Glu 


Pro 












-25 










-20 










-15 






gee 


age 


tgg 


gec 


ctg 


ggg 


egg 


tgg 


ggc 


tec 


tgc 


aga 


gee 


ctg 


gac 


aaa 


274 


Ala 


Ser 


Trp 


Ala 


Leu 


Gly 


Arg 


Trp 


Gly 


Ser 


Cys 


Arg 


Ala 


Leu 


Asp 


Lys 










-10 










-5 










1 








ggc 


gag 


gga 


ggg 


agg 


gaa 


ggg 


gat 


cct 


aag 


cac 


ccc 


tec 


etc 


cct 


ggc 


322 


Gly 


Glu 


Gly 


Gly 


Arg 


Glu 


Gly 


Asp 


Pro 


Lys 


His 


Pro 


Ser 


Leu 


Pro 


Gly 






5 










10 










15 












cct 


agg 


agg 


cag 


aca 


tgc 


ccg 


atg 


aga 


tea 


aca 


ttg 


atg 


aat 


tgt 


tgg 


370 


Pro 


Arg 


Arg 


Gin 


Thr 


Cys 


Pro 


Met 


Arg 


Ser 


Thr 


Leu 


Met 


Asn 


Cys 


Trp 




20 










25 










30 










35 





agt tagagagtga agaggagaga ageeggaaaa tccagggact cctgaagtca 423 
Ser 

tgtgggaaac ctgtcgagga cttcatccag ggagc 458 



<210> 650 
<211> 482 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 8 . . 250 



<220> 

<221> sig_peptide 
<222> 8 . . 64 

<223> Von Heijne matrix 
score 3.5 

seq RLLRAS VARH VSA/ I P 



WO 02/083898 



PCT/IB01/00914 



<400> 650 
actgaag atg 
Met 

cga cat gtg 
Arg His Val 
-5 

agg cct get 
Arg Pro Ala 

ggt tec agt 
Gly Ser Ser 
30 

ttg tgt gtc 
Leu Cys Val 
45 

ttc acg atg 
Phe Thr Met 
60 

tagecatett 
cgcactcttg 
ttctggtctt 
gagegtcaac 



gcg get get 
Ala Ala Ala 

agt gee att 
Ser Ala lie 
1 

gca tgt gga 
Ala Cys Gly 
15 

caa gca aaa 
Gin Ala Lys 

eta cag aaa 
Leu Gin Lys 



357 

gta gga egg ttg etc cga 
Val Gly Arg Leu Leu Arg 
-15 -10 
cct tgg ggc att tct gec 
Pro Trp Gly lie Ser Ala 
5 

aga acg age ttg aca aat 
Arg Thr Ser Leu Thr Asn 
20 

att att tct tct ate ctt 
lie lie Ser Ser lie Leu 
35 

ttg ttg ctt tta gtg aca 
Leu Leu Leu Leu Val 



50 



Thr 
55 



gcg teg 
Ala Ser 

act gca 
Thr Ala 

tta ttg 
Leu Leu 

25 
tgg att 
Trp He 
40 

aag eta 
Lys Leu 



gtt gee 
Val Ala 

gee etc 
Ala Leu 
10 

tgt tct 
Cys Ser 

tea cct 
Ser Pro 

acg aat 
Thr Asn 



tgaactgtga agttgtcgca gtctcagtgg attcccactt 



gcctggataa atacaccaag aaagaatggt ggtttgggcc acatgaacat 
tcagacttaa ctaagcagat ttcccgagac tacggtgtgc tgttagaagg 
gcactaagag gtctcttcat aattgaccct aatggagtca tcaagcattt 
ga 



49 



97 



145 



193 



241 



290 



350 
410 
470 
482 



<210> 651 
<211> 291 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 130. .288 



<220> 

<221> sig_peptide 
<222> 130. .234 
<223> Von Heijne matrix 
score 3.5 

seq GLVEVVAVGVVRS/DQ 



<400> 651 

agtctctgtc ggagtctgtt tcccagtgca cccaacttgt gatagaaggt atatgttctg 60 
ctgaaaaaga gaaaaaggaa aaaaaagaat aaaagcagag agcagagaag gacagggtct 120 
ggctgccgc atg gag ggg agg age agg age agg aaa gee tgt ctg gag gtt 171 
Met Glu Gly Arg Ser Arg Ser Arg Lys Ala Cys Leu Glu Val 









35 








-30 










25 






get 


aca 


gta 


aac 


cag 


gtc 


ttc 


gtg ggg 


ttg 


gtc 


gag 


gtg 


gtg 


gca 


gtg 


Ala 


Thr 


Val 


Asn 


Gin 


Val 


Phe 


Val Gly 


Leu 


Val 


Glu 


Val 


Val 


Ala 


Val 




-20 










-15 








-10 










ggg 


gtg 


gtg 


aga 


agt 


gat 


cag 


att aag 


aat 


atg 


tat 


tta 


aaa 


tac 


age 


Gly 


Val 


Val 


Arg 


Ser 


Asp 


Gin 


He Lys 


Asn 


Met 


Tyr 


Leu 


Lys 


Tyr 


Ser 


-5 










1 






5 










10 




cag 


cat 


gat 


ctg 


ctg 


atg 


gac 


tag 
















Gin 


His 


Asp 


Leu 


Leu 


Met 


Asp 



















15 



<210> 652 
<211> 766 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 



WO 02/083898 



<222> 166. . 333 



358 



PCT/IB01/00914 



<400> 652 

aaaaggagta gctattagcc aattcggcag ggcccgcttt ttagaagctt gatttcyttt 
gaagatgaaa gactagcgga agctctgcct ctttccccag tggcgaggga actcggggcg 
attggctggg aactgtatcc acccaaatgt caccgatttc ttcct atg cag gaa atg 

Met Gin Glu Met 
1 

age aga ccc ate aat aag aaa ttt etc age ctg gee gaa aat ggt tgg 
Ser Arg Pro lie Asn Lys Lys Phe Leu Ser Leu Ala Glu Asn Gly Trp 
5 10 15 20 

ccc cac gaa gec acg aca act gga ggc aaa gag ggt tgc tea acg ccc 
Pro His Glu Ala Thr Thr Thr Gly Gly Lys Glu Gly Cys Ser Thr Pro 

25 30 35 

cgc etc att gga aaa cca aat cag ate tgg gac eta tat age gtg gcg 
Arg Leu lie Gly Lys Pro Asn Gin lie Trp Asp Leu Tyr Ser Val Ala 

40 ~ 45 50 

gag gcg ggg cga tgattgtcgc gctcgcaccc actgeagctg cgcacagtcg 
Glu Ala Gly Arg 
55 

catttctttc cccgcccctg agaccctgca gcaccatctg teatggegge tgggctgttt 
ggtttgagcg ctcgccgtct tttggcggca gcggcgacgc gagggctccc ggccgcccgc 
gtccgctggg aatctagctt ctccaggact gtggtcgccc cgtccgctgt ggeggaaage 
ggcccccaga accgaccaca ccgtgcaaga ggacccagaa cccgaggacg aaaacttgta 
tgagaagaac ccagactccc atggttatga caaggacccc gttttggacg tctggaacat 
gcgacttgtc ttcttctttg gcgtctccat catcctggtc cttggcagca cctttgtggc 
ctatctgcct gactacagga tgaaagagtg ggt 



60 
120 
177 



225 



273 



321 



373 



433 
4 93 
553 
613 
673 
733 
766 



<210> 653 
<211> 488 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 199. . 357 



<400> 653 

cagtaggata catagctttt cttccagtga aacaaagttc atatcatcca ttgtttttca 
ageaegtgae accagcctca aagtaaatga catgaccagt ggttgaacag tctaattttc 
aaatttaata tagagcatat aacttctgat ttgatagtat ttattttaaa aaattatgtt 
ttcatcattc atttgaaa atg aaa aag ccc caa agt gag aac ttt ggg gga 

Met Lys Lys Pro Gin Ser Glu Asn Phe Gly Gly 
15 10 
ggg cct aga aca tgg ata gat etc tta gtg gtc ttt cca aaa gta cat 
Gly Pro Arg Thr Trp lie Asp Leu Leu Val Val Phe Pro Lys Val His 

15 20 25 

gta ctt gaa ata ttt tea tta tea tac tat tct ttg aaa aaa aag atg 
Val Leu Glu lie Phe Ser Leu Ser Tyr Tyr Ser Leu Lys Lys Lys Met 

30 35 40 

ctt act gta tac ttg ttt tea age ate etc taaaatcaaa ggttttgatc 
Leu Thr Val Tyr Leu Phe Ser Ser lie Leu 

45 50 
acaatatgea gatttctctt gatagatact taaataggct atttctctcc tcttcttggg 
caatgecttg ttttctcctc tgaatatttg catttgaaag gattgettec t 



60 
120 
180 
231 



279 



327 



377 



437 
488 



<210> 654 
<211> 489 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 



WO 02/083898 PCT/IB01/00914 
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<222> 253. . 426 
<400> 654 

aacctcacga ggccttacgt gcctagagcg gttcctctcc gtggcgccct accgccttcg 60 

gcgcgaaggt ggctggtgcg taccgtcccg aaccctagga gccgagatcc cacccgcagg 120 

accccaaaac ccaatgatcc tgcagcagcc cttgcagcga ggcccccagg gaggggccag 180 

cgcctcccgc gggccgcctt gggggtgact tggggcctgg acgccagctc csctctscga 240 
ggagctgtgc cc atg age acc aag egg cgc ctg gag gag gag cag gag cct 291 

Met Ser Thr Lys Arg Arg Leu Glu Glu Glu Gin Glu Pro 

1 5 10 

ctg cgc aag cag ttt ctg tct gag gag aac atg gec acc cac ttc tct 339 

Leu Arg Lys Gin Phe Leu Ser Glu Glu Asn Met Ala Thr His Phe Ser 

15 " 20 25 

caa etc age ctg cac aat gac caa ccc eta ctg cag ccc cac cat gac 387 

Gin Leu Ser Leu His Asn Asp Gin Pro Leu Leu Gin Pro His His Asp 
30 35 40 45 

ctt etc ccc age cct gee ccc act cag gag ccc ttg etc tgagctgett 436 

Leu Leu Pro Ser Pro Ala Pro Thr Gin Glu Pro Leu Leu 

50 55 

ctctggcgct atcctggcag cctcatcctg aggcctccgt ctgetgagge tgg 489 

<210> 655 
<211> 616 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 205. .387 

<400> 655 

acttcegggg gageggegeg gcggcgggga ggatctctca ccccatcact cagggtggcg 60 
caatcacgac tcattggctc actgcagcct agacctccca gctggagcaa ttctcctgcc 120 
tcagccttct gagtagctgg gactacagtt ggttctaaag agtggtgagt cagaagagac 180 
gtcaggcagc aagcgacttg ggee atg gee tct gac eta gac ttc tea cct 231 

Met Ala Ser Asp Leu Asp Phe Ser Pro 

1 5 

ccg gag tgc ccg age cca ctt tec tgg aga acc tgc tac ggt acg gac 279 

Pro Glu Cys Pro Ser Pro Leu Ser Trp Arg Thr Cys Tyr Gly Thr Asp 

10 15 20 25 

tct tec tgg gag cca tct tec age tea tct gtg tgc tgg cca tea teg 327 

Ser Ser Trp Glu Pro Ser Ser Ser Ser Ser Val Cys Trp Pro Ser Ser 

30 35 40 

tac cca ttc cca agt ccc acg agg egg agg ctg aac cgt ctg age cca 375 
Tyr Pro Phe Pro Ser Pro Thr Arg Arg Arg Leu Asn Arg Leu Ser Pro 

45 50 55 

gaa gtg ctg agg tgacgaggaa gcccaaggct gctgttcctt ctgtgaacaa 427 
Glu Val Leu Arg 
60 

gaggeccaag aaagagacta agaagaagcg gtagaagagg aggectgagg actgggcgkg 487 
cagggagagg gtcttkgggg acagcctcct gggaatctac attgtgtttc ccccgcatty 547 
ccaggctcag gttctgarga rgctgtgacg ccctatgacc gcagagatct agacagtcgt 607 
aacagtccc 616 

<210> 656 

<211> 508 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 37 . . 219 



WO 02/083898 PCT/IB01/00914 
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<400> 656 

ctttcccctg ccggtgtcct gctcgccgtc cccgcc atg ctg tct eta gac ttt 54 

Met Leu Ser Leu Asp Phe 



5 



gat 


gtg 


egg 


egg 


atg 


aac 


aag 


egg 


cag 


etc 


tat 


tat 


caa 


gtc 


Asp 


Val 


Arg 


Arg 


Met 


Asn 


Lys 


Arg 


Gin 


Leu 


Tyr 


Tyr 


Gin 


Val 




10 










15 










20 






ttt 


gga 


atg 


att 


gtc 


tea 


teg 


gca 


eta 


atg 


ate 


tgg 


aag 


ggg 


Phe 


Gly 


Met 


He 


Val 


Ser 


Ser 


Ala 


Leu 


Met 


He 


Trp 


Lys 


Gly 


25 










30 










35 








gta 


ata 


act 


gga 


agt 


gaa 


agt 


ccg 


att 


gta 


gtg 


gtc 


tea 


ggc 


Val 


He 


Thr 


Gly 


Ser 


Glu 


Ser 


Pro 


He 


Val 


Val 


Val 


Ser 


Gly 



102 



eta aat ttt gga atg att gtc tea teg gca eta atg ate tgg aag ggg 150 
Leu Asn 

tta atg gta ata act gga agt gaa agt ccg att gta gtg gtc tea ggc 198 
Leu Met 

40 45 50 

aaa atg ggc ata tea agt ttt tgaccaaagg agataataat gcggttgatg 249 
Lys Met Gly He Ser Ser Phe 
55 60 

accgaggcct ctataaacaa ggacaacatt ggctagagaa aaaagatgtt gtggggagag 309 
ccaggggatt tgttccttat attggaattg tgacgatcct catgaatgac tatcctaaat 369 
ttaagtatgc agttctcttt ttgctgggtt tattegtget ggttcatcgt gagtaagaag 429 
cctgccttgc tgttcctggg aagatgecat agttttcgtt actggatgtt tggagtagat 489 
actggtctgt gattggtgg 508 



<210> 657 
<211> 500 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 114 . .452 



<400> 657 

ttttcccctg ccggtgtcct gctcgccgtc cccgccatgc tgtctctaga ctttttggac 60 
gatgtgcggc ggatgaacaa geggcastet attatcaagt cctaaatttt gga atg 116 

Met 



att 


gtc 


tea 


teg 


gca 


eta 


atg 


ate 


tgg 


aag 


ggg 


tta 


atg 


gta 


ata 


act 


164 


He 


Val 


Ser 


Ser 
5 


Ala 


Leu 


Met 


He 


Trp 
10 


Lys 


Gly 


Leu 


Met 


Val 
15 


He 


Thr 




gga 


agt 


gaa 


agt 


ccg 


att 


gta 


gtg 


gtg 


etc 


agt 


ggc 


age 


atg 


gaa 


cct 


212 


Gly 


Ser 


Glu 

20 


Ser 


Pro 


He 


Val 


Val 
25 


Val 


Leu 


Ser 


Gly 


Ser 
30 


Met 


Glu 


Pro 




gca 


ttt 


cat 


aga 


gga 


gat 


ctt 


etc 


ttt 


eta 


aca 


aat 


cga 


gtt 


gaa 


gat 


260 


Ala 


Phe 
35 


His 


Arg 


Gly 


Asp 


Leu 
40 


Leu 


Phe 


Leu 


Thr 


Asn 
45 


Arg 


Val 


Glu 


Asp 




ccc 


ata 


cga 


gtg 


gga 


gaa 


att 


gtt 


gtt 


ttt 


agg 


ata 


gaa 


gga 


aga 


gag 


308 


Pro 


He 


Arg 


Val 


Gly 


Glu 


He 


Val 


Val 


Phe 


Arg 


He 


Glu 


Gly 


Arg 


Glu 




50 










55 










60 










65 




att 


cct 


ata 


gtt 


cac 


cga 


gtc 


ttg 


aag 


att 


cat 


gaa 


aaa 


ttt 


gtt 


cct 


356 


He 


Pro 


He 


Val 


His 
70 


Arg 


Val 


Leu 


Lys 


He 
75 


His 


Glu 


Lys 


Phe 


Val 
80 


Pro 




tat 


att 


gga 


att 


gtg 


acg 


ate 


etc 


atg 


aat 


gac 


tat 


cct 


aaa 


ttt 


aag 


404 


Tyr 


He 


Gly 


He 
85 


Val 


Thr 


He 


Leu 


Met 
90 


Asn 


Asp 


Tyr 


Pro 


Lys 
95 


Phe 


Lys 




tat 


gca 


gtt 


etc 


ttt 


ttg 


ctg 


ggt 


tta 


ttc 


gtg 


ctg 


gtt 


cat 


cgt 


gag 


452 


Tyr 


Ala 


Val 
100 


Leu 


Phe 


Leu 


Leu 


Gly 
105 


Leu 


Phe 


Val 


Leu 


Val 
110 


His 


Arg 


Glu 





taagaagect gecttgetgt tcctggggaa gatgecatag ttttcgtt 500 



<210> 658 
<211> 411 
<212> DNA 
<213> Homo sapiens 
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<220> 
<221> CDS 
<222> 90. .305 



<220> 

<221> misc_f eature 

<222> 138,315,326,343,360,377 

<223> n=a, g, c or t 



<400> 658 

ttgggcagaa aaattcaagc aggagattgt atttcttttg gagttgtacg atttccttta 60 
ttatttgaac tgcagtaaag aaagctggg atg ggc tec tct agg gat act tec 113 



















Met 
1 


Gly 


Ser 


Ser 


Arg 
5 


Asp 


Thr 


Ser 




aga 


tec 


ctg 


ggc 


ggt 


tgt 


age 


cct 


ngc 


tec 


tct 


tta 


aat 


gga 


ttt 


ggt 


161 


Arg 


Ser 


Leu 


Gly 


Gly 


Cys 


Ser 


Pro 


Xaa 


Ser 


Ser 


Leu 


Asn 


Gly 


Phe 


Gly 






10 










15 










20 












ttc 


aaa 


gac 


gat 


cat 


etc 


cgt 


ctt 


etc 


gga 


tgt 


cat 


agt 


gee 


act 


gat 


209 


Phe 


Lys 


Asp 


Asp 


His 


Leu 


Arg 


Leu 


Leu 


Gly 


Cys 


His 


Ser 


Ala 


Thr 


Asp 




25 










30 










35 










40 




cat 


etc 


cag 


etc 


ctk 


gee 


ace 


ctg 


ggc 


ttt 


etc 


cac 


ttt 


tgc 


etc 


tat 


257 


His 


Leu 


Gin 


Leu 


Leu 


Ala 


Thr 


Leu 


Gly 


Phe 


Leu 


His 


Phe 


Cys 


Leu 


Tyr 












45 










50 










55 






gtt 


ttg 


ctt 


etc 


cam 


cgt 


ctt 


age 


cac 


gat 


ate 


tac 


etc 


tct 


gtc 


atg 


305 


Val 


Leu 


Leu 


Leu 


Xaa 


Arg 


Leu 


Ser 


His 


Asp 


He 


Tyr 


Leu 


Ser 


Val 


Met 










60 










65 










70 









tgatgtgacn cttgtttttt naaccaagag ttgecctnaa gstcccttaa aaaanagctg 365 
atcttactgg snttcttttg taaarctcct gtgatagatg cagatt 411 



<210> 659 
<211> 529 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 235. .444 



<400> 659 

atttccagcg gttgctggtt ctgacgggtt gtagtctgee aggacaatga gttatgacta 60 
ccatcagaac tggggccgtg atgggggtcc ccgcagctcc ggtgggggct atggaggggg 120 
gecagcaggg ggtcatggag gtaaccgagg etceggagga greggeggeg gcggaggggg 180 
tggtcgaggc ggcaggggee ggcatcccgg gcacctgaaa gecgegaaat egge atg 237 

Met 
1 



tgg 


tac 


gcg 


aaa 


aaa 


cag 


ggg 


cag 


aag 


aac 


aag 


gaa 


gcg 


gag 


agg 


caa 


285 


Trp 


Tyr 


Ala 


Lys 


Lys 


Gin 


Gly 


Gin 


Lys 


Asn 


Lys 


Glu 


Ala 


Glu 


Arg 


Gin 










5 










10 










15 








gag 


aga 


get 


gta 


gta 


cac 


atg 


gat 


gaa 


cga 


cga 


gaa 


gaa 


caa 


att 


gta 


333 


Glu 


Arg 


Ala 


Val 


Val 


His 


Met 


Asp 


Glu 


Arg 


Arg 


Glu 


Glu 


Gin 


He 


Val 








20 










25 










30 










cag 


tta 


ctg 


aat 


tct 


gtt 


caa 


gcg 


aag 


aat 


gat 


aaa 


gag 


tea 


gaa 


gca 


381 


Gin 


Leu 


Leu 


Asn 


Ser 


Val 


Gin 


Ala 


Lys 


Asn 


Asp 


Lys 


Glu 


Ser 


Glu 


Ala 






35 










40 










45 












cag 


ata 


tec 


ace 


cca 


gac 


ttg 


get 


gtc 


cgc 


tct 


gcg 


cct 


teg 


ggc 


cag 


429 


Gin 


He 


Ser 


Thr 


Pro 


Asp 


Leu 


Ala 


Val 


Arg 


Ser 


Ala 


Pro 


Ser 


Gly 


Gin 




50 










55 










60 










65 





ccc ttt egg get tgc tgagcctgga ggecgakgag aatgeactte cgggttttgc 484 
Pro Phe Arg Ala Cys 
70 

tgaggctctg aggagctacc aggaggctgc ggctgeagge acctt 529 
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<210> 660 
<211> 478 
<212> DNA 
<213> Homo sapiens 

<220> 
<22l> CDS 
<222> 128. .286 



<400> 660 

gttctttggc cctgtgacac gtagcaacgg ggctggttca gggtctgaaa cagagtttgg 60 
gggttgtttg ggattagtga agctactgcc tttgccgcca gcgcagctca gagtttgatt 120 
atttgca atg tea ggc ttt gaa aac tta aac acg gat ttc tac cag aca 169 

Met Ser Gly Phe Glu Asn Leu Asn Thr Asp Phe Tyr Gin Thr 

15 10 
agt tac age ate gat gat cag tea cag cag tec tat gat tat gga gga 217 
Ser Tyr Ser lie Asp Asp Gin Ser Gin Gin Ser Tyr Asp Tyr Gly Gly 
15 20 25 30 

agt gga gga ccc tat aac agt atg ctg get atg act att cgc age aag 265 
Ser Gly Gly Pro Tyr Asn Ser Met Leu Ala Met Thr lie Arg Ser Lys 

^35 4 0 4 5 

gca gat ttg tec etc cag aca tgatgeagee acaacagcca tacaccgggc 316 
Ala Asp Leu Ser Leu Gin Thr 
50 

agatttacca gccaactcag gcatatactc cagcttcacc tcagcctttc tatggaaaca 376 
actttgagga tgagccacct ttattagaag agttaggtat caattttgac cacatctggc 436 
aaaaaacact aacagtatta catccgttaa aagtagcaga tg 478 



<210> 661 
<211> 456 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 110. .277 



<400> 661 

tacggtttta ttattatatc gatttatttg tgaggaggga atctgttcat gtagatttta 60 
aaagtgcaag agtgtccagc gtcatctgcc tctcctgtcc cctgaccac atg att ttc 118 

Met lie Phe 
1 

166 



214 



tgt tec 


cat 


aaa 


eta 


gca 


ctg 


ttg 


agt 


ata 


ttt 


cca 


gag 


cat 


act 


cca 


Cys Ser 


His 


Lys 


Leu 


Ala 


Leu 


Leu 


Ser 


He 


Phe 


Pro 


Glu 


His 


Thr 


Pro 


5 










10 










15 










ggc gtg 


tgt 


aaa 


tac 


age 


ctt 


cca 


etc 


ctg 


cac 


aca 


cac 


cgt 


gtt 


ttt 


Gly Val 


Cys 


Lys 


Tyr 


Ser 


Leu 


Pro 


Leu 


Leu 


His 


Thr 


His 


Arg 


Val 


Phe 


20 








25 










30 










35 


aca tea 


gta 


aca 


ttc 


tgc 


gca 


tat 


tgt 


tea 


gca 


cca 


tgc 


ctt 


ttc 


cac 


Thr Ser 


Val 


Thr 


Phe 
40 


Cys 


Ala 


Tyr 


Cys 


Ser 
45 


Ala 


Pro 


Cys 


Leu 


Phe 
50 


His 



262 



tta aca gta ata att taaaggtcat tccatttcag tacataaaga gcgtcctcag 317 
Leu Thr Val He He 
55 

gccgggcgcg gtggctcgca cctgtaatcc tagcactttg ggaggctgag gcaggtggat 377 

cacctgaggt tggaagttga agaccagcct ggecaaaagg gcgaaaccct gtctctactg 437 

aaaatagaaa aattagctc 456 



<210> 662 
<211> 506 
<212> DNA 
<213> Homo sapiens 
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<220> 
<221> CDS 
<222> 71. .379 



<400> 662 

acagctgccc gggactccag tgatcgccgc ggctcgctcg cgccccggaa actgcccctt 60 
ctcgggggtc atg atg ggc age aag atg gcg tct get agt agg gtc gtt 109 

Met Met Gly Ser Lys Met Ala Ser Ala Ser Arg Val Val 

15 10 

157 



205 



cag 


gta 


gtc 


aaa 


cca 


cac 


act 


cca 


tta 


ata 


agg 


ttt 


cct 


gac 


aga 


aga 


Gin 


Val 


Val 


Lys 


Pro 


His 


Thr 


Pro 


Leu 


He 


Arg 


Phe 


Pro 


Asp 


Arg 


Arg 




15 










20 










25 










gac 


aat 


cct 


aaa 


ccc 


aat 


gta 


tea 


gaa 


get 


ttg 


aga 


tea 


gca 


ggg 


eta 


Asp 


Asn 


Pro 


Lys 


Pro 


Asn 


Val 


Ser 


Glu 


Ala 


Leu 


Arg 


Ser 


Ala 


Gly 


Leu 


30 










35 










40 










45 


cca 


tct 


cac 


tct 


tct 


gta 


att 


tea 


caa 


cat 


tct 


aaa 


gga 


agt 


aaa 


tea 


Pro 


Ser 


His 


Ser 


Ser 


Val 


He 


Ser 


Gin 


His 


Ser 


Lys 


Gly 


Ser 


Lys 


Ser 










50 










55 










60 




cca 


gat 


ttg 


ctg 


atg 


tat 


cag 


ggt 


cca 


cca 


gac 


act 


gca 


gaa 


ata 


ata 


Pro 


Asp 


Leu 


Leu 


Met 


Tyr 


Gin 


Gly 


Pro 


Pro 


Asp 


Thr 


Ala 


Glu 


He 


He 








65 










70 










75 






aaa 


aca 


tta 


cct 


cag 


aaa 


tac 


aga 


agg 


aaa 


ctt 


gtg 


tct 


caa 


gaa 


gaa 


Lys 


Thr 


Leu 


Pro 


Gin 


Lys 


Tyr 


Arg 


Arg 


Lys 


Leu 


Val 


Ser 


Gin 


Glu 


Glu 






80 










85 










90 








atg 


gaa 


ttt 


ate 


caa 


cgt 


gga 


ggt 


cct 


gaa 


taaccatggt ggctgctgtt 


Met 


Glu 


Phe 


He 


Gin 


Arg 


Gly 


Gly 


Pro 


Glu 
















95 










100 





















253 



301 



349 



399 



tgtcatcaga caatagaatt gtctttacaa taaaggactt ccaaaatgac agatgagaaa 459 
ctgtatatta aacaccttta ataaatatta tgaaaaaaam haawaaa 506 



<210> 663 
<211> 595 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 207 . .578 



<220> 

<221> misc_f eature 
<222> 19 

<223> n=a, g, c or t 



<400> 663 

atctcgcgtc cccaacggnc ccccgggtcg gtttccgegg tggecatgae tgcggccgtg 60 
ttcttegget gegcttcatt gccttcgggc ctgcgctcgc cctttatgtc ttcaccatcg 120 
ccaycgagcc gttgegtate atcttcctca tcgccggagc tttcttctgg ttggtgtctc 180 
tactgatttc gtcccttgtt tggttc atg gca aga gtc att att gac aac aaa 233 

Met Ala Arg Val He He Asp Asn Lys 

1 5 

gat gga cca aca cag aaa tat ctg ctg ate ttt gga gcg ttt gtc tct 281 
Asp Gly Pro Thr Gin Lys Tyr Leu Leu He Phe Gly Ala Phe Val Ser 
10 15 20 25 

329 



377 



gtc 


tat 


ate 


caa 


gaa 


atg 


ttc 


cga 


ttt 


gca 


tat 


tat 


aaa 


etc 


tta 


aaa 


Val 


Tyr 


He 


Gin 


Glu 


Met 


Phe 


Arg 


Phe 


Ala 


Tyr 


Tyr 


Lys 


Leu 


Leu 


Lys 










30 










35 










40 




aaa 


gee 


agt 


gaa 


ggt 


ttg 


aag 


agt 


ata 


aac 


cca 


ggt 


gag 


aca 


gca 


ccc 


Lys 


Ala 


Ser 


Glu 


Gly 


Leu 


Lys 


Ser 


He 


Asn 


Pro 


Gly 


Glu 


Thr 


Ala 


Pro 






45 










50 










55 






tct 


atg 


cga 


ctg 


ctg 


gee 


tat 


gtt 


tct 


ggc 


ttg 


ggc 


ttt 


gga 


ate 


atg 


Ser 


Met 


Arg 


Leu 


Leu 


Ala 


Tyr 


Val 


Ser 


Gly 


Leu 


Gly 


Phe 


Gly 


He 


Met 






60 










65 










70 









425 
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agt gga gta ttt tec ttt gtg aat acc eta tct gac tec ttg ggg cca 473 

Ser Gly Val Phe Ser Phe Val Asn Thr Leu Ser Asp Ser Leu Gly Pro 

75 80 85 

ggc aca gtg ggc att cat gga gat tct cct caa ttc ttc ctt tta ttc 521 

Gly Thr Val Gly lie His Gly Asp Ser Pro Gin Phe Phe Leu Leu Phe 

90 95 100 105 

age ttt cat gac get ggt cat tat ctt get gca tgt att ctg ggg cat 569 

Ser Phe His Asp Ala Gly His Tyr Leu Ala Ala Cys lie Leu Gly His 

110 115 120 

tgt att ttt tgatggctgt gagaaga 595 
Cys lie Phe 

<210> 664 
<211> 487 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 41. .226 



<400> 664 

gataeggtte ctccaccgag gcccatgcga agtttccact atg get tec age act 55 

Met Ala Ser Ser Thr 
1 5 



gtc 


ccg 


gtg 


age 


get 


get 


ggc 


teg 


get 


aat 


gaa 


act 


ccc 


gaa 


ata 


ccg 


103 


Val 


Pro 


Val 


Ser 


Ala 
10 


Ala 


Gly 


Ser 


Ala 


Asn 
15 


Glu 


Thr 


Pro 


Glu 


He 
20 


Pro 




gac 


aac 


gtg 


gga 


gat 


tgg 


ctt 


egg 


ggc 


gtc 


tac 


cct 


ttg 


cca 


ctg 


ata 


151 


Asp 


Asn 


Val 


Gly 
25 


Asp 


Trp 


Leu 


Arg 


Gly 
30 


Val 


Tyr 


Pro 


Leu 


Pro 

35 


Leu 


He 




gga 


atg 


act 


tec 


gga 


gaa 


ctt 


gat 


act 


aaa 


ttt 


ggg 


act 


ctt 


tgc 


tgc 


199 


Gly 


Met 


Thr 
40 


Ser 


Gly 


Glu 


Leu 


Asp 
45 


Thr 


Lys 


Phe 


Gly 


Thr 
50 


Leu 


Cys 


Cys 




ggg 


agt 


ttg 


get 


ggc 


cag 


gaa 


ctt 


gag 


tgacattgac ■ 


ctcatggcac 




246 


Gly 


Ser 
55 


Leu 


Ala 


Gly 


Gin 


Glu 
60 


Leu 


Glu 



















ctcagccagg ggtgtagcca agtagacaaa tggaatcctg tgctgaaccc gaatcttcca 306 

aaaaacagee tacaatctgt gaccaccaca agatgtgccc tgatggcagc tgaagtttga 366 

ttcagatggg cacttttctt ccccttccct gectagttte cttttgttcc ttgagtccac 426 

gcagaattcc attctctggt cagcagacag gcttaagcta aagtattgea tctattctgt 486 

a ~ 487 



<210> 665 
<211> 415 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 54 . .248 

<400> 665 

acttctgacc tgcggcccgt agtccgagcc ggggacggcg gcgtcggtgg gtc atg 56 

Met 
1 

etc egg cac tec ccc teg ctg tgg gag ctg gtg gag gag cac gtt ccg 104 
Leu Arg His Ser Pro Ser Leu Trp Glu Leu Val Glu Glu His Val Pro 

5 10 15 

etc egg gag cga cgc gaa gtg aag agg att ctg ggg gga ggc ggc ggt 152 
Leu Arg Glu Arg Arg Glu Val Lys Arg lie Leu Gly Gly Gly Gly Gly 

20 25 30 

gga cct gag cct gga get gcg ggc gga ggt ggc gat gtt acg ggc act 200 
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Gly Pro Glu Pro Gly Ala Ala Gly Gly Gly Gly Asp Val Thr Gly Thr 

35 40 45 

get cca aga ggc teg ate etc tea age ccc cag etc ccg ccc cat etc 248 
Ala Pro Arg Gly Ser lie Leu Ser Ser Pro Gin Leu Pro Pro His Leu 
50 " ^ 55 60 65 

tgacccctct tctcttctgg caccaccgcc tctcctaaag gacctcttgc gecaggaget 308 
ccggcagttg ctccagggtc tccgccacaa agecatctgt gagggcaggg accaggscca 368 
agcttgggtc cagtatagee cccarggtcc tgcactttgc cttggag 415 

<210> 666 
<211> 487 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 13. .432 

<400> 666 

agtgaggaaa tc atg ggt cag aat cga gag atg ctt ccg ttt tgg atg aac 51 

Met Gly Gin Asn Arg Glu Met Leu Pro Phe Trp Met Asn 

1 J 5 10 



age 


ace 


ggc 


agg 


egg 


gaa 


ggt 


tgg 


cag 


cgt 


gga 


tgg 


cac 


ggc 


tac 


gac 


99 


Ser 


Thr 


Gly 


Arg 


Arg 


Glu 


Gly 


Trp 


Gin 


Arg 


Gly 


Trp 


His 


Gly 


Tyr 


Asp 






15 










20 










25 












aac 


gag 


etc 


atg 


gac 


atg 


egg 


ggc 


ate 


ttc 


ctg 


gee 


ttc 


gga 


cct 


gat 


147 


Asn 


Glu 


Leu 


Met 


Asp 


Met 


Arg 


Gly 


He 


Phe 


Leu 


Ala 


Phe 


Gly 


Pro 


Asp 




30 










35 










40 










45 




ttc 


aaa 


tec 


aac 


ttc 


aga 


get 


get 


cct 


ate 


agg 


teg 


gtg 


gac 


gtc 


tac 


195 


Phe 


Lys 


Ser 


Asn 


Phe 


Arg 


Ala 


Ala 


Pro 


He 


Arg 


Ser 


Val 


Asp 


Val 


Tyr 












50 










55 










60 






aat 


gtc 


atg 


tgc 


aat 


gtg 


gtg 


ggc 


ate 


acc 


ccg 


ctg 


ccc 


aac 


aac 


gga 


243 


Asn 


Val 


Met 


Cys 


Asn 


Val 


Val 


Gly 


He 


Thr 


Pro 


Leu 


Pro 


Asn 


Asn 


Gly 










65 










70 










75 








tec 


tgg 


tec 


agg 


gtg 


atg 


tgc 


atg 


ctg 


aag 


ggc 


cgc 


gcg 


gca 


ctg 


ccc 


291 


Ser 


Trp 


Ser 


Arg 


Val 


Met 


Cys 


Met 


Leu 


Lys 


Gly 


Arg 


Ala 


Ala 


Leu 


Pro 








80 










85 










90 










cgc 


ctg 


tct 


ggc 


cca 


gee 


act 


gtg 


ccc 


tgg 


cat 


gat 


tct 


tct 


ctt 


cct 


339 


Arg 


Leu 


Ser 


Gly 


Pro 


Ala 


Thr 


Val 


Pro 


Trp 


His 


Asp 


Ser 


Ser 


Leu 


Pro 






95 










100 










105 












get 


tgc 


ata 


act 


gat 


cat 


att 


get 


tgt 


etc 


aga 


aaa 


aaa 


cac 


cat 


cag 


387 


Ala 


Cys 


He 


Thr 


Asp 


His 


He 


Ala 


Cys 


Leu 


Arg 


Lys 


Lys 


His 


His 


Gin 




110 










115 










120 










125 




caa 


agt 


ggg 


cct 


cca 


aag 


cca 


gat 


gat 


ttt 


cat 


ttt 


atg 


tgt 


gaa 




432 


Gin 


Ser 


Gly 


Pro 


Pro 


Lys 


Pro 


Asp 


Asp 


Phe 


His 


Phe 


Met 


Cys 


Glu 







130 135 140 



taatagcttc attaacacaa tcaagaccat gcacattgta aatacattat tcttg 487 

<210> 667 
<211> 476 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 33. .218 

<400> 667 

gaacaagcgt ccctgatcca gaaggtgttc ag atg gag atg gcg agt tct get 53 

Met Glu Met Ala Ser Ser Ala 
1 5 

ggc tec tgg etc tct ggc tgc etc ate cct etc gtc ttc etc egg ctg 101 
Gly Ser Trp Leu Ser Gly Cys Leu He Pro Leu Val Phe Leu Arg Leu 
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366 

10 15 20 

tct gtg cat gtg tea ggc cac gca ggg gat gec ggc aag ttc cac gtg 14 9 

Ser Val His Val Ser Gly His Ala Gly Asp Ala Gly Lys Phe His Val 

25 30 35 

gec eta eta ggg ggc aca gee gag ctg etc tgc cct etc tec etc tgg 197 
Ala Leu Leu Gly Gly Thr Ala Glu Leu Leu Cys Pro Leu Ser Leu Trp 
40 45 50 55 

ccc ggg acg tac cca agg agg tgaggtggct gcggtcccca ttcccgcagc 248 
Pro Gly Thr Tyr Pro Arg Arg 
60 

gctcccaggc tgttcacata ttccgggatg ggaaggacca ggatgaagat ctgatgeegg 308 
aatataaggg gaggaeggtg ctagtgagag atgeccaaga gggaagtgtc actctgeaga 368 
tecttgaegt gegecttgag gaccaagggt cttaccgatg tctgatccaa gttggaaatc 428 
tgagtaaaga ggacaccgtg atectgeagg ttgcagcccc atctgtgg 476 



<210> 668 
<211> 475 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 90. .431 



<400> 668 

gtattatgat gtcgtataat gtaacattct gaaaatacaa catgtccacc taaatttcaa 60 



agggctgaaa aaaatttctt gaaagatga 


atg 


gac 


att 


gtg 


gee 


cat 


tgt 


gag 


113 


















Met 
1 


Asp 


He 


Val 


Ala 
5 


His 


Cys 


Glu 




aaa 


gec 


atg 


gca 


gag 


tea 


gtt 


aag 


aaa 


cat 


cag 


ttt 


age 


act 


tct 


gaa 


161 


Lys 


Ala 


Met 


Ala 


Glu 


Ser 


Val 


Lys 


Lys 


His 


Gin 


Phe 


Ser 


Thr 


Ser 


Glu 






10 










15 










20 












tct 


tat 


ctt 


cca 


gca 


act 


tta 


ata 


ctg 


aac 


age 


ata 


tec 


ttt 


aat 


act 


209 


Ser 


Tyr 


Leu 


Pro 


Ala 


Thr 


Leu 


He 


Leu 


Asn 


Ser 


He 


Ser 


Phe 


Asn 


Thr 




25 










30 










35 










40 




gat 


ggc 


tec 


ata 


cag 


tat 


aaa 


gaa 


aag 


ctg 


tat 


ttc 


tct 


gee 


tec 


gaa 


257 


Asp 


Gly 


Ser 


He 


Gin 


Tyr 


Lys 


Glu 


Lys 


Leu 


Tyr 


Phe 


Ser 


Ala 


Ser 


Glu 












45 










50 










55 






gee 


eta 


gac 


get 


tac 


att 


gat 


gat 


ttt 


cac 


tta 


aac 


tat 


gag 


cct 


cct 


305 


Ala 


Leu 


Asp 


Ala 


Tyr 


He 


Asp 


Asp 


Phe 


His 


Leu 


Asn 


Tyr 


Glu 


Pro 


Pro 










60 










65 










70 








gat 


att 


gat 


aca 


aag 


gtt 


aac 


ctg 


gat 


cag 


agt 


cct 


ctt 


gaa 


ttt 


ctt 


353 


Asp 


He 


Asp 


Thr 


Lys 


Val 


Asn 


Leu 


Asp 


Gin 


Ser 


Pro 


Leu 


Glu 


Phe 


Leu 








75 










80 










85 










get 


aag 


tea 


aat 


agt 


ggg 


gtc 


cat 


gga 


ate 


gca 


cca 


ggc 


caa 


agt 


ggc 


401 


Ala 


Lys 


Ser 


Asn 


Ser 


Gly 


Val 


His 


Gly 


He 


Ala 


Pro 


Gly 


Gin 


Ser 


Gly 






90 










95 










100 












atg 


gag 


aag 


ate 


ata 


cag 


tgg 


tat 


get 


gca 


taacaaggtt tctgtcagca 


451 


Met 


Glu 


Lys 


He 


He 


Gin 


Trp 


Tyr 


Ala 


Ala 

















105 110 

acaaattget tatatgatgg tggc 475 



<210> 669 
<211> 578 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 411. . 575 



<400> 669 

gagcaagccc tggtggcagc gcagggtcca gtgcagcccc tccccacagc atgctggggg 



60 
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ctaattctga tgtcatcttt ctgcagaaaa ccattagacc atccctccag actgccaccc 120 

tcaaagccgt ctgcccaggc cccatctgac actcttgaca tctgcaggtc ccagacccta 180 

tgatgtgtcc actctggagg ctcctcatct tcctcgggtt gctggccttg cccttggcac 240 

cacacaagca gccttggcct ggcctggccc aagcccacag agacaacaaa tccaccctgg 300 

caagaagtcc ttcgataaca acatcgtaaa gatgtgtgca catatgagca tcgttgtgga 360 

gttctggctg gagaaagacg agtttggccg gaggatctgg tgataggcaa atg cga 416 

Met Arg 
1 



tgc 


aga 


ccc 


age 


agt 


gtc 


cat 


gtg 


gee 


ate 


etc 


act 


gar 


get 


ate 


cca 


464 


Cys 


Arg 


Pro 
5 


Ser 


Ser 


Val 


His 


Val 
10 


Ala 


He 


Leu 


Thr 


Glu 
15 


Ala 


He 


Pro 




cca 


aag 


atg 


ate 


agt 


ttc 


tct 


aca 


amc 


tea 


aag 


aga 


ate 


ttc 


aaa 


aak 


512 


Pro 


Lys 
20 


Met 


He 


Ser 


Phe 


Ser 
25 


Thr 


Xaa 


Ser 


Lys 


Arg 
30 


He 


Phe 


Lys 


Xaa 




tct 


cca 


a am 


atg 


kta 


aaa 


agt 


cag 


gta 


tgt 


cct 


ctg 


ate 


ggt 


gaa 


ate 


560 


Ser 


Pro 


Xaa 


Met 


Xaa 


Lys 


Ser 


Gin 


Val 


Cys 


Pro 


Leu 


He 


Gly 


Glu 


lie 




35 










40 










45 










50 




teg 


ggc 


age 


tgg 


atg 


tga 






















578 


Ser 


Gly 


Ser 


Trp 


Met 

55 



























<210> 670 
<211> 467 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 62. .430 



<400> 670 

atttceggtt ccggtgtcag ttcgaggege cgccgccgcc gccgcagccg ccggagccgc 60 
a atg cct aaa gga gga aga aag gga ggc cac aaa ggc egg gcg agg cag 109 
Met Pro Lys Gly Gly Arg Lys Gly Gly His Lys Gly Arg Ala Arg Gin 
1 5 10 15 



tat 


aca 


age 


cct 


gag 


gag 


ate 


gac 


gcg 


cas 


tgc 


agg 


ctg 


aga 


age 


aga 


157 


Tyr 


Thr 


Ser 


Pro 


Glu 


Glu 


He 


Asp 


Ala 


Xaa 


Cys 


Arg 


Leu 


Arg 


Ser 


Arg 










20 










25 










30 








agg 


cca 


ggg 


aag 


aag 


agg 


age 


aaa 


aaa 


ggt 


gga 


gat 


ggg 


get 


gca 


ggt 


205 


Arg 


Pro 


Gly 


Lys 


Lys 


Arg 


Ser 


Lys 


Lys 


Gly 


Gly 


Asp 


Gly 


Ala 


Ala 


Gly 








35 










40 










45 










gac 


ccc 


aaa 


aag 


gag 


aag 


aaa 


tct 


eta 


gac 


tea 


gat 


gag 


agt 


gag 


gat 


253 


Asp 


Pro 


Lys 


Lys 


Glu 


Lys 


Lys 


Ser 


Leu 


Asp 


Ser 


Asp 


Glu 


Ser 


Glu 


Asp 






50 










55 










60 












gaa 


gaa 


gat 


gac 


tac 


cag 


caa 


aag 


cgc 


aaa 


ggc 


gtt 


gaa 


ggg 


etc 


ate 


301 


Glu 


Glu 


Asp 


Asp 


Tyr 


Gin 


Gin 


Lys 


Arg 


Lys 


Gly 


Val 


Glu 


Gly 


Leu 


He 




65 










70 










75 










80 




gac 


ate 


gag 


aac 


ccc 


aac 


egg 


gtg 


gca 


cag 


aca 


ace 


aaa 


aag 


gtc 


aca 


349 


Asp 


He 


Glu 


Asn 


Pro 


Asn 


Arg 


Val 


Ala 


Gin 


Thr 


Thr 


Lys 


Lys 


Val 


Thr 










85 










90 










95 






caa 


ctg 


gat 


ctg 


gac 


ggg 


cca 


agg 


age 


ttt 


cga 


gga 


gag 


aac 


gag 


aag 


397 


Gin 


Leu 


Asp 


Leu 


Asp 


Gly 


Pro 


Arg 


Ser 


Phe 


Arg 


Gly 


Glu 


Asn 


Glu 


Lys 










100 










105 










110 








aga 


ttg 


aga 


age 


aga 


agg 


caa 


aag 


age 


gtt 


aca 


tgaaaatgea cttggccggg 


450 


Arg 


Leu 


Arg 


Ser 


Arg 


Arg 


Gin 


Lys 


Ser 


Val 


Thr 















115 120 



aagacagagc aagccaa 467 



<210> 671 
<211> 468 
<212> DNA 
<213> Homo sapiens 



WO 02/083898 
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368 

<220> 
<221> CDS 
<222> 63. .431 



<400> 671 

agaacatcct aatcrwaaaa gttcgaggcg ccgccgccgc cgccgcagcc gccggagccg 60 
ca atg cct aaa gga gga aga aag gga ggc cac aaa ggc egg gcg agg 107 
Met Pro Lys Gly Gly Arg Lys Gly Gly His Lys Gly Arg Ala Arg 
15 10 15 

cag tat aca age cct gag gag ate gac gcg cas tgc agg ctg aga age 155 
Gin Tyr Thr Ser Pro Glu Glu lie Asp Ala Xaa Cys Arg Leu Arg Ser 

20 25 30 

aga agg cca ggg aag aag agg age aaa aaa ggt gga gat ggg get gca 203 
Arg Arg Pro Gly Lys Lys Arg Ser Lys Lys Gly Gly Asp Gly Ala Ala 

35 4 0 4 5 

ggt gac ccc aaa aag gag aag aaa tct eta gac tea gat gag agt gag 251 
Gly Asp Pro Lys Lys Glu Lys Lys Ser Leu Asp Ser Asp Glu Ser Glu 

50 ~ 55 60 

gat gaa gaa gat gac tac cag caa aag cgc aaa ggc gtt gaa ggg etc 299 
Asp Glu Glu Asp Asp Tyr Gin Gin Lys Arg Lys Gly Val Glu Gly Leu 

65 70 75 

ate gac ate gag aac ccc aac egg gtg gca cag aca ace aaa aag gtc 347 
lie Asp lie Glu Asn Pro Asn Arg Val Ala Gin Thr Thr Lys Lys Val 
80 85 90 95 

aca caa ctg gat ctg gac ggg cca agg age ttt cga gga gag aac gag 395 
Thr Gin Leu Asp Leu Asp Gly Pro Arg Ser Phe Arg Gly Glu Asn Glu 

100 "* 105 110 

aag aga ttg aga age aga agg caa aag age gtt aca tgaaaatgea 441 
Lys Arg Leu Arg Ser Arg Arg Gin Lys Ser Val Thr 

115 120 
cttggccggg aagacagagc aagccaa 4 68 



<210> 672 
<211> 527 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 85. . 327 



<400> 672 

cttttctatg tcaattttct ggcaacaaag tctgtggcca agaaagaagc taccgactcc 60 
ttcgaccaca aaaagttctt ccaa atg gtc ggc ctg aag aaa aag agt gcg 111 

Met Val Gly Leu Lys Lys Lys Ser Ala 

1 5 

159 



207 



gat 


gat 


gtg 


aag 


aag 


gtg 


ttt 


cac 


atg 


ctg 


gac 


aag 


gac 


aaa 


agt 


ggc 


Asp 


Asp 


Val 


Lys 


Lys 


Val 


Phe 


His 


Met 


Leu 


Asp 


Lys 


Asp 


Lys 


Ser 


Gly 


10 










15 










20 










25 


ttc 


ate 


gag 


gag 


gat 


gag 


ctg 


gga 


ttc 


ate 


eta 


aaa 


ggc 


ttc 


tec 


cca 


Phe 


He 


Glu 


Glu 


Asp 


Glu 


Leu 


Gly 


Phe 


He 


Leu 


Lys 


Gly 


Phe 


Ser 


Pro 










30 










35 










40 




gat 


gee 


aga 


gac 


ctg 


tct 


get 


aaa 


gaa 


acc 


aag 


atg 


ctg 


atg 


get 


get 


Asp 


Ala 


Arg 


Asp 


Leu 


Ser 


Ala 


Lys 


Glu 


Thr 


Lys 


Met 


Leu 


Met 


Ala 


Ala 








45 










50 










55 






gga 


gac 


aaa 


gat 


ggg 


gac 


ggc 


aaa 


att 


ggg 


gtt 


gac 


gaa 


ttc 


tec 


act 


Gly Asp 


Lys 


Asp 


Gly 


Asp 


Gly 


Lys 


He 


Gly 


Val 


Asp 


Glu 


Phe 


Ser 


Thr 






60 










65 










70 








ctg 


gtg 


get 


gaa 


ast 


aag 


aag 


cac 


tgactgcccc • 


tggtcttcca cctctctgcc 


Leu 


Val 


Ala 


Glu 


Xaa 


Lys 


Lys 


His 




















75 










80 





















255 



303 



357 



ctgaacaccc aatctcggcc cctcttgcca ccctcctgca tttctgttca gttcgtttat 417 
gttatttttt actcccccat cccctgtggc cctctaatga caccattctt ctggaaaatg 477 



WO 02/083898 

369 

ctggagaagc aataaaggtt gtaccagtca aaaaaaaaaa 



aaaatgcgaa 



PCT/IB01/00914 

527 



<210> 673 
<211> 594 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 161. .589 



<400> 673 

aaaaggatat acaaagaatg attttgccat tcatacacta aaaaataact ttgaagcaga 60 
tgcttatttt gttaaagtta ttgttcttag tggtctgtgt tctaatgact gtcctcgtaa 120 

gataacaccc tttggagtga atcaacctgg accttatatc atg tat aca act gta 175 

Met Tyr Thr Thr Val 

1 5 

gat gca aat ggg tat ctg aaa aat gga tea get ggc caa etc age caa 223 

Asp Ala Asn Gly Tyr Leu Lys Asn Gly Ser Ala Gly Gin Leu Ser Gin 

10 15 20 

tec tea cat tta get etc caa eta cca tac aac gtg ctt ggt tta ggt 271 

Ser Ser His Leu Ala Leu Gin Leu Pro Tyr Asn Val Leu Gly Leu Gly 

25 30 35 

egg age gca aat ttt ctt gac cat etc tac gtt ggt att ccc cgt cca 319 

Arg Ser Ala Asn Phe Leu Asp His Leu Tyr Val Gly lie Pro Arg Pro 

40 45 50 

tct gga gaa aaa tct ata cga aaa caa gag tgg act gca ate att cca 367 

Ser Gly Glu Lys Ser lie Arg Lys Gin Glu Trp Thr Ala lie lie Pro 

55 ~ 60 65 

aat tec cag eta att gtc att cca tac cct cac aat gtc cct cga agt 415 

Asn Ser Gin Leu lie Val lie Pro Tyr Pro His Asn Val Pro Arg Ser 

70 75 80 85 

tgg agt gee aaa ctg tat ctt aca cca agt aat att gtt ctg ctt act 4 63 

Trp Ser Ala Lys Leu Tyr Leu Thr Pro Ser Asn lie Val Leu Leu Thr 

90 95 100 

get ata get etc ate ggt gtc tgt gtt ttc ate ttg gca ata att ggc 511 

Ala lie Ala Leu lie Gly Val Cys Val Phe lie Leu Ala lie lie Gly 

105 110 115 

att tta cat tgg cag gaa aag aaa gca gat gat aga gaa aaa cga caa 559 

lie Leu His Trp Gin Glu Lys Lys Ala Asp Asp Arg Glu Lys Arg Gin 

120 125 130 

gaa gee cac egg ttt cat ttt gat get atg tgact 594 
Glu Ala His Arg Phe His Phe Asp Ala Met 
135 140 



<210> 674 
<211> 545 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 297 . . 500 



<400> 674 

atgaggttcc 

tccccttcca 

cataatgeat 

aaaaaggcaa 

ggtaagtctg 



atgcggccaa 
aatactttaa 
gggtccagtc 
gagtaatatt 
tgtgtgccag 



caagagaggg 
gttcccagtt 
ctgttctttg 
caaaatactg 
cctgcagcag 



agatagaaag 
taacctatgt 
ggttgcacaa 
aagagctatg 
gtggtctcag 



atg gac acc atg gat ccc ttc tgg ttt aat gca 



gaaagggece cagatgtggc 
gatcctttca atatttccac 
aataattagc acaagcatac 
gtacactaaa gtgccctggg 
ccctggaagg aacagc atg 

Met 
1 

ttc aaa agg act aat 



60 
120 
180 
240 
299 



347 
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Met Asp Thr Met Asp Pro Phe Trp Phe Asn Ala Phe Lys Arg Thr Asn 

5 10 15 

acc ata ctg cac cat ttg aga atg tec aag cac aca gat gca gca gaa 395 

Thr lie Leu His His Leu Arg Met Ser Lys His Thr Asp Ala Ala Glu 

20 25 30 

gag gtg eta ttg gaa aaa aaa ggt tgc acg gga gtc ata aca eta aac 443 

Glu Val Leu Leu Glu Lys Lys Gly Cys Thr Gly Val lie Thr Leu Asn 

35 4 0 J 4 5 

aga cca aag ttc etc aat gca ctg act ctt aat atg att egg cag att 491 

Arg Pro Lys Phe Leu Asn Ala Leu Thr Leu Asn Met lie Arg Gin lie 
50 ~ 55 60 65 

tat cac age taaagaagtt ggaacaagat cctgaaactt tcctgatcat tataa 545 
Tyr His Ser 

<210> 675 
<211> 534 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 13. .234 



<400> 675 
tttttttcca 



get aat gaa 
Ala Asn Glu 
15 

ggc gtc tac 
Gly Val Tyr 
30 

ata eta aat 
lie Leu Asn 

ttg agt gac 
Leu Ser Asp 

gacaaatgga 
accacaagat 
ttccctgcct 
agacaggctt 
ettctgeagg 



ct atg get 
Met Ala 
1 

act ccc ga 
Thr Pro Gl 

cgc ttt gc 
Arg Phe Al 
35 

ttg gga ct 
Leu Gly Le 
50 

att gac ct 
lie Asp Le 
65 

atcctgtgct 
gtgecctgat 
agtttccttt 
aagctaaagt 
gggtgatttt 



tec age act gtc ccg gtg age get get ggc teg 
Ser Ser Thr Val Pro Val Ser Ala Ala Gly Ser 
5 10 
ata ccg gac aac gtg gga gat tgg ctt egg 
u lie Pro Asp Asn Val Gly Asp Trp Leu Arg 

20 25 
c act gat agg aat gac ttc egg agg aac ttg 
a Thr Asp Arg Asn Asp Phe Arg Arg Asn Leu 

40 45 
c ttt get gcg gga gtt tgg ctg gee agg aac 
u Phe Ala Ala Gly Val Trp Leu Ala Arg Asn 
55 60 
atg gca cct cag cca ggg gtg tagecaagta 
u Met Ala Pro Gin Pro Gly Val 
70 

gaacccgaat cttccaaaaa acagcctaca atctgtgacc 
ggcagctgaa gtttgattca gatgggcact tttcttcccc 
tgttccttga gtccacgcag aattccattc tctggtcagc 
attgectcta ttctgtaaag ttctgtacat agttcccaag 
tgctcttgtc ctgagaaata acagtgctgt 



51 



99 



147 



195 



244 



304 
364 
424 
484 
534 



<210> 676 
<211> 509 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 125, 



.367 



<220> 

<221> misc__f eature 

<222> 91, 461 

<223> n=a, g, c or t 



<400> 676 

agccgagcct gcggaaggcg geggeggegg cacctgcgat cageggctgg ggcaggttat 60 

ggtagtgcgg actgcggtgt gagcagagcg nccacggggc ccgccatgcg ccggcggccc 120 

tgac atg ggc gee acg ggt cca aag etc ggg gee tgt ggc ctt cgc etc 169 
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Met Gly Ala Thr Gly Pro Lys Leu Gly Ala Cys Gly Leu Arg Leu 
1 5 10 15 

ggc ggc egg ags ggc ggc tea gag gca gca gga get gag caa get ttg 
Gly Gly Arg Xaa Gly Gly Ser Glu Ala Ala Gly Ala Glu Gin Ala Leu 

20 25 30 

gtr egg cct egg ggc cga get gtr ccc ccc ttc gta ttc acg cgc cgc 
Val Arg Pro Arg Gly Arg Ala Val Pro Pro Phe Val Phe Thr Arg Arg 

35 4 0 4 5 

ggc tct atg ttc tat gat gag gat ggg gat ctr get cac gag ttc tat 
Gly Ser Met Phe Tyr Asp Glu Asp Gly Asp Leu Ala His Glu Phe Tyr 

50 55 60 

gag gag aca ate gtc acc aag aac ggg cag aak ckg gec aag ctg agg 
Glu Glu Thr lie Val Thr Lys Asn Gly Gin Xaa Xaa Ala Lys Leu Arg 

65 70 75 

gca teg tgaagctgga tcasccccgc atccacgtgg atttccctgt gatcctctat 
Ala Ser 
80 

gaggtgtgac ccctgggagg tggcagacag aagcaccccc tgcnccggca agaaactccc 
aggctcaatc aaggtgtggc ttccatttga gg 



217 



265 



313 



361 



417 



477 
509 



<210> 677 
<211> 591 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 101. 



.304 



<400> 677 

gcggaagtgc tgcttactgg gtcctgaatc ggctgcgtgc 
ctgaagtcac gggaacaacg ctgaggggca gecagagatt 



tgg gaa gaa 
Trp Glu Glu 

gec aaa tgg 
Ala Lys Trp 

att ttg gag 
lie Leu Glu 
40 

cct gtc tat 
Pro Val Tyr 
55 

taaaagaagg 
taacaattct 
acaattacag 
tgtacacaca 
tgtacaaact 



cca etc tac gtt ate acc ttc tac 
Pro Leu Tyr Val lie Thr Phe Tyr 



10 



15 



aca gaa age ccg aga caa cat gaa 
Thr Glu Ser Pro Arg Gin His Glu 
25 30 
aaa ttg act tac cat acc act cat 
Lys Leu Thr Tyr His Thr Thr His 
45 

gtc caa tea gca gaa tgt etc gga 
Val Gin Ser Ala Glu Cys Leu Gly 
60 

agactgaaat aatagcatct ttgatgaaaa 
ggggatcttg gtgattacag agttcttaat 
getgectata ggtcctatag tgctcacaca 
ettgeagtat attcatctct ttgtcttatt 
aatgacaaat acttgacagt cacacagcag 



tgttctaggc 
atg tgg get 
Met Trp Ala 
1 

ata gta gca 
He Val Ala 

gtt gtt eta 
Val Val Leu 
35 

caa ccc atg 
Gin Pro Met 
50 

cca cct aaa 
Pro Pro Lys 
65 

ctatctggaa 
ccctctgtcc 
cctccagccc 
tgagagtagg 
tgataca 



atctttcacc 
ttt cca 
Phe Pro 
5 

gtt gaa 
Val Glu 
20 

caa gtt 
Gin Val 

caa aag 
Gin Lys 



aag 

Lys 

gacaagttgt 
ataggtgatg 
ttccccatgg 
gctgggtgtg 



60 
115 



163 



211 



259 



304 



364 
424 
484 
544 
591 



<210> 678 
<211> 348 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 11. . 



331 



<400> 678 

aaaaaacaac atg cca cag tta aaa ctt act tea gtt tgg aca act act 



49 



WO 02/083898 PCT/IB01/00914 

372 

Met Pro Gin Leu Lys Leu Thr Ser Val Trp Thr Thr Thr 
15 10 
cac age tac tac aca grg acc cga acg agk tea cct ggd tak ama cct 97 
His Ser Tyr Tyr Thr Xaa Thr Arg Thr Xaa Ser Pro Gly Xaa Xaa Pro 

15 ~ 20 25 

gga cca cca cca atg gat atw caa atg gca aac aat ttt act ccg ccc 145 
Gly Pro Pro Pro Met Asp lie Gin Met Ala Asn Asn Phe Thr Pro Pro 
30 35 40 45 

tct gca act cct cag gga aat gac tgt gac etc tat gca cat cac age 193 
Ser Ala Thr Pro Gin Gly Asn Asp Cys Asp Leu Tyr Ala His His Ser 

50 55 60 

acg gee agt ata gta atg cct ctg cat tac age etc gkc ttc ate att 241 
Thr Ala Ser lie Val Met Pro Leu His Tyr Ser Leu Xaa Phe lie lie 

65 70 75 

ggg etc gtg gga aac tta eta gee ttg gtc gtc att gtt caa aac agg 289 
Gly Leu Val Gly Asn Leu Leu Ala Leu Val Val lie Val Gin Asn Arg 

80 85 90 

aaa aaa ate ate tea cct tgr gee cam ccc cam ccc att gkc 331 
Lys Lys lie lie Ser Pro Xaa Ala Xaa Pro Xaa Pro lie Xaa 

95 100 105 

taatctgtag aagctaa 348 



<210> 679 
<211> 503 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 194 . . 409 



<400> 679 

catctcccac caggccccac ctgcaacact ggggatcaaa tttcaacacg agatttggaa 
ggggcaaata tccaaactgt atcagcactc tttgaaagat tacagagtaa atatctttct 
tegtcagaaa tggaatgatc cccgcctcgc gtacagtgaa tatcctgacg actctttaga 
cctcgacccc tec atg ttg gac tec att tgg aaa cct gat ttg ttc ttt 
Met Leu Asp Ser lie Trp Lys Pro Asp Leu Phe Phe 
1 5 10 

gee aat gaa aag ggt gee aac ttt cat gaa gtc act aca gac aac aaa 
Ala Asn Glu Lys Gly Ala Asn Phe His Glu Val Thr Thr Asp Asn Lys 

15 20 25 

ttg eta aga att ttc aaa aat gga aat gtt ctt tat tea ata aga tta 
Leu Leu Arg lie Phe Lys Asn Gly Asn Val Leu Tyr Ser lie Arg Leu 

30 35 4 0 

aca tta aca ctt tec tgt cca atg gat etc aag aat ttt ccc atg gat 
Thr Leu Thr Leu Ser Cys Pro Met Asp Leu Lys Asn Phe Pro Met Asp 
45 50 55 60 

gta caa aca tgt ata atg caa ctg gaa age ttg gtt taaaatgaca 
Val Gin Thr Cys lie Met Gin Leu Glu Ser Leu Val 

65 70 
agataaaaga ttgtaaatcg gtggaaccac tggaaagtag gcaaaaagag tttaacctac 
tggaaagcaa gagctgtcag aatc 



60 
120 
180 
229 



277 



325 



373 



419 



479 
503 



<210> 680 
<211> 561 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 256. .426 



<400> 680 
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373 

ttgaagattt ttttggtata ttctagatgt taatctcatc ctgtcttctg tttatcaact 
ttgtcaatgg taactttgag aaatttttag ttttgatata gccccagtcc attaattttt 
gccttaggtt gcactttaag agtataagat atttttcctt accccaagat tacaaagata 
ttctgtattc tgtcagcttt ccagttttac ctttcacatt gagggcttta ttctatttga 
agttcatctt catat atg gga aaa caa agg aat etc aca tta ttt ttt tec 
Met Gly Lys Gin Arg Asn Leu Thr Leu Phe Phe Ser 



1 5 
ttt cca ttc tgt gag aca gtt ttc eta aat 
Phe Pro Phe Cys Glu Thr Val Phe Leu Asn 

15 20 
ttc ttt cct tea tat agt ttt gtg gag 
Phe Phe Pro Ser Tyr Ser Phe Val Glu 

30 35 
ttc cca cag aca cat att agt tta ctt 
Phe Pro Gin Thr His lie Ser Leu Leu 
45 50 



cca 
Pro 

ttt 
Phe 



cat 
His 

ctt 
Leu 

caa 
Gin 
55 



eta 
Leu 

tta 
Leu 
40 
ctt 



10 

tta aac agt gta 

Leu Asn Ser Val 
25 

tea tac etc aga 

Ser Tyr Leu Arg 

tat taagagttat 



Leu Tyr 

ctttgggaga ctgaggtggg 



taaggctgag cgtggtggct caatgectgt aatcccagca 
aggattgttt gagtccagga gttcaagacc agectgggea acatggagag accctgtctc 
tacaa 



60 
120 
180 
240 
291 



339 



387 



436 



496 
556 
561 



<210> 681 
<211> 520 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 21. .326 



<400> 681 

ctttggcttc cccaccggca atg gag egg ttg acg ttg cct etc ggc ggc gcg 53 

Met Glu Arg Leu Thr Leu Pro Leu Gly Gly Ala 
15 10 



gcg 


gcg 


gtg 


gac 


gag 


tac 


ctg 


gag 


tac 


egg 


aga 


att 


gtt 


ggt 


gag 


gat 


101 


Ala 


Ala 


Val 


Asp 


Glu 


Tyr 


Leu 


Glu 


Tyr 


Arg 


Arg 


He 


Val 


Gly 


Glu 


Asp 










15 










20 










25 








gat 


gga 


ggg 


aaa 


ctt 


ttt 


act 


cct 


gaa 


gaa 


tat 


gaa 


gaa 


tac 


aaa 


aga 


149 


Asp 


Gly 


Gly 


Lys 


Leu 


Phe 


Thr 


Pro 


Glu 


Glu 


Tyr 


Glu 


Glu 


Tyr 


Lys 


Arg 








30 










35 










40 










aaa 


gtt 


tta 


cct 


ctg 


cgc 


tta 


caa 


aac 


aga 


tta 


ttt 


gtg 


age 


tgg 


egg 


197 


Lys 


Val 


Leu 


Pro 


Leu 


Arg 


Leu 


Gin 


Asn 


Arg 


Leu 


Phe 


Val 


Ser 


Trp 


Arg 






45 










50 










55 












tea 


cca 


aca 


ggg 


atg 


gat 


tgt 


aaa 


ctt 


gtg 


ggc 


cca 


gag 


aca 


ctg 


tgt 


245 


Ser 


Pro 


Thr 


Gly 


Met 


Asp 


Cys 


Lys 


Leu 


Val 


Gly 


Pro 


Glu 


Thr 


Leu 


Cys 




60 










65 










70 










75 




ttt 


tgt 


aca 


cat 


agg 


tat 


aaa 


caa 


cat 


aaa 


act 


gac 


ttg 


gaa 


gcg 


att 


293 


Phe 


Cys 


Thr 


His 


Arg 


Tyr 


Lys 


Gin 


His 


Lys 


Thr 


Asp 


Leu 


Glu 


Ala 


He 












80 










85 










90 






cct 


cag 


cag 


tgc 


ccc 


att 


gat 


ccc 


cct 


gee 


aag 


tgactggctg ecagtgeagg 


346 


Pro 


Gin 


Gin 


Cys 


Pro 


He 


Asp 


Pro 


Pro 


Ala 


Lys 















95 100 



gcttaccttt atgtcccctt gaatggtagc cagcccattc getgeaggtg caaacacttt 406 
gctgatcagc acagtgctgc gcctggcttt acatgeaata catgttccaa gtgttcagga 4 66 
ttccatagct gcttcacttg tgcttgtggt cagcctgcat attggycatg acac 520 



<210> 682 
<211> 513 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 85 . . 396 
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<400> 682 

actttctcgg ggtgtctgcg taactgccca gacttgcctt ggtttggtca gatgacctcc 
tctgggactg gctagccagc gttc atg tct aca tta ttg ate teg tct tec 

Met Ser Thr Leu Leu lie Ser Ser Ser 
1 5 
gat gee tta ttg gag gaa ctg gaa cgc tec ace ctt cag gac agt gat 
Asp Ala Leu Leu Glu Glu Leu Glu Arg Ser Thr Leu Gin Asp Ser Asp 
10 15 " 20 25 

gaa tat tec aac cca get cct ctt ccc ctg gat cag cat tec aga aag 
Glu Tyr Ser Asn Pro Ala Pro Leu Pro Leu Asp Gin His Ser Arg Lys 

30 35 40 

gag act aac ctt gat gag act teg gag ate ctt tct att cag gat aac 
Glu Thr Asn Leu Asp Glu Thr Ser Glu lie Leu Ser lie Gin Asp Asn 

45 50 55 

aca agt ccc ttg ccg gcg cas teg tgt ata eta cca ata tec agg age 
Thr Ser Pro Leu Pro Ala Xaa Ser Cys lie Leu Pro lie Ser Arg Ser 

60 65 70 

tea atg tct aca gtg aag ccc aag age caa agg aat cac cac cgc ctt 
Ser Met Ser Thr Val Lys Pro Lys Ser Gin Arg Asn His His Arg Leu 

75 80 85 

eta aaa cgt cag cag ctg etc agt tgg atg age tea ttg etc ace 
Leu Lys Arg Gin Gin Leu Leu Ser Trp Met Ser Ser Leu Leu Thr 
90 95 100 

tgactgagat geaggecaag gttgcagtga gagcagatgc tggcaagaag cacttaccag 
acaagcagga tcacaaggcc tccctggact caatgettgg gggtctcgag caggaat 



60 
ill 



159 



207 



255 



303 



351 



396 



456 
513 



<210> 683 
<211> 521 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 317. .505 



<400> 683 

aagggaatct 

ggeggaggag 

aggeggggea 

cccaattctt 

cagtggcggc 

cgagtccccg 



atg 
Met 

age 
Ser 

cag 

Gin 

45 

cag 

Gin 



ctg 
Leu 

etc 

Leu 

30 

age 

Ser 



agg 
Arg 
15 
ttt 
Phe 

aaa 
Lys 



teg etc 
Ser Leu 



ccgggcgggg tagtgeagge geegggttte ccgcggtccg agctggcgcg 
aategctett aaagggccag cgcacacgcg ttcttttgtt ccggggccgc 
ggcccgactt tcgccgtctt cttgtctact ctccagaacg gecatgattt 
cattctgtcc tccaaggggg accgctcatc tacaaagact tecgegggga 
cgggatgtgg ccgagctctt etaceggaag ctgaegggae tgecaggaga 
gttgtc atg gac tat ggc tat gta cag ace aca tec acg gag 
Met Asp Tyr Gly Tyr Val Gin Thr Thr Ser Thr Glu 
15 10 
aat ttc ate cag acg gaa get gtg gtc age aag ccc ttc 
Asn Phe lie Gin Thr Glu Ala Val Val Ser Lys Pro Phe 

20 25 
gac etc age age gtt ggc ttg ktt ggg get gag aca caa 
Asp Leu Ser Ser Val Gly Leu Xaa Gly Ala Glu Thr Gin 

35 4 0 

gtg gee ccc cag cag tgc age cag yeg sec cgt cct gkc 
Val Ala Pro Gin Gin Cys Ser Gin Xaa Xaa Arg Pro Xaa 

50 55 60 

tgaccagagc caaaag 



60 
120 
180 
240 
300 
352 



400 



448 



496 



521 



<210> 684 
<211> 462 
<212> DNA 
<213> Homo sapiens 



<220> 
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<221> CDS 
<222> 2. . 196 



<400> 684 
a atg etc a 

Met Leu S 

1 

ctg etc ggc 
Leu Leu Gly 

gcg cac tea 
Ala His Ser 
35 

etc gtc ggc 
Leu Val Gly 
50 

gag tgatcat 

Glu 

65 

ctcatcgcca 
tctggtacgt 
eggaatgact 
actatgtctg 



gc gcg ctg ccc ggc tgg gga ccc gcg cac ctg cag cgc ccg 
er Ala Leu Pro Gly Trp Gly Pro Ala His Leu Gin Arg Pro 
5 "* ^ 10 15 

cct gca tec tgc ctg ggc ate ctg cgc ccg gee atg acg 
Pro Ala Ser Cys Leu Gly lie Leu Arg Pro Ala Met Thr 
20 25 30 

ttc gee etc ccg gtc ate ate ttc acc acg ttc tgg ggc 
Phe Ala Leu Pro Val lie lie Phe Thr Thr Phe Trp Gly 

40 45 
ate gee ggg ccc tgg ttc gtg ccg aag gac cca acc gcg 
lie Ala Gly Pro Trp Phe Val Pro Lys Asp Pro Thr Ala 
55 60 
cac catgctggtc gccaccgccg tctgctgtta cctcttctgg 



tcctggcgca stgaaccccc tgttcgggcc ccagctgaag aatgagacca 
gcgcttcctg tgggagtgac ccgccgcccc cgacccaggt gcccagctct 
gtggctccac tgtccctgac aaccccttcg tccggaccct cccccacaca 
gtcaccagct ccctcctgct ggcacc 



49 



97 



145 



193 



246 



306 
366 
426 
462 



<210> 685 
<211> 513 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 128. .376 



<400> 685 

atctggatat ttctcacatt tgccttggtt tgatgetect ggagttccca aaagaataga 60 
gatcacttta teaagcttge agaacaggtt gcatgattgt tgtgctgctg ettgetagta 120 



aaagttt 


atg 


ctt 


gta 


gaa 


aga 


aaa 


gtt 


aat 


ggg 


ctg 


gtg 


ccg 


egg 


etc 


169 






Met 


Leu 


Val 


Glu 


Arg 


Lys 


Val 


Asn 


Gly 


Leu 


Val 


Pro 


Arg 


Leu 








1 








5 










10 












atg 


cct 


gta 


ate 


gca 


gca 


cat 


tgg 


gag 


gee 


cag 


gtg 


ggc 


gca 


tea 


ctt 


217 


Met 


Pro 


Val 


He 


Ala 


Ala 


His 


Trp 


Glu 


Ala 


Gin 


Val 


Gly 


Ala 


Ser 


Leu 




15 










20 










25 










30 




gag 


ggt 


tec 


age 


ate 


ttg 


gac 


aga 


get 


gtt 


att 


gaw 


cac 


aat 


ttg 


ttg 


265 


Glu 


Gly 


Ser 


Ser 


He 


Leu 


Asp 


Arg 


Ala 


Val 


He 


Xaa 


His 


Asn 


Leu 


Leu 












35 










40 










45 






tct 


gca 


age 


aaa 


tta 


tat 


aat 


aat 


att 


acc 


ttc 


gaa 


gaa 


ctt 


gga 


get 


313 


Ser 


Ala 


Ser 


Lys 


Leu 


Tyr 


Asn 


Asn 


He 


Thr 


Phe 


Glu 


Glu 


Leu 


Gly 


Ala 










50 










55 










60 








ctt 


tta 


gag 


ate 


cct 


gca 


get 


aag 


gcg 


gaa 


aag 


ata 


gca 


tct 


caa 


atg 


361 


Leu 


Leu 


Glu 


He 


Pro 


Ala 


Ala 


Lys 


Ala 


Glu 


Lys 


He 


Ala 


Ser 


Gin 


Met 





65 70 75 



ata acc gaa gac gta tgaatggatt tattgaccag attgatggaa tagttcattt 416 
He Thr Glu Asp Val 
80 

tgaaacacga gaagccctgc caacgtggga taagcagatc caatcacttt gtttccaagt 476 
gaataacctt tttggagaaa attagtcaaa cagcacc 513 



<210> 686 
<211> 498 
<212> DNA 
<213> Homo sapiens 



<220> 
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376 

<221> CDS 
<222> 118. .399 



<400> 686 

aacaattcat gaagttgaag aaaagacact gtcagaaatg aacacagaag cggasaacag 60 
cttctccatc acgccagaaa tggcaatgct gaagaagtaa gacaactatt agagacc 117 



atg 


gcg 


agt 


aat 


gaa 


gtg 


att 


get 


gac 


att 


aat 


tgc 


aaa 


gga 


aga 


agt 


165 


Met 


Ala 


Ser 


Asn 


Glu 


Val 


He 


Ala 


Asp 


He 


Asn 


Cys 


Lys 


Gly 


Arg 


Ser 




1 








5 










10 










15 






aag 


tct 


aac 


ttg 


ggc 


tgg 


aca 


ccc 


eta 


cat 


ctg 


gca 


tgc 


tat 


ttt 


gga 


213 


Lys 


Ser 


Asn 


Leu 
20 


Gly 


Trp 


Thr 


Pro 


Leu 
25 


His 


Leu 


Ala 


Cys 


Tyr 
30 


Phe 


Gly 




cac 


aga 


caa 


gtg 


gtc 


cag 


gat 


ctg 


ttg 


aag 


get 


ggt 


gca 


gaa 


gtg 


aat 


261 


His 


Arg 


Gin 

35 


Val 


Val 


Gin 


Asp 


Leu 
40 


Leu 


Lys 


Ala 


Gly 


Ala 
45 


Glu 


Val 


Asn 




gtg 


ttg 


aat 


gac 


atg 


gga 


gac 


acg 


ccg 


ctt 


cat 


cga 


get 


gee 


ttt 


aca 


309 


Val 


Leu 

50 


Asn 


Asp 


Met 


Gly 


Asp 
55 


Thr 


Pro 


Leu 


His 


Arg 
60 


Ala 


Ala 


Phe 


Thr 




gga 


cga 


aag 


gtg 


aaa 


ate 


att 


eta 


tgt 


tea 


atg 


ttt 


gta 


agt 


gag 


gta 


357 


Gly 


Arg 


Lys 


Val 


Lys 


He 


He 


Leu 


Cys 


Ser 


Met 


Phe 


Val 


Ser 


Glu 


Val 




65 










70 










75 










80 




ttt 


gga 


gga 


gta 


gtt 


acc 


att 


gtt 


ttc 


tct 


gtt 


ata 


acc 


ate 






399 


Phe 


Gly 


Gly 


Val 


Val 
85 


Thr 


He 


Val 


Phe 


Ser 
90 


Val 


He 


Thr 


He 









tgaccagcaa ccgagaaagc cacacaaaaa aatgtataca ccagcacttt gggtcaaagg 459 
ccacaggatc ttttgagtct gacagtgagg tccagtact 4 98 



<210> 687 
<211> 550 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 82. .306 



<400> 687 

agtctcgcca gctgccggtc ttteggggge teegtaaett tctatccgtc cgcgtcagcg 60 
cttgccaccc tcatctccaa t atg cct ggt ccg acc ccc agt ggc act aac 111 

Met Pro Gly Pro Thr Pro Ser Gly Thr Asn 
15 10 
gtg gga tec tea ggg cgc tct ccc age aaa gca gtg gee gee egg gcg 159 
Val Gly Ser Ser Gly Arg Ser Pro Ser Lys Ala Val Ala Ala Arg Ala 

15 20 25 

ggg gat cca ctg tec ggc aga gga aaa atg cca get gtg gga caa gga 207 
Gly Asp Pro Leu Ser Gly Arg Gly Lys Met Pro Ala Val Gly Gin Gly 

30 35 4 0 

gtg cag gee gca caa cct egg cag gca ccg ggg gga tgt ggc gat tct 255 
Val Gin Ala Ala Gin Pro Arg Gin Ala Pro Gly Gly Cys Gly Asp Ser 

45 50 55 

aca cag aag att cac ctg ggc tea aag ttg gee ctg ttc cag tat tgg 303 
Thr Gin Lys He His Leu Gly Ser Lys Leu Ala Leu Phe Gin Tyr Trp 

60 65 70 

tta tgagtcttct gttcatcget tctgtattta tgttgcacat ttggggcaag 356 
Leu 
75 

tacactegtt egtagattea gttacatcca tctgtcatct gaagaaggag gaaaaaaccc 416 
aacatttctt ggaccaaaag tatagtgact atctgttcat gagagaaatt ttctgtaagc 476 
ttgctgtttt acaggggatt tatcaataat tgattttgag gaatcagttt ttttctatgg 536 
ctaataaact tttt 550 



<210> 688 
<211> 501 



WO 02/083898 
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377 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 8. .487 



<400> 688 



agctcct 


atg 


agg 


ccc 


ctg 






Met 
1 


Arg 


Pro 


Leu 


teg 


cct 


ccg 


ccg 


ccg 


ctg 


Ser 


Pro 


Pro 


Pro 


Pro 


Leu 


15 










20 


gtc 


ggc 


tec 


tct 


999 


cgt 


Val 


Gly 


Ser 


Ser 


Gly 


Arg 










35 




gac 


egg 


ctg 


cgc 


ctg 


ccg 


Asp 


Arg 


Leu 


Arg 


Leu 


Pro 








50 






ttt 


tac 


tat 


999 


ate 


ctg 


Phe 


Tyr 


Tyr 


Gly 


He 


Leu 






65 








gaa 


gga 


gee 


aag 


cag 


gag 


Glu 


Gly 


Ala 


Lys 


Gin 


Glu 




80 










att 


caa 


tgk 


gtg 


ate 


aat 


lie 


Gin 


Xaa 


Val 


He 


Asn 


95 










100 


acc 


egg 


age 


tgg 


etc 


tat 


Thr 


Arg 


Ser 


Trp 


Leu 


Tyr 










115 




atg 


gtc 


tec 


age 


aat 


tea 


Met 


Val 


Ser 


Ser 


Asn 


Ser 








130 






gtc 


ctt 


ggt 


aaa 


tec 


tgc 


Val 


Leu 


Gly 


Lys 


Ser 


Cys 



145 



acc tct tgaagaagaa gtac 
Thr Ser 
160 



ccg ccg gtc ggc gat gtc 
Pro Pro Val Gly Asp Val 
5 10 
ctg ccg gtg ccg gtt gtg 
Leu Pro Val Pro Val Val 
25 

etc atg gee tct age age 
Leu Met Ala Ser Ser Ser 
40 

etc tgc ttc ctg ggt gtc 
Leu Cys Phe Leu Gly Val 
55 

cag gaa aag at a aca aga 
Gin Glu Lys He Thr Arg 
70 

acg ttc acc ttt gee tta 
Thr Phe Thr Phe Ala Leu 
85 90 
get gtg ttt gee aag ate 
Ala Val Phe Ala Lys He 
105 

get gee tgt tct ate tec 
Ala Ala Cys Ser He Ser 
120 

gca eta cag ttt gtc aac 
Ala Leu Gin Phe Val Asn 
135 

aag cca ate cca gtc atg 
Lys Pro He Pro Val Met 
150 



egg ctg gag ctg 4 9 

Arg Leu Glu Leu 

age ggg tct cca 97 
Ser Gly Ser Pro 
30 

tec ctg gtg ccc 145 
Ser Leu Val Pro 
45 

ttt gtc tgc tat 193 
Phe Val Cys Tyr 
60 

gga aag tat ggg 241 

Gly Lys Tyr Gly 

75 

act ttg gtc ttc 289 
Thr Leu Val Phe 

tgr tgg gat cgt 337 
Xaa Trp Asp Arg 
110 

tat ctg ggt gee 385 
Tyr Leu Gly Ala 
125 

tac cca act cag 433 
Tyr Pro Thr Gin 
14 0 

etc ctt ggg gtg 481 

Leu Leu Gly Val 

155 

501 



<210> 689 
<211> 494 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 282. .461 



<400> 689 

attttcttaa tttttcagaa ctttaaacag atatgtaatt taaaagatgc agcccctcct 60 
gtgcatgccc tcctcatgct gcccctctgt tccagggccc acctttcctg tcctttgagg 120 
aggacccgcg gcagcaggca ctggagctgg tggtgcagga ggtctccagc gtgctcaggt 180 
eggeggagga gaaggtggtg gtctgtcacg tggtgggcca ggctattcag ttcttggtca 240 
gcgagacgcc tgccttgggt getggatgee ggctgtacga c atg egg etc tec cct 296 

Met Arg Leu Ser Pro 
1 5 

ttc cat tta cag ctg gag ttc cac atg aag gag aag aga gag gac etc 344 
Phe His Leu Gin Leu Glu Phe His Met Lys Glu Lys Arg Glu Asp Leu 

10 15 20 

cag att age tgg tct ttc ate agt gtg ccg gaa atg gee gtt aat ate 392 
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Gin lie Ser Trp Ser Phe lie Ser Val Pro Glu Met Ala Val Asn lie 

25 30 35 

cag ccc aaa cac tgg ggg agg acc agg tgg ctg aga caa gtg cga tgt 
Gin Pro Lys His Trp Gly Arg Thr Arg Trp Leu Arg Gin Val Arg Cys 

40 45 50 

ctg acg ttc tea agg aca tct tgaagcattt ggctggttct gcctctccat cag 
Leu Thr Phe Ser Arg Thr Ser 
55 " 60 



440 



494 



<210> 690 
<211> 582 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 120. 



530 



<400> 690 

aaagtcgtgc tgcaggcgtc gcgccaatct tegctctgag gtgctgtctc aceggtgaga 
cctggaagcg ggegagtetc gtgctgtgtc ggacctgcag cccctggcct tccgccacc 
atg gag tac etc ate ggt ate caa ggc ccc gac tat gtt ctt gtc gec 
Met Glu Tyr Leu lie Gly lie Gin Gly Pro Asp Tyr Val Leu Val Ala 
15 10 15 

tec gac egg gtg gee gee age aat att gtc cag atg aag gac gat cat 
Ser Asp Arg Val Ala Ala Ser Asn lie Val Gin Met Lys Asp Asp His 

20 25 30 

gac aag atg ttt aag atg agt gaa aag ata tta etc ctg tgt gtt gga 
Asp Lys Met Phe Lys Met Ser Glu Lys lie Leu Leu Leu Cys Val Gly 

35 4 0 4 5 

gag get gga gac act gta cag ttt gca gaa tat att cag aaa aac gtg 
Glu Ala Gly Asp Thr Val Gin Phe Ala Glu Tyr lie Gin Lys Asn Val 

50 55 60 

caa ctt tat aag atg cga aat gga tat gaa ttg tct ccc acg gca gca 
Gin Leu Tyr Lys Met Arg Asn Gly Tyr Glu Leu Ser Pro Thr Ala Ala 
65 70 75 80 

get aac ttc aca cgc cga aac ctg get gac tgt ctt egg agt egg acc 
Ala Asn Phe Thr Arg Arg Asn Leu Ala Asp Cys Leu Arg Ser Arg Thr 

8 5 90 95 

cca tat cat gtg aac etc etc ctg get ggc tat gat gag cat gaa ggg 
Pro Tyr His Val Asn Leu Leu Leu Ala Gly Tyr Asp Glu His Glu Gly 

100 105 110 

cca gcg ctg tat tac atg gac tac ctg gca gee ttg gee aag gee ctt 
Pro Ala Leu Tyr Tyr Met Asp Tyr Leu Ala Ala Leu Ala Lys Ala Leu 

115 ' 120 125 

ttg cag ccc acg get atg gtg cct tec tgactctcag tatcctcgac 
Leu Gin Pro Thr Ala Met Val Pro Ser 

130 135 
cgatactaca caccgactat etcaegtgar ag 



60 
119 
167 



215 



263 



311 



359 



407 



455 



503 



550 



582 



<210> 691 
<211> 506 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 4 . . 168 



<400> 691 

aag atg gcg gac cgt ggc ggc gtg ggt gaa gee gca get gtt gga gcg 

Met Ala Asp Arg Gly Gly Val Gly Glu Ala Ala Ala Val Gly Ala 
15 10 15 



48 
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tct cct gca tct gtc cct ggc eta aac ccg acg 
Ser Pro Ala Ser Val Pro Gly Leu Asn Pro Thr 

20 25 
cga ctg cgt ccg ggc tgg egg gga ctg ggg cct 
Arg Leu Arg Pro Gly Trp Arg Gly Leu Gly Pro 

35 40 
egg ggc tgg ggc tgc ttt acg ccc tgaggatccc 
Arg Gly Trp Gly Cys Phe Thr Pro 

50 55 
tggcagcggt gactgtattt ttaaattcat tgacacccaa 
ggacctcttc attgatatca ggactaatat ttatatttga 
atggcacatc ttttattgag caagtatctg taagccattt 
cagagagcag catttcagaa ccaggttctc cttcgaggaa 
gacagaattt gtcaggtgac tacttttcta atgtgttttc 
gagtttgg 



eta ggc tgg agg gag 
Leu Gly Trp Arg Glu 
30 

cgt tgt ggt teg tgg 
Arg Cys Gly Ser Trp 
45 

tttgaggctg tgtgagaatt 



attctatgtg 
atggtggtac 
gcaaccactg 
cagagaaaat 
agagctgtgt 



gcacttacag 
ttccataagc 
atgggaggaa 
gaaaccagca 
atttaagatt 



96 



144 



198 



258 
318 
378 
438 
498 
506 



<210> 692 
<211> 499 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 64 . .459 



<400> 692 



gtcctctggg ccacgctcca cttgcccgct 


. tccacccgga 


aagcccccca ggctgagtgc 


60 


ggc 


atg 


ate 


tec 


ate 


ace 


gaa 


tgg 


cag 


aag 


att 


ggt 


gtg 


ggg 


ate 


acc 


108 




Met 


He 


Ser 


He 


Thr 


Glu 


Trp 


Gin 


Lys 


He 


Gly 


Val 


Gly 


He 


Thr 






1 








5 










10 










15 




ggt 


ttc 


ggc 


ate 


ttc 


ttc 


ate 


etc 


ttt 


gga 


aca 


etc 


ctg 


tac 


ttt 


gat 


156 


Gly 


Phe 


Gly 


He 


Phe 


Phe 


He 


Leu 


Phe 


Gly 


Thr 


Leu 


Leu 


Tyr 


Phe 


Asp 












20 










25 










30 






tec 


gtg 


etc 


ctg 


gee 


ttt 


gga 


aac 


ctg 


ctg 


ttc 


ctg 


acg 


ggc 


ctg 


tec 


204 


Ser 


Val 


Leu 


Leu 


Ala 


Phe 


Gly 


Asn 


Leu 


Leu 


Phe 


Leu 


Thr 


Gly 


Leu 


Ser 










35 










40 










45 








etc 


ate 


att 


ggc 


ctg 


agg 


aag 


ace 


ttt 


tgg 


ttc 


ttc 


ttc 


caa 


egg 


cac 


252 


Leu 


He 


He 


Gly 


Leu 


Arg 


Lys 


Thr 


Phe 


Trp 


Phe 


Phe 


Phe 


Gin 


Arg 


His 








50 










55 










60 










aaa 


etc 


aag 


gga 


ace 


age 


ttc 


etc 


ctg 


ggg 


ggt 


gtg 


gtt 


ate 


gtg 


etc 


300 


Lys 


Leu 


Lys 


Gly 


Thr 


Ser 


Phe 


Leu 


Leu 


Gly 


Gly 


Val 


Val 


He 


Val 


Leu 






65 










70 










75 












eta 


cgc 


tgg 


ccc 


etc 


etc 


ggc 


atg 


ttc 


ctg 


gaa 


ace 


tac 


gga 


ttc 


ttc 


348 


Leu 


Arg 


Trp 


Pro 


Leu 


Leu 


Gly 


Met 


Phe 


Leu 


Glu 


Thr 


Tyr 


Gly 


Phe 


Phe 




80 










85 










90 










95 




age 


etc 


ttt 


aag 


ggc 


ttt 


ttc 


cct 


gtc 


gee 


ttc 


ggc 


ttc 


ctg 


ggc 


aat 


396 


Ser 


Leu 


Phe 


Lys 


Gly 


Phe 


Phe 


Pro 


Val 


Ala 


Phe 


Gly 


Phe 


Leu 


Gly 


Asn 












100 










105 










110 






gtc 


tgc 


aac 


ate 


ccc 


ttc 


ctg 


ggt 


gcg 


ctg 


ttc 


egg 


aga 


ctt 


caa 


ggc 


444 


Val 


Cys 


Asn 


He 


Pro 


Phe 


Leu 


Gly 


Ala 


Leu 


Phe 


Arg 


Arg 


Leu 


Gin 


Gly 





115 120 125 



act age teg atg gtc tgaaaacaga gatgagctcc ttgaacttgg atcattggtt 499 
Thr Ser Ser Met Val 
130 



<210> 693 
<211> 756 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 188. . 616 
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<400> 693 

attttattac ccggagtcct ggttcctggg ccagaccggt gtgtcgtttt tggcccaagc 
tagagaatgt taagggcttc tgcggtgggt tggtgctaga ggcgccgcga acaggtgctg 
cgggggcggc gcgggaggcg gtgcccttgc ttccggatcc ggtctcagct ctgggaggga 
acgggag atg ttg cag gcg ccg aga ggg egg gec agg gec gca etc egg 

Met Leu Gin Ala Pro Arg Gly Arg Ala Arg Ala Ala Leu Arg 

15 10 
aga etc gcg gtt get acg cgc ace atg get gga gcg ccc acg gtc teg 
Arg Leu Ala Val Ala Thr Arg Thr Met Ala Gly Ala Pro Thr Val Ser 
15 20 25 30 

ctt cct gaa etc cgt tea etc eta gec tec gga egg gee egg etc ttc 
Leu Pro Glu Leu Arg Ser Leu Leu Ala Ser Gly Arg Ala Arg Leu Phe 

35 40 45 

gac gtg cgc tct cgc gag gag gcg gca get ggg acc ate cca ggg gcg 
Asp Val Arg Ser Arg Glu Glu Ala Ala Ala Gly Thr lie Pro Gly Ala 



50 



55 



60 



etc aac ate ccg gtg tec gag ttg gag agt get ctg cag atg gag cca 
Leu Asn lie Pro Val Ser Glu Leu Glu Ser Ala Leu Gin Met Glu Pro 

65 70 75 

get gec ttc cag get tta tat tct get gag aag cca aag ctg gaa gat 
Ala Ala Phe Gin Ala Leu Tyr Ser Ala Glu Lys Pro Lys Leu Glu Asp 



80 



85 



90 



95 
cgc 



gag cat etc gtt ttc ttc tgt cag atg ggc aag egg ggc etc cag gca 
Glu His Leu Val Phe Phe Cys Gin Met Gly Lys Arg Gly Leu Gin Ala 

100 105 110 

age tgg ccc gga gtc ttg gat aca ctg ggg etc gca act acg ctg 
Arg Ser Trp Pro Gly Val Leu Asp Thr Leu Gly Leu Ala Thr Thr Leu 

115 120 125 

gag cct ata gag aat ggt tgg aga aag aga gtt agg cag gag gca get 
Glu Pro lie Glu Asn Gly Trp Arg Lys Arg Val Arg Gin Glu Ala Ala 

130 135 140 

tac tgattgecac cccctggccc ettaatggee accttaacta agggtgtgaa 
Tyr 

egggctgact tggtgaattg ggcaactcct tatagtgttg tgcacacaaa agcatcaaat 
aaagaacatt taatcaaaaa aaaaaaaaaw 



60 
120 
180 
229 



277 



325 



373 



421 



469 



517 



565 



613 



666 

726 
756 



<210> 694 
<211> 563 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 296. 



,463 



<400> 694 

aatgtttaat 

aaagcagaga 

ggttgtcgat 

gagacegtte 

atcctcctca 



gcccgcacct 
cagcaggaag 
ggacggtggc 
gcaatggggg 
gcagaagatt 



cgtccccggc 
aggggagtgg 
ggcacccaag 
cctgatcttc 
ccgctgtggg 



caccccgtga 
aggcagccca 
ggggacgtgg 
gctggactgg 
ggcaataaga 



aag atg 
Lys Met 

gee age 
Ala Ser 

tec ctg 
Ser Leu 
35 



age cgt aac 
Ser Arg Asn 
5 

tgg gaa gee 
Trp Glu Ala 
20 

ggc ttt aga 
Gly Phe Arg 



age age etc 
Ser Ser Leu 

aag cat ggc 
Lys His Gly 
25 

cct ttg tec 
Pro Leu Ser 
40 



ggc ggt gee 
Gly Gly Ala 
10 

cct gee tct 
Pro Ala Ser 

ccg tea ctg 
Pro Ser Leu 



ggcagecaga cactctccaa 
ttcacctggg gaaatgactg 
acccgttcta ctatgactat 
ccttcatcgt ggggctcctc 
agegcaggea aatca atg 
Met 
1 

acc cac tgc act ggg 
Thr His Cys Thr Gly 
15 

ggc gee tec cct tct 
Gly Ala Ser Pro Ser 
30 

cca gcg ctt ggg ctg 
Pro Ala Leu Gly Leu 
4 5 



60 
120 
180 
240 
298 



346 



394 



442 
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aag gaa get cca gac tea atg tgacccccag gtggcatcgc aactcctgcc 
Lys Glu Ala Pro Asp Ser Met 
50 55 

tcgtgccacc teatgettat aataaagccg gegtcagaga ccgctgcttc ccycaaaaaa 
aaaaaaaaaa 



493 



553 
563 



<210> 695 
<211> 502 
<212> DNA 
<213> Homo sapiens 



<400> 695 

agaggeccta 

ctgccagcag 

acaaagtcct 

agccaccacc 

aggtggtgaa 

cagacctgct 

ccaggctaca 

tcactctgca 

gcagcccacc 



cacccaccga 
cttctccaag 
ggaagttttc 
gcccatcacc 
gacccacgag 
cacctacttc 
gatgeactgg 
ggacagaggg 
tatcaccaac 



ggcatggggc 
geaegggagg 
cccaaaggcc 
tattccctct 
ccggcctcct 
tgcygggcgt 
gagctgtggt 
gcaggcccca 
ag 



tccctgggct 
aagaaattac 
gctgggtgct 
gtggaaccaa 
tcaacctcaa 
cctccacctc 
ccaagccagt 
gggtggagat 



gttctgcttg 
ccctgtggtc 
cataacctgc 
gaacatcaag 
cgtcacactc 
aggtgeccat 
gtctgagctg 
gatctgecag 



gccgtgctgg 
tccattgcct 
tgtgcacccc 
gtggccaaga 
aagtccagtc 
gtggacagtg 
cgggccaact 
gcgtcctcgg 



60 
120 
180 
240 
300 
360 
420 
480 
502 



<210> 696 
<211> 589 
<212> DNA 
<213> Homo sapiens 



<400> 696 

gtccctcctc 

cctgaaggca 

agccctggca 

tgectctgat 

aaatgtcccc 

tttaggaggt 

tgccccgggt 

tggagatgga 

tagggaactt 

atcatgatga 



ctagttgcag 
ggcgagccag 
ggtctttaag 
ccctgaagga 
arggaaaaca 
ggtgaatacc 
tt kggagagg 
gatgatatgg 
cagttgaggt 
gttttgeett 



tctggtagtt 
ctgcccagga 
tgcgtttgtg 
agaaaaaaaa 
aakttgtgga 
aggagectgg 
atgtgcccaa 
aacggttcat 
actctgcgca 
atgccttgaa 



gtcgctggcc 
aaggtggaaa 
cagecgattt 
aaaaaaamcg 
raaggcccca 
aggaaatgtt 
taggcttgtc 
ggaggagatg 
ttcttatagg 
tcttgaggtt 



gtgtgacggc 
gtggtagaag 
caaggctaag 
ggaaaaaaac 
gtgcaraatg 
aaaggggttt 
gataacattg 
agagagctaa 
ggaccctcct 
aataatcat 



tcgctgttgc 
ctgacccctg 
agagaaagac 
tcaacwkgga 
aagcccccgc 
gggctcmacy 
atatgataga 
ggmggaaaat 
caccatgatc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
589 



<210> 697 
<211> 420 
<212> DNA 
<213> Homo sapiens 



<400> 697 

gattcatagg 

caatatgtag 

ggctaagcgt 

tcacttgggg 

agaaatacaa 

getgaggegg 

ccactgcact 



agcatataat 
aataacatgt 
ggtggttcat 
tcaggagttc 
aagttggccg 
gagagtcact 
ccagcctgga 



ataatctttg 
tctcttagta 
gectgtaate 
gggaccggcc 
ggcgtggtgg 
tgaacccggg 
tgacagagtg 



taaagtagag 
ggcataggat 
ccagcacttt 
tggecgacat 
tgggcgcctg 
aggeggaggt 
agactctgtc 



ggacaattca 
atactacatt 
gggaggcega 
ggtgaagccc 
tggtctcagc 
tgcagtgagc 
tcaaaaaaaa 



gtattttgta 
aagaaatctg 
ggcgggcgga 
cgtctttact 
tacteggggg 
cgagatcacg 
aaaaaaaaaa 



60 
120 
180 
240 
300 
360 
420 



<210> 698 
<211> 496 
<212> DNA 

<213> Homo sapiens 



<400> 698 

gacataatca gagctatget ggaggagaag agggcageca tttgctggct ggcttgcagt 60 

gagecaggag gtggcaggac gagttaggag gctggttcag tagcteggge aagagcaggg 120 

ccccccagga tetgaaggee tcccaggccc ccaggcccag cgggtcccag aggagagega 180 

ggaccccaag gtaactcegg tgagaagggc gaccagggat ttcaaggeca gecaggcttt 240 
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ccgggcccac cgggtccccc tggattccca ggcaaagttg gatcacctgg cccacctggc 300 

cctcaagcag agaagggcag cgaagggatt cgaggcccat caggcctgcc tggctcccct 360 

gggccaccgg gacctcctgg gattcagggc cccgccggtc tggatggttt ggatgggaag 420 

gatggcaagc ctggcttgag gggggacctg gtcctgctgg cccccctgga ctcatgggac 480 

caccgggctt tagggg 4 96 

<210> 699 
<211> 729 
<212> DNA 
<213> Homo sapiens 



<400> 699 

atctctgagc 

caagctggga 

gggtgacttc 

aagtgagtta 

atttgttaga 

cascccccas 

cctggaaatg 

cctctacttc 

aataccagag 

tggccctgac 

ctgagagtat 

cttgcccaag 

gkaccacgc 



ctccgcggcc 
tgcgcggagt 
ggtgcactag 
atatttgtaa 
gagactccag 
attcmcgttt 
catacccatc 
tcacagttca 
tctgcatggg 
ttttctaaga 
cccacccact 
atcacccagc 



ttctctagca 
agcgggtgga 
ggatagtaac 
agtgtttaga 
gaggtcttgc 
gtgagccaag 
tcaaacctgg 
ggcctgttct 
accatttcaa 
tgagaggagc 
tcacagatga 
aagtaagaaa 



aaatggtgga 
aaactggast 
agtaccacct 
acagcgtcta 
ccagggattg 
ccttgctttt 
gctttaaaat 
gccccacaca 
caacccttag 
tgataggaac 
agcaaactga 
cagagctgga 



gccggtaccc 
ggagttctca 
cgtagggttg 
ctgcatggaa 
acttctccst 
gcaccggccg 
cacctcctct 
gcagaaaatg 
aagctgactc 
aatttgggaa 
ggcccagaga 
gatgagctca 



ggctgtaagg 
ctcggccgct 
tagaaagacm 
agtagtattt 
tccstgtctc 
ggctctcttg 
gggacgcctt 
cccacgatgc 
agctgtctcc 
ccctcaaagg 
agggaaatta 
ggccagcacg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
729 



<210> 700 
<211> 462 
<212> DNA 
<213> Homo sapiens 



<400> 700 

caggaaaatc 

gaagccctgg 

ccccagtcct 

gagaagtgca 

caccccggct 

tgggtgggtg 

accctcagct 

tcatgcctat 



gcgagggacg 
aggctgagga 
ccctgcccgc 
gtaccagcat 
gctacacctg 
acgagctgta 
ctcgggcctg 
ataagttggc 



ggcggccccc 
gagaggtggc 
ctccagggcc 
cgcgaaccag 
tgccgactgt 
ctgtgagaag 
agccgccatg 
atggcaggga 



cgacagtcca 
acgccagcct 
ctggccaccc 
gctgtgcgca 
gggctgaacc 
catgcccgcc 
ccctcagcct 
caatggtggg 



gctcctttcg 
tcttgcccag 
ctcccaagct 
tccaggaggg 
tgaagatgcg 
agcgctactc 
gcctcactgc 



gctcttgcag 
ctcactgagc 
ccacacttgt 
ccggtaccgc 
cgggcacttc 
cgcacctgcc 
tgggccaggg 



60 
120 
180 
240 
300 
360 
420 
462 



<210> 701 
<211> 417 
<212> DNA 
<213> Homo sapiens 



<400> 701 

aatgctcagc 

tgcatcctgc 

catcatcttc 

gacccaaccg 

tctggctcat 

gaccatctgg 

gtccggaccc 



gcgctgcccg 
ctgggcatcc 
accacgttct 
cggagtgatc 
cgccatcctg 
trcccagctc 
tcccccacac 



gctggggacc 
tgcgcccggc 
ggggcctcgt 
atcaccatgc 
gcgcastgaa 
tcggaatgac 
aactatgtct 



cgcgcacctg 
catgacggcg 
cggcatcgcg 
tggtcgccac 
ccccctgttc 
tgtggctcca 
ggtcaccagc 



cagcgcccgc 
cactcattcg 
ggccctggtt 
cgccgtctgc 
gggccccagc 
ctgtccctga 
tccctcctgc 



tgctcggccc 
ccctcccggt 
cgtgccgaag 
tgttacctct 
tgaagaatga 
caaccccttc 
tggcacc 



60 
120 
180 
240 
300 
360 
417 



<210> 702 
<211> 470 
<212> DNA 
<213> Homo sapiens 



<400> 702 

gcgcgcccgc cgcgtgccca gcccgggtag ctgctgcagc ctacgctgcc tcggcactgc 
ctgccgcgtg cggagccgga gcccgagcct gagtggcgcc gggcccgacg tggggctcct 



60 
120 
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gggccgcggc 
ggcggcctgg 
cgcctgcggc 
aggaggtgaa 
accggcgtct 
agcgccagcg 



ggcgggcggg 
aggagctgac 
gggaggaggc 
tcggcagctg 
rcaggcagag 
cgggcggcgc 



cgatgctcca 
ggacgaggag 
ggcgcgcctg 
cagggccacc 
aaccgtgagc 
gccgcacgcc 



383 

gaggcctgac 
atggcggcgc 
gcggcactgg 
tgggcgagat 
tgcgcgacct 
agtggcagct 



PCT/IB01/00914 

cagccatgga ggccgaggca 180 

taggcgaaga gctagtgcgg 240 

tcagcgcggc cgcctcatgc 300 

ccgcgagctc aagcagctca 360 

ctgctgcttc ctggactcgg 420 

cttcgggacc 470 



<210> 703 
<211> 452 
<212> DNA 
<213> Homo sapiens 



<400> 703 

aatgcgcaag 

aggcgggttt 

aggcaggaag 

cgcccctcct 

aaggaggcca 

ggtggcagct 

ctgccttcac 

tatgattgca 



aaggtgcctg 
tccagtgtta 
ccgcaaaaag 
caagtgtctg 
ttcttgatgt 
gctcagcaga 
ttagctggag 
actgtgcagg 



cgggactgga 
cttgcggctg 
tttctgagcc 
gctgaaagat 
tgcttctaga 
cccttggcat 
aggtgctggc 
ggcatcagag 



gcagagsggc 
ggcgtggggg 
cccgaacctg 
gccacccagg 
tctcatgtca 
gggaaagaga 
tgtggccggg 
ct 



tgcgagggtc 
actagctgcc 
tagcggacgt 
gaakggaact 
tcaccgagcc 
cggagtccac 
gactgaagcc 



ttcccagcgc 
tttctggcac 
ggaaaaagaa 
cgggctagct 
ctcagctgct 
cccaagccat 
agctgtgctc 



60 
120 
180 
240 
300 
360 
420 
452 



<210> 704 
<211> 304 
<212> DNA 
<213> Homo sapiens 



<400> 704 

atttcgggag 

cgccatctgt 

ctgggcctag 

agtcgagaag 

cggacgggcc 

aata 



cgggattgtt tcgcgcagga agcaggctcc attttagcgc cgcccgccgt 
ttcccttccc tcccccatac ctagcccgag tctgagccct aacgagaagg 
gccgctggat gctggagtga aaggaaggga gaaagggaaa aagcgggaag 
ggagtgttaa gaggccaagt gcgacgcgcg tatccgggca gacggactga 
cgtgcttctg ccgcggctgc gcccggggcc tcttaaccac tgatctttga 



60 
120 
180 
240 
300 
304 



<210> 705 
<211> 269 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_f eature 

<222> 84,216 

<223> n=a, g, c or t 

<400> 705 

agtcgtggac tcgtgcagct ggggcgtccg cagccgctcg tcacccgcgt gatgctgttt 
ctggcgttgg gcagcccgtg gggntcgaac tgcctctctg cggaaggagg actgcattgt 
gtgcggccgc cgcgctccga ggtccccggg cctctgtctc ccgggcgtcc tccagsagcg 
ggysttcggg gmgggtagcc ggctggagta cgggmncttc gggagccgcg cgccttctcc 
ggcgtccggg tcgagcgcag atccctgtt 

<210> 706 
<211> 476 
<212> DNA 
<213> Homo sapiens 



60 
120 
180 
240 
269 



<400> 706 

atttgttcag cctccataat tcgaatacca gggcaggccg agccagccgt gcgccgcgct 60 

ccagggccca gggcgccgca cacgcatcca cccacccacc cagcctcgca gcgccatggg 120 

caagaacaag cagccacgcg gccagcagag gcaggggggc ccgccggccg cggacgccgc 180 

tgggcccgat gacatggagc cgaagaaggg cacgggggcc cccaaggagt gcggggaggg 240 

atcattgtct gcttagtggc ctatcttggc ttgtttatgc tttgtgtctc atatcaggtt 300 
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gacgaacgga catgtattca attttctatg aaactgttat actttctgct gagtgccctg 360 
ggcctgacgg tctcgtgtgc tggccgtggc ctttgccgcc caccactatt cgcagctcac 420 
acagtttacc tgtgagacca cactcgactc ttgccagtgc aaactgccct cctcgg 476 



<210> 707 
<211> 266 
<212> DNA 
<213> Homo sapiens 



<400> 707 

ctctgcgtgc gccggaggct gccgtggcgg gtgggccgcc tgacttctcc tcccggccag 60 

ttctcgagcg cctcaccggg cctcgccctg cagcctcgct ctcgctggcg ctgcgcggcc 120 

taggggactg ggctgctggc ctccgggtgc ggggtggggg caggctccga cctggggcgt 180 

cctggcagcg cgagccgcgg gatgggggcc cgggccgcgg aggaggcrcc gctggygtgt 240 

cccttggtgg agagggcgct gccggy 266 



<210> 708 
<211> 384 
<212> DNA 

<213> Homo sapiens 



<400> 708 

aaaaaacaac atgccacagt taaaacttac ttcagtttgg acaactactc acagctacta 60 

cacagagacc cgaacgagtc actgatatac acctggacca ccaccaatgg atatacaaat 120 

ggcaaacaat tttactccgc cctctgcaac tcctcaggga aatgactgtg acctctatgc 180 

acatcacagc acggccagta tagtaatgcc tctgcattac agcctcgtct tcatcattgg 240 

gctcgtggga aacttactag ccttggtcgt cattgttcaa aacaggaaaa aaatcatctc 300 

accttgagcc cacccccacc ccattgtcta atctgtagaa gctaataaat aatcatccct 360 

ccttgcctag caaaaaaaaa aaaa 384 



<210> 709 
<211> 497 
<212> DNA 
<213> Homo sapiens 



<400> 709 

ctcttttgct 

tcgctctttc 

gtttcttgat 

gaaggaaaaa 

agagatttta 

cctgctgctt 

cacaattatc 

caaggcggat 

tcgacagtgc 



ytaacagaca 
actgctagta 
ctgcttctaa 
caaaataacc 
cttagatgat 
aatcttgccc 
acagatacgg 
gatagcccat 
gaagaat 



agccascgay 
agatcagatt 
aagaagaagt 
tcaactccgt 
ttacacaatg 
ctgamcctct 
agttgccacc 
gtaaagcaat 



tttaggctgg 
gcgtttcttt 
agagaagata 
tttgaaaaaa 
aagaaagtac 
taatgctgat 
actgaaactt 
catgaaaaga 



ataatagtca 
cagttactct 
aatcctgtct 
acattccaag 
atgcactttg 
tctgaggaag 
atgcattcat 
tttttcttca 



aattcttacc 
tcaatcgcca 
tcaatacctg 
aactttcatc 
ggcttctgtc 
atgaagaaca 
tttgtgcatt 
atattttcac 



60 
120 
180 
240 
300 
360 
420 
480 
497 



<210> 
<211> 
<212> 
<213> 



710 
471 
DNA 
Homo 



sapiens 



<220> 

<221> misc_feature 
<222> 423 

<223> n=a, g, c or 



<400> 710 

attcccaact gccagtgatc tctgaagccg actctgaggc tccctctttg ctctaacaga 60 

cagcagcgac tttaggctgg ataatagtca aattcttacc tcgctctttc actgctagta 120 

agatcagatt gcgtttcttt cagttactct tcaatcgcca gtttcttgat ctgcttctaa 180 

aagaagaagt agagaagata aatcctgtct tcaatacctg gaagaaaaac aaaataacct 240 

caactccgtt ttgaaaaaaa cattccaaga actttcatca gakattttac ttagatgatt 300 

tacacaatga agaaagtaca tgcactttgg gcttctgtat gcctgctgct taatcttgcc 360 
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cctgcccctc ttaatgctga ttctgaggaa gatgaagaac acacaattat cacagatacg 
gantgccacc actgaaactt atgcattcat tttgtgcatt caagtcggat g 

<210> 711 
<211> 405 
<212> DNA 
<213> Homo sapiens 



420 
471 



<400> 711 

aatgatcctg 

gccgccttgg 

atgagcacca 

gaggagaaca 

agccccccca 

cttctctggc 

gacaccccca 



cagcagccct 
gggtgacttg 
agcggcgcct 
tggccaccca 
tgaccttctc 
gctatcctgg 
tgttccccct 



tgcagcgagg 
gggcctggac 
ggaggaggag 
cttctctcaa 
cccagccctg 
cagcctcatc 
accctgcaac 



cccccaggga 
gccagctccc 
caggagcctc 
ctcagcctgc 
cccccactca 
cctgaggccc 
cccagctggg 



ggggccagcg 
ctctccgagg 
tgcgcaagcr 
acaatgacca 
ggagcccttg 
tccgtctgct 
gacat 



cctcccgcgg 
agctgtgccc 
rtttctgtct 
cccctactgc 
ctctgagctg 
gakgctgggg 



60 
120 
180 
240 
300 
360 
405 



<210> 712 
<211> 160 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc 
<222> 105 
<223> n=a, 



feature 
g, c or t 



<400> 712 

aagaaaccca atgatcctgc agcagccctt gcagcgaggc ccccagggag gggccagcgc 60 

ctcccgcggg ccgccttggg ggtgacttgg ggccwggacg ssagntcccc yctccwagga 120 

gctgtgccca tgagcaccaa gcggcgvctg gaggaggaga 160 



<210> 713 
<211> 427 
<212> DNA 
<213> Homo sapiens 



<400> 713 

caatcttcgc 

tgtgtcggac 

gccccgacta 

atgaaggacg 

ggakaggctg 

aagatgcgaa 

ctggctgact 

gatgagc 



tctgaggtgc 
ctgcagcccc 
tgtkctkkkk 
atcatgacaa 
gagacactgt 
atggatatga 
gkcttcggag 



tgtctcaccg 
tggccttccg 
hvggcytccg 
gatgtttaag 
acagtttgca 
attgtctccc 
tcggacccca 



gtgagacctg 
ccaccatgga 
accgggtggg 
atgagtgaaa 
gaatatattc 
acggcagsag 
tatcatgtga 



gaagcgggcg 
gtacctcatc 
ccgccagcaa 
agatattact 
agaaaaacgt 
ctaacttcac 
acctcctcct 



agtctcgtgc 
ggtatccaag 
tattgtccag 
cctgtgtgtt 
gcaactttat 
acgccgaaac 
ggctggctat 



60 
120 
180 
240 
300 
360 
420 
427 



<210> 714 
<211> 433 
<212> DNA 
<213> Homo sapiens 



<400> 714 

agactgcgtg cagaaggtga ctgtctcagt ggagctgggt catctcagca ggccttggct 60 

ccttgaactt ttggccgcca tgtgcttccc gaagtcctct ctgatgacat gaagaagctg 120 

aaggcccgaa tgcaccaggc catagaaaga ttttatgata aaatgcaaaa tgcagaatca 180 

ggacgtggac aggtgatgtc gagcctggca gagctggagg acgacttcaa agagggctac 240 

ctggagacag tggcggctta ttatgaggag cagcacccag agctcactcc tctacttgaa 300 

aaagaaagag atggattacg gtgccgaggc aacagatccc ctgtcccgga tgttgaggat 360 

cccgcaaccg aggagcctgg ggagagcttt tgtgacaagg tcatgagatg gttccaggcc 420 

atgctgcagc ggc 433 



<210> 715 
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<211> 326 
<212> DNA 
<213> Homo sapiens 



<220> 

<22l> misc_feature 
<222> 126 

<223> n=a, g, c or t 



<400> 715 

atggcggtct 

ctcgcccaac 

aaactnccgc 

tttgactgga 

cgcagatcca 

ctgacctgtg 



tggcacctct 
cttactacct 
ygctctgcca 
gagaaktgga 
tgccccatga 
acatctctgt 



aattgctctc 
tctgtcggcc 
cggtctgccc 
gatcctgatg 
agagggacaa 
gtttca 



gtgtattcgg 
ctgctctctg 
acccaacgcg 
tttctcagtg 
agaatgggat 



tgccgcgact 
ctgccttcct 
aagacggtaa 
ccattgtgat 
ataggagtgc 



ttcacgatgk 
actcgt kagg 
cccgtgtgac 
gatgaagaac 
acagcccagc 



60 
120 
180 
240 
300 
326 



<210> 716 
<211> 507 
<212> DNA 
<213> Homo sapiens 



<400> 716 

agactgaggc tgaggctggg gaacatcggg cagcatgagc gctgcggctc ttcctgcgca 60 

ccacggctgc ggctcgtgcc tgccggggtc tggtggtctc taccgcgaac cggcggctac 120 

tgcgcaccag cccgcctgta cgagctttcg ccaaagagct tttcctaggc aaaatcaaga 180 

agaaagaagt tttcccattt ccagaagtta gccaagatga acttaatgaa atcaatcagt 240 

tcttgggacc cgtggaaaaa ttcttcactg aagaggtgga ctcccgaaaa attgaccagg 300 

aagggaaaat cccagatgaa actttggaga aattgaagag cctagggctt tttgggctgc 360 

aagtcccaga agaatatggt ggcctgggct tctccaacac catgtactca cgactagggg 420 

agatcatcag catggatggg tccatcactg tgaccctggc agcgcaccag gctattggcc 480 

tcaaggggat catcttggct ggcactg 507 

<210> 717 
<211> 561 
<212> DNA 
<213> Homo sapiens 

<400> 717 

cttcagaatg ctggatgaaa aaattgaaaa gggtcgggat tactgttcgg aagaagagga 60 

tatcacatag caccaatttt accactcaaa ccaggagcta ctactgtgta aataggttac 120 

accccagttg aaatctttgc aaaggtcggt tctattcagc gaacagcact atagcaaaag 180 

aagatcgttc catattgtac gccccattaa attacagtgt ttcttaatga acttgcaaag 240 

gaatattgct aaaaacaaac aaaaaaaact gttatctaac tttctttgtt gctgctagtt 300 

aaaacttgtt gcaacttttc acttctcttg tgtccaggta tgcagcaaaa ttctgcaatt 360 

tcaccttaaa gatactgttg gttttacaga tgctctccaa cctattttct ataagatgag 420 

gtagtggtga actcagataa caaacttctc ttctaaactg gttctgcttc taagacaagc 480 

atctcctgcc ctctctcctt cctccccatc tctcgcacgc agtctagaga tggactgagc 540 

cttgcttctc actggcagtg t 561 



<210> 718 
<211> 530 
<212> DNA 
<213> Homo sapiens 



<400> 718 

aacatctggg gacagcggga aaacatgagt gactccaagg aaccaagggt gcagcagctg 60 

ggcctcctgg aagaagatcc aacaaccagt ggcatcagac tttttccaag agactttcaa 120 

ttccagcaga tacatggcca caagagctct acagggtgtc ttggccatgg cgccctggtg 180 

ctgcaactcc tctccttcat gctcttggct ggggtccttg tggccatcct tgtccaagtg 240 

tccaaggtcc ccagctccct aagtcaggaa caatccgagc aagacgcaat ctaccagaac 300 

ctgacccagc ttaaagctgc agtgggtgag ctctcagaga aatccaagct gcaggagatc 360 

taccaggagc tgacccagct gaaggctgca gtgggtgagt tgccagagaa atccaagctg 420 
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caggagatct accaggagct gacccggctg aaggctgcag tgggtgagtt gccagagaaa 480 
tccaagctgc aggagatcta ccaggagctg acccggctga aggctgcagt 530 

<210> 719 
<211> 512 
<212> DNA 
<213> Homo sapiens 



<400> 719 

aataatgctc ccatcaaatg tcatcaactt ctcccaagca gagaaacccg aacccaccaa 60 

ccaggggcag gatagcctga agaaacgtct acaggcaaaa gtcaaagtta ttggggtgca 120 

tagcagcctg gctggaagat tctgagtgct ctgtctgccc tggtgggttt cattctcctg 180 

tctgtcaacc cggctgcatt aaatcctgcc tcattgcagt gtaagttgga cgaaaaggat 240 

ataccaacca gacttcttct ttcttatgat tatcattcac cttacaccat ggactgccat 300 

agagccaaag ccagtctggc tggaactctg tctctgatgc tggtttctac tgtgttggag 360 

ttctgcctag ctgtgctcac tgctgtgctg cagtggaaac agactgtctg acttccctgg 420 

ttactcatgt cctcaaagac gactcatgay gctggatatg aagaactatt gacttcttgg 480 

gaaaaaacgg agaaatatta attggaaagt tt 512 



<210> 720 
<211> 507 
<212> DNA 
<213> Homo sapiens 

<400> 720 

acctcgccag ggccggggcc gggacgggac ggaaaactgg cggcttggtt tgttcagcgc 60 

gtcgcgtcat ggctctcgtt tcaggcctcc acggtgtgca cctggagtca ggagcgcgct 120 

tgtgcgtaca gttcggttca gttactaaaa gatgtaactg aactgcagat ccttggtgaa 180 

atatctttca acaaatctct atatgaggga ctgaatgcag agaaccacag aactaagatc 240 

actgtcgtct tcctgaaaga tgagaagtac cattctttgc ctatcatcat taaaggcagc 300 

gttggtggac ttctggtgtt gatcgtgatt ctggtcatcc tgttcaagtg tggctttttt 360 

aaaagaaaat atcaacaact gaacttggag agcatcagga aggcccagct gaaatcagag 420 

aatctgctcg aagaagagaa ttaggacctg ctatccactg ggagaggcta tcagcagtcc 480 

tgggacttgg agacccagca tcctttg 507 

<210> 721 
<211> 408 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_f eature 
<222> 261 

<223> n=a, g, c or t 



<400> 721 

acctttttta ctttcacagc aatagtgcag aatccagaat ggatgtcctc tttgtagcca 60 

tctttgctgt gccacttatc ctgggacaag aatatgagga tgaagaaaga ctgggagagg 120 

atgaatatta tcaggtggtc tattattata cagtcacccc cagttatgat gactttagtg 180 

cagatttcac cattgattac tccatatttg agtcagagga caggctgaac aggttggata 240 

aggacataac agaagcaata nagactacca ttagtcttga aacagcacgt gcagaccatc 300 

cgaagcctgt aactgtgaaa ccagtaacaa cggaacctgt ggaagaaggc tgctatgact 360 

ctttggatgg gagtctggca agaggaaatt ggaagataaa ataaataa 408 



<210> 722 
<211> 371 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_f eature 
<222> 281 

<223> n=a, g, c or t 
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<400> 722 

agagaaagat 

gacgctggtc 

gaagaaaaag 

gaccttcata 

atgggcactg 

tctgccaaga 

gcacttccca 



gttccaggca 
attatcttgc 
tggggcatcc 
agttcttatt 
ggcattctta 
caagaacttt 



gagggaagag 
tgcatgtatt 
tccttatcgt 
atggaataaa 
gctgcgggag 
cttctttacr 



gcctctttcc 
ctggggcatt 
tctcctgacc 
cctggcgtca 
gcactgccga 
accagcgctc 



tccaggcctg 
gtattttttg 
cacctgctgg 
satttataat 
nagcctgaaa 
cagataacct 



gagctttcat 
atggctgtga 
tgtcagccca 
cctggtgctc 
ctctgcctgc 
cagggaacca 



60 
120 
180 
240 
300 
360 
371 



<210> 723 
<211> 404 
<212> DNA 
<213> Homo sapiens 



<400> 723 

aagctaggag 

cgcagggggt 

cgactttgat 

ttagtagtga 

cgcccgcctc 

ctcccctctg 

aagtacgagc 



ccgcgcragt 
cgctgaggcg 
gatgtggakg 
cggggttcac 
ggcctcccaa 
gggagccgac 
gagcccgcgt 



cgtagtgtcg 
agggtgtcat 
aggatraagg 
cgtgttagcc 
agtgctggga 
cgcagccaac 
gctgggcacc 



ctgtttgcgg 
gtcagacaac 
gctagatgac 
aggatggtct 
ttacaggaag 
cagaagcgaa 
cgagcgctcc 



gtctccgcgc 
gaggacaatt 
ttggagaatg 
cgatctcctg 
gccaggagaa 
tcaccacacc 
agat 



gggaccgggg 
ttgatggcga 
ccgaagagtt 
ccttgtgatc 
tgtcgagatc 
atacatgacc 



60 
120 
180 
240 
300 
360 
404 



<210> 724 
<211> 486 
<212> DNA 
<213> Homo sapiens 



<400> 724 

aaagtggaag 

ggggctggcg 

gtcgctagac 

tcaagcgact 

gttcaactaa 

ctcgatctct 

atgtgagcca 

cactgttgmt 

aagaga 



gcaacgacac 
ggcgggatta 
tggagtgcag 
ctcctgcctc 
tttttgtaat 
tgaccttgtg 
ctgcgsctag 
gctagtgttg 



acggcccgtg 
gctgccggag 
tggcgcgatc 
agcctcccaa 
tttagtggag 
atccacccac 
ccagaaaaat 
aatccagaam 



gcgccttcag 
gctgagtttc 
ttggctcact 
gtagctggga 
acgaggtttc 
cttggcctcc 
atttattgat 
catagctgga 



ctagggagaa 
cgcagcctga 
tcaacctcca 
ctacaggtgc 
accatgttgg 
caaaggtact 
cctaaagtgg 
catgggatca 



agagaggacg 
gtctccatct 
catcctgggt 
gcaccaccat 
ccaagatggt 
gagattahag 
gagtcagcgc 
gaggatgaac 



60 
120 
180 
240 
300 
360 
420 
480 
486 



<210> 725 
<211> 374 
<212> DNA 
<213> Homo sapiens 



<400> 725 

aaaaaaaaag 

gagccagtct 

agtagggatg 

atgttgacag 

gccaggtctt 

tctggcaacc 

tcgtaggtct 



tcctagctag 
tcagatgatt 
agctgtcccc 
tcgctctcct 
cctcctatag 
agttggacgg 
tear 



ggtcccaagt 
ccagcctcag 
acagagacct 
agcccttctc 
tggagagtaa 
scccatcacc 



gacagccagc 
ccttcaggcc 
gcccagtgca 
tgtgcctcag 
ggaagtggtg 
gccctscggt 



atcaactata 
actcagctgg 
cattgtgaga 
cctctggcaa 
gtggaaagcg 
ccgagtcaac 



acatgtaaag 
tgecaaatag 
aagccccaga 
tgccattcag 
attctctcat 
acatactaca 



60 
120 
180 
240 
300 
360 
374 



<210> 726 
<211> 523 
<212> DNA 
<213> Homo sapiens 



<400> 726 

aacaagatgg cggcgctgcg ggaeggctag cggccctgcg tggaggegag gaatcegcat 60 

ctatggagat gtccctgcat cccatgactc ggagctgatg gecttcatga cgaggaagtt 120 

gtgggacctg gagcagcagg tgaaggecca gactgatgag atactgtcca aggatcagaa 180 
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gatagcggcc ctagaggacc tggtgcagac cctccggcca cacccggccg aggcaaccct 240 

gcagcggcag gaggaactgg agacgatgtg tgtgcagctg cagcggcagg tcagggagat 300 

ggagcggttc ctcagtgact atggcctgca gtgggtgggc gagcccatgg accaggagga 360 

ctcagagagc aagacagtct cagagcatgg cgagagggac tggatgacag ccaagaagtt 420 

ctggaagcca ggggactcat tggcgccccc tgaggtggac tttgacaggc tgctggccag 480 

cctgcaggat cttagtgagc tggtggtaga gggtgacacc aaa 523 



<210> 727 
<211> 492 
<212> DNA 
<213> Homo sapiens 



<400> 727 

attttgcgaa 

ttcggacccc 

cttcgccaga 

tgccgctcct 

tccactgctg 

cccctgaaga 

ggcccctggg 

cctgggtttg 

tatttcttga 



cggcgagcag 
agcggccgga 
gagaaggaag 
aagcctgcac 
tagtaacccg 
tcccaggtgg 
gccacccctt 
tccgcatgtt 
ag 



cggcggcggc 
tggtgaaatc 
gggataaacc 
agccgcggcg 
ggtccggggc 
gcgaggagga 
gtcagccggg 
gtaatcgtgc 



gcggagagac 
ctccctgcag 
cagcgccayc 
gcagcagcag 
ctcggtggtg 
ggagtagggy 
tgggtaggaa 
aaataaacgc 



gcagcggagg 
cggatcctca 
catccacgcc 
tgagagttcc 
ctcctgatgc 
bgcctcgggg 
ccgtagactc 
tcactccgaa 



ttttcctggt 
atagccactg 
agccgcacca 
agggtctccc 
ccctcaccca 
ctgggcatcc 
gctcatctcg 
tttagcggtg 



60 
120 
180 
240 
300 
360 
420 
480 
492 



<210> 728 
<211> 467 
<212> DNA 
<213> Homo sapiens 



<400> 728 

acagggccac 

gcctggctgg 

tgggagccct 

aagccacctc 

cttttctgga 

agagaacttc 

tatccaatct 

cctttccagc 



tgctgctcac 
gactctgatg 
gcgtggaggt 
tctaccttaa 
ctatcaagtt 
cccattggac 
ttcaaattac 
aacaagattc 



agaagcagtg 
cccagccatg 
gtgaaatcca 
tattgacagg 
tacagaagct 
atctcaaaac 
ctgagtattt 
aaagtattta 



aggatgatgc 
gccttcctct 
gacaattgaa 
aaaccccatc 
ggtggctgtg 
tttgaaagaa 
ttctaatctg 
ttgcacagac 



caggatgatg 
cctgcgtgag 
gatggggcat 
cagagtttag 
gagacaaatc 
cttaatgtgg 
accaatctag 
ttgcggg 



tctgcctcgc 
accagaaatc 
atcagagcct 
ccctgggagc 
tagcatctct 
ctcacaatct 
agcacttgga 



60 
120 
180 
240 
300 
360 
420 
467 



<210> 729 
<211> 528 
<212> DNA 
<213> Homo sapiens 



<400> 729 

atctttggga 

tcaatggagc 

tcatttgcca 

tgttagggga 

gctttggaga 

tcattatcac 

aagaacaact 

ggtattaatt 

ctatcaataa 



gctgcatttt 
aaaaacatgg 
actaacaaac 
aaaaaaaata 
gaagtaatcc 
cattatcatc 
tgcttttgct 
tgctttttat 
tttagaagaa 



tgataaataa 
aattaaagaa 
tgagtctaca 
gtactaatgt 
acttttactt 
accaccacca 
agccaaagca 
ttaattataa 
ttaaatataa 



gaaaaacctc 
ccacagttca 
tctctttatt 
tggatttaga 
gagatttcca 
ccaccatgag 
atgggaacac 
caaaagcgaa 
atataaaata 



ctcattttaa 
actccatcaa 
tgcaatgtaa 
agtccggagt 
atctcatatc 
caaccaacat 
agcactctat 
atacatgttt 
aaaaccaa 



tgaagtagtt 
tgaagtaaca 
aaccccatcc 
tcaattttag 
accaccatca 
catcatcatc 
tagggggcaa 
ttgtaatttc 



60 
120 
180 
240 
300 
360 
420 
480 
528 



<210> 730 
<211> 379 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_f eature 

<222> 221,233 

<223> n^a, g, c or t 
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<400> 730 

gtcggcaggt 

gagacagggg 

aacgtccagg 

cgcttgctgc 

tgggksctgc 

aaggcaggaa 

agctgctgca 



ctgaggcggc 
cggccgagct 
atttcctgcg 
gccacccatc 
aggtattcaa 
cttgaagaga 
gctgagaar 



gctgtgtgtg 
gtatgaccag 
cgttctcttt 
ggaccgcatg 
aacctttgac 
aaatcagaag 



tgaagcgtac 
gcccttttgg 
ggcttcctct 
ggcttycccg 
cacatgrccc 
aaaggaagag 



ctagggcggg 
gcatcctgca 
accgcaagac 
nccbkrggcc 
cgtcagratg 
gaagaggcca 



aggcgacatg 
gcacgtgggc 
agacttctat 
ggngaagccy 
atgagaagag 
agactgtgtc 



60 
120 
180 
240 
300 
360 
379 



<210> 731 
<211> 103 
<212> DNA 
<213> Homo sapiens 



<400> 731 

aaagcaaccg cgtggtacga gtaacaactc caccggctgg cgtccgckgc gcttggcsac 60 
aagtcacttt tccagatcta ggagcaccat ggacacctcg tea 103 



<210> 732 
<211> 842 
<212> DNA 
<213> Homo sapiens 



<400> 732 

actgtttacc 

aactgtacat 

ttttaggaaa 

tgcctccatt 

agtttttatt 

atcagcacaa 

ctatggttca 

aatcttgacg 

aggctgggta 

attttcccac 

tgacacaagc 

agecaattet 

acttctccca 

tetgatgeca 

ta 



aaacaccttg 
tcacaactta 
attatagtga 
atataggt cc 
atttcttctt 
agectgatag 
agtcactaaa 
gtttctgccc 
ctttcatact 
atacattttt 
agctaataac 
ctgggtgtcc 
ctcttggcat 
aagggttaaa 



gtcataataa 
tgtttctttg 
aaatttgaca 
ttccaggaac 
ttccttttgt 
tactttacta 
gattcatttt 
gtcctaatgg 
ttgttttctt 
acattgtacc 
catttctggg 
agtgagtggt 
gtaagaaata 
gcttcttgga 



tgtcattagt 
agattaatag 
gttgattggc 
tcaaacactg 
tttcataata 
aaatgactgc 
tgttgagtcc 
cagagctctc 
gttttgcttt 
ttaggactca 
tttctgecta 
ggcttttttt 
agcatttaca 
tttcatttca 



ttctccattt 
atattggggg 
ataatttctt 
taagtgaaat 
taatgcagtt 
attctttgga 
ttatgagaaa 
tgacttgggt 
aaaactacga 
gtcatctcca 
accccctaat 
ctttccacat 
taattggaaa 
ttgatataca 



ttattttctg 
aaaaacgect 
gtttgaatgc 
atgggagtat 
tgttcaggaa 
ttccttcagt 
cagcagtatg 
gtatgetgee 
ctcagcatac 
cttaaattga 
tgtctgttaa 
tggcacattc 
aatctggatt 
gecactattt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
842 



<210> 733 
<211> 351 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 333,345 

<223> n=a, g, c or t 



<400> 733 

ageaaegggg 

atccagggat 

tggacgtgca 

cgccgggacg 

gtctcgctgc 

gtcgccctgt 



tttcctgaac 
tcttctaccc 
gcggcctcct 
tgetgeatat 
tctccaagca 
ccaaagagtc 



<210> 734 
<211> 200 
<212> DNA 
<213> Homo sapiens 



cttcgggcca 
atggaaaaac 
cctcctgagc 
gcccggtttc 
ccgagggacc 
cagggatgtg 



cccgaatcca 
gtgcggagga 
tgeagcatet 
ctggcgggtc 
aagaagaege 
ganggaggca 



ccttctcaga 
agcacatgtg 
gcttgttctg 
cgtgtgacat 
cgtccacggg 
ccgangggga 



eggaaataga 
ggcgctggtc 
gtgggccaag 
gtccaagtcc 
cagcgtgcca 
a 



60 
120 
180 
240 
300 
351 



WO 02/083898 



PCT/IB01/00914 



391 



<220> 

<221> misc_f eature 
<222> 43 

<223> n=a, g, c or t 



<400> 734 

tttgcttgac tgcttcctcc taaccctcta sccactagca ckntacttcc taaagctgtt 60 

gtgtcattaa ctctgttgga tcaactctct gggaaaagat tctgttaatg taagtgcact 120 

tactccctgg atgttgtcac tagtctagtg gcttttgcta aataaacctt tcttatttct 180 

agaaaaaaaa aaaaaaaaaa 200 



<210> 735 
<211> 160 
<212> DNA 
<213> Homo 



<220> 

<221> misc 
<222> 118 
<223> n=a, 



sapiens 

feature 
g, c or t 



<400> 735 

aaaaggactt gaattctgat cgggctgcag acgctctgca ccccgcccaa gcttctatgg 60 

tactttggtg tcctctctag cactggcaag tcgccttcct cttcgcccgg cgccagdntc 120 

ccctctctat gatcgccgag ttcccggcgc cgtctcgttk 160 

<210> 736 
<211> 369 
<212> DNA 
<213> Homo sapiens 



<400> 736 

gcactggatg 

atccacgatg 

ctgaactcaa 

cagctgggac 

gggacagatg 

accatctgct 

acttcactg 



cacgcagcca 
tcctgcgagc 
agaagagaac 
gctgacacat 
gtggtccctg 
tagtggtcat 



catcagaaag 
tgatcaatca 
tggaataatc 
ggccccaact 
gtcactgaag 
gtggcagcag 



gcggttaaga 
gcttcacgct 
ctccctaatc 
ttttctcctg 
tacccacaga 
acagtatctc 



ggggcggggg 
ggaaataacc 
ccacttccca 
cctgatgtct 
acccatgcaa 
agaagtcctc 



aagcgctctg 
atataaaaac 
tggtgagccc 
ggacccacag 
gaggcacaaa 
gcagcatatg 



60 
120 
180 
240 
300 
3 60 
369 



<210> 737 
<211> 382 
<212> DNA 
<213> Homo sapiens 



<400> 737 

aaacacacag 

agactacaac 

ccatgtcatc 

tagcaccttg 

aggagcagaa 

gtattggaga 

ttaaaatcaa 



atcgtcccca 
cccaaggacc 
tttgcctact 
tgtgtatgtg 
atgaaagagg 
agttttaaaa 
aaaaaaaaaa 



ggctgctgtt 
gcctgctcta 
gacacccttg 
tgtgcgcgca 
caagaaataa 
aacccagaaa 
ar 



cgagaatgag 
tgcctgggac 
tccccacaag 
cgtgtgtgta 
gtgctatgtg 
aacgcttttt 



tattcctata 
aatggccacc 
cagaagcaca 
ggtgggaggc 
gcgagaaaaa 
ttttttaata 



cgacccagat 
aggtcactta 
gaggggtcac 
aggcaacacc 
aagttttaat 
aagaagaaat 



60 
120 
180 
240 
300 
360 
382 



<210> 738 
<211> 406 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 197 



WO 02/083898 



<223> n=a, g, c or t 
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<400> 738 

agttctagag 

tacttcatcc 

tcttacatca 

agcgggagca 

akccacagca 

ccggaagaca 

accagtcatg 



gctgggaagt 
catgatggaa 
gtgacaacct 
gtgcggngca 
gttcccacac 
gctactccag 
gcagctccca 



ccgggattga 
ggtggaaggg 
ggacctcgac 
gcatgaaaca 
ctcgggcatt 
cagtgcsats 
cacctcaggg 



392 



ggggccagca 
taaagagagg 
atggaccagc 
caagcgcctg 
gaggcagaca 
caccgcaagc 
gtggagartg 



tctaccaagg 
agaagaagca 
tggaaaaacg 
gtsccgkcca 
ccaagccccg 
tgaaaacctg 
gcggca 



gccttcttgt 
gtaccgggaa 
gtcgcgggcc 
tycaacsgcm 
ggacacgggg 
cagctcaatg 



60 
120 
180 
240 
300 
360 
406 



<210> 739 
<211> 502 
<212> DNA 
<213> Homo sapiens 



<400> 739 

cactttgaaa 

atttgctctg 

attgccgggg 

aacccacctt 

ccttttttgc 

agttcagtcc 

agttgtatag 

ccccttccta 

caatggtacc 



gaaaactgcc 
tgtgtttttg 
ttggggcaca 
agttccacat 
ccttggctct 
cacatttctg 
aaacagaacc 
tcaatcacct 
agggagggag 



attttgtatt 
tgtgtttgtt 
attgtggact 
ttggctgcat 
tgacactcta 
tactgttcct 
atggatggat 
gatagcaaca 
ca 



taatagcctc 
gatgtgcgtt 
ggggccatga 
cttgaattat 
aacccctgga 
ctttcacagc 
ggccaggatt 
gggacagctg 



caggttcttt 
tgtgcatatg 
ggcttccctg 
gccgactcca 
ccatctgaat 
tggaatatgt 
gccgtggtcc 
ccaataccct 



cagtaatgtt 
cgtgagtttc 
gtccccactg 
gacttctcct 
ggagcagcca 
cacatgatga 
ctagctagat 
gctctttact 



60 
120 
180 
240 
300 
360 
420 
480 
502 



<210> 740 
<211> 367 
<212> DNA 
<213> Homo 



sapiens 



<400> 740 

cagcaagccc 

atcgccatcc 

tacacctgct 

gtgagaagat 

gttccagagg 

ctgggacacg 

ttttttt 



aggcacttcg 
tgtctatcct 
tcaacagctg 
acaccacgca 
ggcccacgtg 
ccctgaagct 



ggctgtcccc 
cctggccgtc 
caggagcact 
cctcgcgttc 
tccctccacc 
tctggaccct 



ggcccccaga 
ctcctcacgg 
tccctatcag 
agcactcctg 
tcctccctgg 
gagagagatt 



gcccgggcac 
tcctcctggc 
gcccagagga 
ccgagggggc 
cccaggctgc 
gtttcttttc 



cgtggtcatc 
tgtgctcata 
ggcagggagt 
ttcctgaggg 
agagcctgag 
tttactattt 



60 
120 
180 
240 
300 
360 
367 



<210> 741 
<211> 427 
<212> DNA 
<213> Homo sapiens 



<400> 741 

aatgttcttg 

ccattcttgg 

gcctttggga 

ccttcttgcg 

caagggcggg 

caaggccagc 

taagaaaggc 

ggctttt 



tgaatcttag 
tggattttat 
ttttctaccc 
caasgcagtt 
gccacatcca 
cgccccctgt 
ttttctctcc 



<210> 742 
<211> 408 
<212> DNA 
<213> Homo sapiens 



acaaccccca 
gagatgtcaa 
acccagcaac 
gtaccatgaa 
ctgcagctga 
cttccctctg 
agtcattctg 



caacatcatt 
cactcctcat 
caccagaggt 
tgcggtcgtc 
gctcagagcc 
gagctgtttg 
cttgctgatc 



ccctgcttta 
taggattccc 
gcagaaaccc 
ctgctgcaga 
ttctgtgggg 
aggaatgcag 
aagaacactt 



tttgcttagt 
tccttctgaa 
ctgcagacat 
ccacgcccac 
tctgcagtgg 
ttcttttgca 
aacagcttca 



60 
120 
180 
240 
300 
360 
420 
427 



<400> 742 

tcatctatgt ttatcacatt ttaataccac agcacttata atgatgtcac tacatataga 



60 
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agctcaaagt 
tccagtgtag 
aaagtgtctg 
aatgtatgcc 
aagattaaga 
agtcaaagta 



taagggattt 
ctggccactt 
ctttttttat 
agcaaaatgt 
aaatgatact 
ctaaaaaatg 



gctgaagact 
atcccagggc 
ccttaaaatt 
gagctctgta 
gggaattttc 
tactagatca 



393 

gtaaagttaa 
tttttctact 
cacctgtcag 
ttttttggca 
tttttcctga 
ttaagactta 



<210> 743 
<211> 449 
<212> DNA 
<213> Homo sapiens 



<400> 743 

aacttgttat 

tcatcatgtc 

ggtggactgc 

cagatcttgc 

ggtgaaggcc 

tgggcttttc 

gcccaagacc 

tttctcttct 



ctattgcagg 
cctttaccag 
tttggacaaa 
tgtttcaacc 
tgcagtcagc 
tgtgccatgt 
accataatca 
ttctctgaaa 



aaagtgcctg 
gagggcccgt 
tggacctgcg 
aaaacctcat 
agaatatgtc 
gttctcccag 
tcattgatta 
aaaaaaaar 



tggaagaatt 
tcatcacaag 
ggaggcatta 
ttcttatgtt 
aactttgaat 
tgtgctct 



gagacaaaaa 
gaatgttttg 
aaaatttgga 
tgggtttaat 
caccctagta 



caatatgcca 
gtcataagcc 
gtaggagaga 
gctgaccaga 
aatgttttgg 
ggccagcaca 
ttcctctcca 



ttgtgacaag 
actatataaa 
gggacaacag 
gttgaggaac 
ttgaagaagc 
aagawgggct 
cgccagktct 



gcaccatcaa 
gggaatgatg 
taggagcagg 
agaagaagat 
tgcttgaatc 
tttggtgcag 
ctaaataaac 



120 
180 
240 
300 
360 
408 



60 
120 
180 
240 
300 
360 
420 
449 



<210> 744 
<211> 207 
<212> DNA 
<213> Homo sapiens 

<400> 744 

atttccggtt ccggtgtcag ttcgaggcgc cgccgccgcc gccgcagccg ccggagccgc 60 

aatgcctaaa ggaggaagaa agggaggcca caaaggccgg gcgaggcagt atacaagccc 120 

tgaggagatc gacgcgcast gcaggctgag aagcagaagg ccagggaaga agaggagcaa 180 

aaaaaaaaaa aaaaagctta tggatac 207 

<210> 745 
<211> 208 
<212> DNA 
<213> Homo sapiens 

<400> 745 

agaacatcct aatcrwaaaa gttcgaggcg ccgccgccgc cgccgcagcc gccggagccg 60 

caatgcctaa aggaggaaga aagggaggcc acaaaggccg ggcgaggcag tatacaagcc 120 

ctgaggagat cgacgcgcas tgcaggctga gaagcagaag gccagggaag aagaggagca 180 

aaaaaaaaaa aaaaaagctt atggatac 208 

<210> 746 
<211> 442 
<212> DNA 
<213> Homo sapiens 



<400> 746 

cgtcatttag 

ggtggtttga 

gcccctgtgc 

acaaccctgt 

agaatatata 

aattcattca 

gaagtcttaa 

aatccgttat 



gcttttggga 
gtagtctggc 
cccaaacatt 
ggatgtaaca 
tgaacatctt 
tttatacatt 
gaatggagcc 
gtacagtggg 



tttgttggta 
gttctctagg 
cagctcagga 
ggagtcagca 
ttgagtcttg 
cagccccttg 
tggggtccaa 
tc 



tggaaaaata 
acactgtggg 
tggctgctga 
gttttgatca 
ctggtattgt 
tttaaatatg 
atgagagccc 



catggattct 
gaggtaggac 
gctkggkttt 
ctggttcatg 
cattcattct 
tgcttcctgc 
tcagtggctg 



catggtactc 
gcagttgtct 
gggaaaactc 
gaaacatgtg 
ttcaatcact 
cggtcagcag 
ttcaggggsc 



60 
120 
180 
240 
300 
360 
420 
442 



<210> 747 
<211> 487 
<212> DNA 
<213> Homo sapiens 



WO 02/083898 
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<220> 

<221> misc_f eature 
<222> 221 

<223> n=a, g, c or t 



<400> 747 

tctgtggaaa 

atgctgtcct 

ctactcctgc 

atgagcaggt 

ggctgcccag 

gaccagcctg 

aggagtttga 

aatccagcgt 

tgcttka 



ggcgaggtct 
gtttccccca 
cttcttccag 
gccatctgta 
ccccaacacc 
stgsscctgc 
gaccagcctg 
ggaggcatgc 



ggaaagcagc 
ctcsscaaca 
ctggcattcc 
tccaccacat 
acgaggcacc 
tggaccttag 
agcaacatgg 
acctgtggtt 



ccctcccctg 
gggctggcst 
ctggcagcga 
actcctgctg 
ctcgggcatg 
gaggccaagg 
tgaaactccg 
tcagctactc 



ggagagagcs 
ccggctctgt 
tggcggggaa 
ntgtggtgtg 
ctgttgcttc 
caggtgaatt 
tctctacaaa 
aagaagtgag 



cagggcaggc 
cataccctat 
actggcagac 
gcgaagacaa 
tgggaccagt 
gcttgagtcc 
aaacaaaaaa 
gcaggaggat 



60 
120 
180 
240 
300 
360 
420 
480 
487 



<210> 748 
<211> 601 
<212> DNA 
<213> Homo sapiens 



<400> 748 

aaaaagcgcc 

gctggcgctg 

aagcggtgga 

actcccgttc 

ggactcggag 

caaagccgcc 

tctctctctg 

ttcagaaact 

aatggttcca 

gttgaacgaa 

a 



attacgcaga 
gaggcttcgg 
ctcttatcgt 
gggaacccaa 
ctgtggggtg 
ctcggggcct 
gccttggctc 
cgttttcatc 
aagaaaattc 
gccaggttcc 



gagaaagtta 
cgtcacgtgc 
gggagggctc 
cggcagacag 
tggggaggcg 
catggcggga 
tgtgactatc 
ttcttggttt 
tcacaataag 
taatgagaaa 



cgaggttcgt 
tggtctggat 
ttgatctgtg 
gtcctagtgc 
gaggcaccaa 
cgctcctggg 
aggtcctcgc 
catcttaata 
gctcggacgt 
gtgggctggc 



ggccgcggtt 
ttttctcgat 
atttatagat 
ccatcagata 
ctaagagcga 
aaaggcttta 
gctgt cgcgg 
ccaacgtcat 
ctccttaccc 
ttgttgagtg 



tccccaggca 
gcactgggga 
aggcacagct 
cccgcggccg 
cctagcatcg 
gcgcggtgtc 
catccaggcg 
gtctggttct 
aggttcaaaa 
gcagactata 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
601 



<210> 749 
<211> 210 
<212> DNA 
<213> Homo sapiens 

<400> 749 

aaaaattaaa cctcaccgmc ggattcaatt ctctcgttct caggccgtct tcatcgccgg 60 

agsgtttwct tacgaaaaga atgggcttct ggacattgtt aaaaggtctg ctgctctttg 120 

caaatgcact ggcaatcctc aacgaagacc gtttccttgt tccaagagga tggacactcg 180 

gagaactcca ccaaaccggt agaagaaast 210 

<210> 750 
<211> 380 
<212> DNA 
<213> Homo sapiens 



<400> 750 

tgacataaaa 

aatagggcat 

aaaggtttaa 

gacggaggga 

atcctcttgt 

tgaagacagt 

taaattattt 



ggttaaagaa 
tttaaaggct 
ttaaccctgg 
gaagagccaa 
gtgctaggta 
gcagcagaac 
caatctttat 



caggcaacac 
ggaaccagtt 
agatctgcaa 
ggacaaggtc 
gtgggcgtgc 
attgaagtcc 



aatgagcact 
cagaggaaac 
atggagggag 
aggttgaat k 
atctggatct 
tgattgcagc 



taagttttta 
aagggtttgg 
gggtgaagga 
agtwagrggk 
gagaggcagt 
agtgactctg 



acatgtgggg 
gtagaggtag 
ggaatcttaa 
gtgtacacah 
aaagcagtga 
caaccttgga 



60 
120 
180 
240 
300 
360 
380 



<210> 751 
<211> 245 
<212> DNA 
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<400> 751 

actcttctga ggatccggca agatggcaga agtagagsag aagaagaagc ggaccttccg 60 

caagttcacc taccgcggcg tggacctcga ccagctggca rsatggtggg sgtctacaac 120 

ggcaagacct tcaaccaggt ggagatcaag cccgagatga tcggccasta cctgggcgag 180 

ttctscatca cctacaagys cgtaaagcat ggscggsccg gcatcggggc cacccactcc 240 

tcccg 245 

<210> 752 
<211> 369 
<212> DNA 
<213> Homo sapiens 



<400> 752 

aagttagtac 

caacaccacc 

ctcctggtca 

ttctttcctt 

gatactgtac 

tcatgtgtag 

aaatgttgg 



actgaaattc 
accactgaag 
gtgtaattct 
ttgtggaaaa 
aaattatttg 
aacggggctg 



aaagtcatgc 
tatttttagt 
aatctcacgg 
tkgtcattct 
caacacagga 
gtccactcct 



tcataactgt 
tatataagat 
atatcctctg 
gaatcttcag 
tacagacttc 
cccaaatgca 



taatgaaagc 
tggaactacc 
ttgggggata 
gvcaaacaca 
aaaacaatga 
rgtccactat 



agattcaaag 
aagcatgtgg 
aggactgtgt 
tctggaaggt 
gaacaacatt 
ttctgcagaa 



60 
120 
180 
240 
300 
360 
369 



<210> 753 
<211> 491 
<212> DNA 
<213> Homo sapiens 



<400> 753 

accaggtgcc 

cccatctctg 

ctgcagctat 

ggtgcmrtry 

gcttgggagc 

tcaaagagac 

tcgcactcca 

cttaawsgaa 

tgccatggca 



cagtcttcca 
ctcaccatgt 
gcctttatgt 
ggagagcact 
aaatggctct 
agtgagaaga 
ctaaagaaaa 
ctygargtgg 



gttgcgaggg 
ggacgctgaa 
tctcttctct 
ttcggattcg 
ggcttyt ktt 
ataaggagca 
atcaaaatgc 
agcttwtgaa 



caagcaaacc 
atcgtccctg 
gagacagaaa 
gcagaaycta 
tgytgttgtg 
gagtcctcct 
ttctctttac 
atttgtgtcc 



cgtcatgagc 
gtcctgcttc 
actagcgaac 
ccagagcaca 
ccgtttgtga 
ggccttcgag 
aaagactgtg 
raagtgcrra 



aactcccttc 
tgtgcctcac 
cccaggggaa 
cccaaggctg 
tactgmagtg 
gcttcccatt 
tattcaatac 
atcttaaasg 



60 
120 
180 
240 
300 
360 
420 
480 
491 



<210> 754 
<211> 336 
<212> DNA 
<213> Homo sapiens 



<400> 754 

ataaatgctt 

tgtttttctc 

gttgtgtacg 

tccagaacct 

accactctaa 

gktayaaacc 



tccaatcagt 
aactatgaat 
ggataaagac 
atcacaatga 
aatgcaactc 
gtciaaaatta 



gctcccattc 
ctgacttcac 
tgtgagcccc 
atgacacgaa 
taataaagaa 
tgtatagtct 



ttcattctat 
tcccattctt 
tggagttcag 
gaatatattt 
aattactgcc 
gataaa 



tcaccactta 
gaaaacctcc 
ggaccattac 
gataaaaatc 
tawgtcrgdd 



gttactagaa 
tgtagttcct 
ttaatcatat 
tgatgaatga 
s kg km teams 



60 
120 
180 
240 
300 
336 



<210> 755 
<211> 497 
<212> DNA 
<213> Homo sapiens 



<400> 755 

ataaatgctt tccaatcaat gctcccattc ttcattctat tcaccactta gttactagaa 60 

tgtttttctc aactatgaat ctgacttcac tcccwttctt gaaaacctcc tgtagttcct 120 

gttgtgtacg ggataaagac tgtgagcccc tggagttcag ggaccattwc ttaatcatat 180 

tccagaacct atcacaatga atgacacgaa gaatatattt gataaaaatc tgatgaatga 240 

accactctaa aatgcaactc taataaagaa aattactgcc tattcagttg gaactgtacc 300 
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aygttacaaa cgtaaaatta tgtatagtct gataaagaaa tactrtaata gccttggaaa 

cagctcacat acaatttttt ttataaatgg attcattacc atataactgg accagcttac 

tgaagagtag ccctacagca gtcacttcgc agtttgcaat tcatttgagg ttcaaagcta 
gctcaaggga aaacaac 

<210> 756 
<211> 400 
<212> DNA 
<213> Homo sapiens 



360 
420 
480 
497 



<400> 756 

atcattttcc 

tactgcagcc 

attcatactc 

atcacagcaa 

gctatcgtgg 

tgacacctgc 

aataaattcc 



ctgcaggaca 
acctcgcaac 
ctggtgactt 
agcccagaaa 
ttgttgcagt 
caatgacaga 
cccaaagctc 



cccggcctcg 
ccaccctggg 
actgctgcag 
acagaaaggg 
ctggccgacc 
tgcaagccca 
caaaaaaaaa 



gtcctgcttt 
cagctgctaa 
aataattggt 
atgatcatgg 
aactgatggc 
acaccctttt 
aaaaaaaagg 



agtattcatc 
aagggggaaa 
ttctgtttgc 
tgattttact 
agtaaagaga 
ggtacgcaaa 



ccagcttcat 
aattaaagca 
aaggaacatt 
gctgcttgtg 
ccaccagcag 
acctgctctc 



60 
120 
180 
240 
300 
360 
400 



<210> 757 
<211> 524 
<212> DNA 

<213> Homo sapiens 



<400> 757 

acttcctcgt 

gagttcgagg 

actgcgccaa 

ttgaagtgct 

agaaagcact 

tgaaaaggga 

aggcgctgga 

csgtactcta 

tctgcctgca 



tgcccccgcc 
actcggtgtt 
gctctgcgag 
gactctcaag 
gcaggagtat 
tgtccaggaa 
gattgctgca 
cctccagctt 
gaaactgatt 



gcgggccgag 
cgaggagagg 
ccgcagtggt 
aaattcaaag 
tccagtatct 
ggtcaggctc 
aacttggaaa 
gctatttgtt 
tctttgcatc 



atggattccg 
ccggagcggc 
tttatgaaga 
gagacctggc 
ctgaaaaatt 
ggtgtctggc 
ataaagcarc 
caagtttgca 
cttttaatcc 



ggtgctggtt 
ggtcaggacc 
aacagaaagc 
ctacagacga 
gtcatcaacc 
tcacctgggt 
carcacagac 
gaacttggag 
ttgg 



gttcggcggc 
gccgcgtcct 
agtgatgatg 
caagagtatc 
aattttgcca 
aggcatatgg 
catttaacca 
aaaacaattt 



60 
120 
180 
240 
300 
360 
420 
480 
524 



<210> 758 
<211> 543 
<212> DNA 
<213> Homo sapiens 



<400> 758 

aagaagggag 

gcggtttccg 

cgttggtgtc 

acaagaagac 

caaaacgacc 

tacatcgtca 

aastgccaaa 

aaattgggtg 

atattggttg 

tga 

<210> 759 
<211> 850 
<212> DNA 
<213> Homo 



gtggtcgccc 
agcgttggtg 
tggcacgcgc 
aagaaaatga 
actgaaaaca 
tagccggaaa 
agtgacatgc 
ttggatgcaa 
gactctgcat 



tccgtcgtgg 
tctggcggtt 
caccctctct 
tttgaggaca 
acgccggtgg 
cagtcagagc 
agggatgtcc 
agctacagca 
gataagcacc 



tctggcgtgt 
tccgaccgtt 
tgctttggtt 
gcttcaatcg 
caaaatatcc 
caccagccaa 
ttcctgagat 
cgtctttcct 
gagaagtccg 



attccgagcs 
ggtgtctggc 
gcgccatgcc 
cggtgtgaag 
aaagaaaggg 
tgatmttttc 
ccgtgctatc 
caccgacagc 
cgtgaagtgc 



ttggtgtctg 
ggtttccgac 
gatgtaccag 
aagaaagcag 
tcccaagcgg 
aatgctgcga 
tgcattgagg 
tatttaaaat 
gtgaaggctc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
543 



sapiens 



<400> 759 

aagtttcatg 

tgggagcaga 

ctgcccgttg 

ctgcccctgc 

gcgggagggg 



ggttcctgga 
agccagtttc 
gggtccagct 
ggcccctggg 
cctcctcgga 



cgctcagcca 
ccgccatctg 
gaggaaggag 
gcgggtgctg 
gaagcggggc 



ggggtagtct 
ctctccgtgg 
cgcggat ccc 
ctgcggcgcc 
gcggtcccaa 



ctgaagggcc 
aggcagtgct 
aggctcgttc 
agctccaagg 
ctacgcagag 



gcctggcaat 
ctcgcggctg 
tttgcctggg 
gcgkatccag 
gctggcacgc 



60 
120 
180 
240 
300 
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cgaccctcca 
cgccaacccc 
ctccccatca 
cctgcccgtt 
gcagaggacc 
ccagcctcct 
gaggagacag 
ttgtatcact 
atctgcagca 
gtgaggtcac 



cacctcacca 
acctagatgc 
tcgtactagg 
tctgcggact 
cattcattgc 
atggccagga 
gtggcagtga 
gcaagcgctt 
ccaccaatat 



cgcccccatc 
aaagcaggat 
gaggaagaag 
tgttctgagg 
ttctgctttt 
tggcatgtac 
tcgctactgc 
caacaccttc 
ccaatgcaag 



397 

tccgtccgtg 
tcaaaagaac 
cgggtgagaa 
ccgaggcacc 
gctgctgacc 
cagcgattcc 
aacttgatga 
atccatgaag 
aacggcaaga 



tacacacact 
atctttgcgt 
acaaaacttc 
tctaagatac 
ctgctggggc 
tgcggcaaca 
tgcaaagacg 
atatctggaa 
tgaactgcca 



cacacaagga 
tttctaccgg 
tttccattgt 
tgatggctct 
tggggctggt 
cgtgcaccct 
gaagatgact 
cattcgtagt 
tgagggtgta 



<210> 760 
<211> 532 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_f eature 
<222> 524 

<223> n=a, g, c or t 



<400> 760 

ataaaacaaa 

actctcatgt 

aagtcacaga 

tggacttagt 

agagtgccgg 

ccccaccacc 

tggcaatgag 

catcaacgtc 

gcggctgcca 



atagtatgca 
agcattttat 
tatgtaaact 
agttttaagc 
gaggtggggg 
cctgacatcg 
caggatgctc 
accactcatc 
gcactgacag 



acttacagca 
ccatattgag 
gagatgcaga 
agaaccatga 
gcggcgcatc 
agaacgctga 
aggacctgga 
ttcccctcta 
caacaagcag 



atgtctctcc 
agctctgtga 
gacatcaata 
aaacctctgt 
cgcggcctcg 
gctcaccccc 
caccatgcag 
ccactatgag 
aacctgcggc 



cacttataat 
gatagaaatt 
acttgctcaa 
gacaactccc 
agcttgctac 
atcttgccct 
cggctgaaca 
aaaggcctgt 
agtntttgaa 



aaaacaccta 
attatattac 
ggttacaagg 
tccagctcca 
ctcagcccat 
tcctgttcct 
tcggctacgt 
tcaactacaa 
ga 



<400> 762 

aggtatctag 

ggccctgggc 

ctctggacct 

caaccggggc 

ggcggatcat 

tgtactaaaa 

tgggaggctg 

atcgcgccac 

aaacaaaaaa 

ccccag 



gtatcctccc 
taaccctccc 
caggttccct 
cggcgcggtg 
cagaggtcag 
atacagaaat 
aggtaggaga 
tgcactccag 
ctgcccgcag 



atctcaggcc 
cccatcacac 
acctgcaaaa 
gctcacgcct 
gagttcgaga 
cagccgggca 
atcgcttgaa 
cctgggtgac 
agagctgccc 



agactggaga 
aactcaacag 
cagtttgcac 
gtcatcttaa 
ccagcctggc 
tggtggctcg 
tccgggaggt 
acagtgagac 
cgaccacaag 



ttgagcgcct 
caacgcccaa 
ctcctccctc 
caccttggga 
caacaaggtg 
cgcctgtagt 
ggaagtcgca 
tccacctcaa 
gcctgaatcc 



gttgtatacg 
gccacaactt 
cacaaaactg 
ggcggaggtg 
aaaccccatc 
cccagctact 
gtgagccgag 
aaacaaaaca 
agccttccag 



360 
420 
480 
540 
600 
660 
720 
780 
840 
850 



60 
120 
180 
240 
300 
360 
420 
480 
532 



<210> 761 
<211> 275 
<212> DNA 
<213> Homo sapiens 

<400> 761 

tgtttgtttt gcttctacta agactgtttt gtttcaaaaa ggaaacaagt tttgtgtttg 60 

ctgtctacgc tggagtcctg aactgtgggt agaaaacacg acctggcttt gtagaaagga 120 

cacagggctg ttttatgaac taagcggtga ggctcaggtg gcggctctcg cagacccctg 180 

atgctgttgt tcttkgaggg cttaaggcct gatgaacgta ggcacgtgat gcataatagt 240 

cttcaatggt acacttaact agtctcttct gtgta 275 

<210> 762 
<211> 546 
<212> DNA 
<213> Homo sapiens 



60 
120 
180 
240 
300 
360 
420 
480 
540 
546 



<210> 763 
<211> 545 
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<212> DNA 

<213> Homo sapiens 



<400> 763 

ctggtgtctc 

ccacttctga 

agaaatccaa 

gggcaaaatc 

atctgtttta 

ttttaaagtt 

ttttagtaca 

tgtctaaata 

gtctgctttc 

tcttg 



tcaaaagcct 
tgattctgac 
gggggagagt 
tgtacgggca 
aaatgtgagg 
acctgaagct 
agacatcaaa 
gtgaccatct 
ttttgtcttt 



gatcctgcca 
tctaattttg 
gatgacttcc 
aagaaaccta 
cgattatttt 
cttaacttcc 
gtgaagtaaa 
catgggcatt 
aaacctgatt 



aaaccaagaa 
agaaaattgt 
atatggactt 
taaagtacct 
aagtaattat 
tcccctctga 
gcccaagtgt 
gttttcttct 
tttaagttct 



tcgccgcaaa 
ttcgaaagca 
tgactcagct 
ggaagagtca 
cttaccaagc 
atttagtttg 
tctttagctt 
ctgctttgtc 
tctgaactgt 



aggaagccat 
gtcacaagca 
gtggctcctc 
gatgaagatg 
ccaagactgg 
gggaaggtgt 
tttataatac 
tgtgttttga 
agaatagcta 



60 
120 
180 
240 
300 
360 
420 
480 
540 
545 



<210> 764 
<211> 500 
<212> DNA 
<213> Homo sapiens 



<400> 764 

gacctgggct 

cgtccgagga 

gcccctactt 

tggaaacctt 

ggatgccgca 

atccctacca 

tgggatgtgg 

caagggttga 

gagctgatga 



gcagttaggt 
gaggatcaag 
gggccccact 
ctcaagagcc 
gaaagaccac 
ttgggttcaa 
gtggccagga 
tatttgtggt 
gaatgctggc 



tgtcgcctgc 
tcttccatac 
gtccccagaa 
tgattgggaa 
catcctatac 
tgtggagaca 
caagattcga 
cgacagcaat 



aggaggtgtg 
cagggaccag 
agcagctgct 
gaaggagatg 
aagctgaaac 
gtggagtata 
cccctctgga 
gatcgggagc 



ctgttgaggg 
gccaaaacca 
gtgatcatgg 
cgcatcctga 
tgggggagat 
agaacatcag 
gacactactt 
gagtaaatga 



agaggaaaac 
gactgccact 
gcaatatctt 
tggtgggcct 
cgtcaccacc 
ctttacagtg 
ccagaacacc 
ggcccgggaa 



<210> 765 
<211> 527 
<212> DNA 
<213> Homo sapiens 



60 
120 
180 
240 
300 
360 
420 
480 
500 



<400> 765 

gctttgacaa 

gtctttaaca 

aataacctca 

ctcaagcttc 

ttctctctat 

ttgtctgccg 

ctccttcttg 

attcaggttc 

ggttttgtct 



aaccagggaa 
gtgcattcaa 
aagaagagca 
tgaccaaagc 
cgaccatcga 
ccacaaaccc 
gaccggtttt 
ttgtggctct 
ccaatttgca 



agagacagag 
tggcaaataa 
cagcgctcca 
cgagaaagca 
gcgtcttgga 
ggaaacgcct 
tgcatatgtg 
actctctgtt 
gaaatctgat 



agagataatg 
gaagccatca 
caggtgaaac 
ggacttttgt 
ttgctgacca 
ggaacgttat 
gttcctgaag 
cttggtggct 
tagcccgatc 



gcggctaaac 
ccttctgctg 
tgctgacaag 
ctttagcaga 
aagcagagga 
tcactttgag 
attacacttg 
ctgctgcttt 
gtttttt 



taatctgttc 
caacaagaac 
agtcgagcag 
gaaatcaggt 
gttcggcgtt 
cctcggttta 
ggaagtagtg 
tgctgcttct 



60 
120 
180 
240 
300 
360 
420 
480 
527 



<210> 766 
<211> 921 
<212> DNA 
<213> Homo sapiens 



<400> 766 

agtagggagg 

tgcaggagcg 

tggggctcag 

tctggtttcc 

acgctgggcg 

aacgtgtggc 

cccgcggcct 

accgagtggc 

gtccatgagt 

tgtgagactg 



aaaaccggag 
ggggactgct 
aggggtgggc 
gccctggagc 
cgggcatcgt 
tctggatcac 
actacgcctc 
tcaacctcat 
ctggttacta 
gtccaggcag 



gagagcgcag 
gggggcgggg 
tttggagatc 
aagccggggc 
gatagccgag 
ctttctgggc 
ccgccgtgtg 
cttcaagtgg 
cagccaggct 
cccttctgga 



gaggaaacag 
cttggtggtg 
agagggtcga 
ctggtcggca 
gcgctacaga 
gatcccaaga 
ggcatcgcgg 
tttctttttg 
ccagcccagg 
cactgcatga 



taccggctgg 
accgctggcg 
cgctgcttcg 
gctgggccgc 
accagctagc 
tcctctttct 
tgctctggat 
gagacaggcc 
ttcaccagtt 
tcacaggagc 



aggccggtct 
gggcggggcc 
ttgcctggac 
catggagtcc 
ctggctggag 
gttctacttc 
cagcctcatc 
cttttggtgg 
cccctcttct 
agccctctgg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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cccataatga 
gtgatgccta 
atcttagcac 
tggctgatga 
ctggccctca 
ctttcttggt 



cagccctgtc 
gcctggctta 
atttccctca 
ctccccgagt 
tgctaggcac 
ccatcagcct 



ttcgcaggtg 
ttgcaccttc 
ccaggtgctg 
gcctatggag 
cagcctcatc 
a 



gccactcggg cccgcagccg ctgggtaagg 
cttttggcgg ttggcttgtc gcgaatcttc 
gctggcctaa taactggcgc tgtcctgggc 
cgggagctaa gcttctatgg gttgactgca 
tattggaccc tctttacact gggcctggat 



<210> 767 
<211> 570 
<212> DNA 
<213> Homo sapiens 



<400> 767 

aagggaatct 

ggcggaggag 

aggcggggca 

cccaattctt 

cagtggcggc 

cgagtccccg 

cagcctggga 

catcagacac 

cctcctagaa 

cgaggggaca 



ccgggcgggg 
aatcgctctt 
ggcccgactt 
cattctgtcc 
cgggatgtgg 
gttgtcatga 
gacagagcga 
agcggcctct 
ctgctctcca 
tctcccgmaa 



tagtgcaggc 
aaagggccag 
tcgccgtctt 
tccaaggggg 
ccgagctctt 
ggcagaggtt 
gactccctct 
atttggtggt 
ggttggccac 
tgtggctctg 



gccgggtttc 
cgcacacgcg 
cttgtctact 
accgctcatc 
ctaccggaag 
gcagtgagcc 
cgggcatcac 
cacaacttca 
ccttctgggc 



ccgcggtccg 
ttcttttgtt 
ctccagaacg 
tacaaagact 
ctgacgggac 
gagatccgga 
catggccgtc 
gaaaacgttt 
gattactgtg 



agctggcgcg 
ccggggccgc 
gccatgattt 
tccgcgggga 
tgccaggaga 
caccgcactc 
atttcattca 
ctcccttcag 
gctccctggg 



660 
720 
780 
840 
900 
921 



60 
120 
180 
240 
300 
360 
420 
480 
540 
570 



<210> 768 
<211> 490 
<212> DNA 
<213> Homo sapiens 



<400> 768 

cactcgctgc 

gcctcggccg 

ctgaagtccg 

tcccgactca 

gtttcaactg 

gatgagttga 

gagaatcaag 

ggtgaaaatg 

agagctcatg 



ctcggcagcg 
cctcgggtca 
agctgaatcg 
agcaggctat 
atactccaaa 
gtggagatgc 
aggcacatga 
aagaagaaaa 



ctgctcttct 
ggcggaaggt 
gcggaactta 
tgaagaggaa 
caagaaacca 
ttctgtggaa 
gcaagatgga 
tgtgcattcc 



aagatggctg 
aaaaagatca 
gacatcaccg 
ggaggcgat c 
actaaaggca 
gatgatgctt 
aatgatgaac 
aaggagttac 



ccgctaccgg 
ccgatctgcg 
gagtcaagac 
cagataatat 
aaggtaaaaa 
ttatcaagga 
taaaggactc 
tctctgcaga 



tgcggtggca 
ggtcatcgat 
cgtgctcatc 
tgaattaact 
acatgaagca 
ctgtgaattg 
tgaagaattt 
agaaaacaag 



60 
120 
180 
240 
300 
360 
420 
480 
490 



<210> 769 
<211> 491 
<212> DNA 
<213> Homo sapiens 



<400> 769 

cactcgctgc 

agcctcggcc 

tctgaagtcc 

ctcccgactc 

tgtttcaact 

agatgagttg 

ggagaatcaa 

tggtgaaaat 

gagagctcat 



ctcggcagcg 
gcctcgggtc 
gagctgaatc 
aagcaggcta 
gatactccaa 
agtggagatg 
gaggcacatg 
gaagaagaaa 

g 



gctgctcttc 
aggcggaagg 
ggcggaactt 
ttgaagagga 
acaagaaacc 
cttctgtgga 
agcaagatgg 
atgtgcattc 



taagatggct 
taaaaagatc 
agacatcacc 
aggaggcgat 
aactaaaggc 
agatgatgct 
aaatgatgaa 
caaggagtta 



<210> 770 
<211> 403 
<212> DNA 
<213> Homo sapiens 



gccgctaccg 
accgatctgc 
ggagtcaaga 
ccagataata 
aaaggtaaaa 
tttatcaagg 
ctaaaggact 
ctctctgcag 



gtgcggtggc 
gggtcatcga 
ccgtgctcat 
ttgaattaac 
aacatgaagc 
actgtgaatt 
ctgaagaatt 
aagaaaacaa 



60 
120 
180 
240 
300 
360 
420 
480 
491 



<400> 770 

cgcccacgcg gaactcgaac tcctgcaggg cctcgtagtc cagcgacctr agggcgaaca 



60 
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ggtggccgtt 
atgctgaaaa 
gccatggtca 
tttccagaaa 
wmaaaatcta 
ttagaaatct 



gtctgtgttg 
gagctgcaac 
gttccggagc 
gacctgcccc 
gttttctgat 
cttcatcacc 



atggagacca 
ggtctcagga 
gttatcgttg 
atcagaggct 
tattttaatt 
ataaaaaatg 



400 

gggaggcgag 
gaattttcgc 
atatctacca 
ttaatatcca 
tctccccttc 
agtagaataa 



gggcagaatc 
ggatggggct 
cctgtattat 
cctgatatga 
gttcattaag 
aaa 



gcggtcttga 
tgtgaatgaa 
gacagtgcat 
cacaatttww 
tgcaaatgtc 



120 
180 
240 
300 
360 
403 



<210> 771 
<211> 374 
<212> DNA 

<213> Homo sapiens 



<400> 771 

attcttttcc 

gccacctgag 

actctggact 

tgtggtctct 

aacaaaagat 

aggtgctgga 

ggtgakagct 



cccttctcac 
gacccagagc 
gctgaccttt 
gatctaatcc 
aatgagaaaa 
gcaggggtcg 
ccag 



cggccaggat 
tcaaagccac 
gaactattat 
ttagagaatg 
gtgaggttgg 
tgatctgaat 



gcagacataa 
atgttgagaa 
gttatttcca 
tgaccatgaa 
aagttggttt 
gaccagaggg 



tgatggcagg 
gggcagagat 
gggaaatgca 
gacacttttc 
actgagccag 
aaggactgat 



agctggagca 
aactgtatcc 
aaccaaagga 
ctacctggta 
gagctataac 
ggaattggat 



60 
120 
180 
240 
300 
360 
374 



<210> 772 
<211> 380 
<212> DNA 
<213> Homo sapiens 



<400> 772 

agaaacagtc 

agtgttatcc 

ggtttttctg 

caggaagtac 

agaggtgttg 

caggagtcgg 

aatcaccatt 



tctccccagc 
tgctgatccc 
aagctgctga 
tcaaagatca 
atgagggcaa 
gatccacatc 
gaaaaaaaar 



tcacttggct 
aatttcctat 
catcgggtca 
tggctcaaaa 
taaaagcaaa 
tgcagactac 



gaagatacta 
agttctggca 
gccctcaaca 
gtctgcagag 
ggtggcttta 
acaaaaacaa 



aagagaacta 
ccagaaaatc 
ccatcacctt 
cagagacaag 
tcacggtgag 
acaacacaga 



gatgttatct 
ctgaatgata 
tggccatgtt 
agattgacaa 
ctactt ctca 
ttaaaagcac 



60 
120 
180 
240 
300 
360 
380 



<210> 773 
<211> 472 
<212> DNA 

<213> Homo sapiens 



<400> 773 

agatggctca 

aactgggccc 

ctgtgctgtt 

tcagtagaag 

gttttgcgac 

agcaaacttg 

ggggcccatg 

ggagtagcaa 

<210> 774 
<211> 521 
<212> DNA 
<213> Homo 



ggtgcttcgg 
tagagactgg 
tgatgcagtt 
ctatatcaac 
agagcatcag 
tgaagacctt 
ttttccctca 
caggactctg 



sapiens 



ctgccacggc 
ggcaggcggt 
gatcactgat 
aatggttgtt 
ccacaagtat 
attatataac 
gcagtcggat 
gactgtcttc 



agaggttctc 
tctgactcag 
attccactct 
ttcctttcct 
ttgatgcgag 
tttatcagac 
gggggaggac 
acactaggtg 



tgctctcttt 
aagatctttt 
tagatattac 
gccaactctt 
gtccatgtct 
aagaaaagcc 
tgctttatgg 
gtgtgggcag 



tctctgctgg 
ggaagaattg 
atatgaaata 
ccacaaagaa 
tccatacacc 
acctcctcca 
acttgcatca 
ca 



60 
120 
180 
240 
300 
360 
420 
472 



<400> 774 

attccaggcg 

tttccctgct 

gtgctaggta 

agcctcctcg 

cgagtgacct 

tgagaatttt 

cagtgtaata 



cctgcaacta 
tcttatcgtc 
gcagcctccc 
acctcccttc 
tgagcccagg 
aatgcatgga 
gcaggaccaa 



aacgtggccg 
tgctttaacg 
ggccctcggg 
ctggtgacgg 
gcgacggtca 
aaagctgtcc 
aatattctgt 



ggtctgcaag 
ccttaaatag 
aaaggcgggg 
acgaacagtt 
gcttgttaat 
atgttccaac 
caatcagctg 



ctaggtgcca 
cccgctgaag 
tggggaggcg 
cccgtagaat 
tcctggctgc 
tgctgtacat 
accatatact 



gcggggaaag 
gctgcagcag 
agagcagctt 
ttcgcttcac 
aggaactttg 
ccaaaagtct 
taatgactcc 



60 
120 
180 
240 
300 
360 
420 



WO 02/083898 



PCT/IB01/00914 



401 



taaaatctcg tggacttcta agaaagcgcc atggcctgtg ctgctgttat gattcctggg 
ttgttgcggt gctctgttgg agccatccgt attgaggctg y 

<210> 775 
<211> 447 
<212> DNA 
<213> Homo sapiens 



<400> 775 

acttagctga 

cccaggttca 

gtaagagagg 

ctggtccaaa 

aaatcaatca 

gtgaccaagg 

ctggggacac 

agccaggaga 



tagctagcat 
tagaatggga 
ccgaagagga 
aggcgataag 
tgggtttctc 
acaggcaggg 
agggaaagat 
acaaggcttg 



tagctgctgg 
tgcccactga 
gaatctggtc 
ggagaacaag 
tctgctgatc 
cctccaggac 
ggcccccgtg 
atgggtc 



ccaggtgcat 
tgacaatgcc 
ctcctggaca 
gtgatcaggg 
agcagctcat 
cccctggccc 
gaatgccagg 



aagccatctt 
aggccctcca 
gcctggtcct 
acctaggatg 
taaacgccgc 
tccaggccca 
agtacccggt 



agatgtccag 
gggaaacgag 
cagggccctc 
stgtttccta 
ctgattaagg 
agagggccac 
gaaccaggaa 



480 
521 



60 
120 
180 
240 
300 
360 
420 
447 



<210> 776 
<211> 491 
<212> DNA 
<213> Homo sapiens 



<400> 776 

attctacagg 

tgggctccac 

atggatggag 

tcccggcgcc 

gaagaagata 

gatggatgcc 

ctgcccagca 

caggctccgt 

ggtcccawtt 



aagcaatttt 
tgaggaagag 
aacgcctgct 
aacgcggccg 
aagagcggag 
gcgtcccctt 
acctgccagc 
cgcggagtcg 

g 



ttggttgtga 
cccttcctcc 
ccaagccgga 
ccgccaaaat 
actaggccag 
cgcagcgacg 
tacaggagcc 
ctgagtccgt 



agggagaaat 
cagcagtgtt 
cgacgacatt 
ccaggcgagt 
aactgagcat 
agacttccct 
ccctgcgtcc 
gcccttttag 



gggctcctca 
gtataaagta 
ctagacatcc 
tttcggggcc 
tttcaaagtt 
gccgtgtttg 
cagagactcc 
ttagttctgc 



tggaggttcc 
gagaagataa 
cgctggacga 
acatggcgcg 
cccgaggaga 
tgaccccctc 
ctcacccagg 
agtctagtat 



60 
120 
180 
240 
300 
360 
420 
480 
491 



<210> 777 
<211> 389 
<212> DNA 
<213> Homo sapiens 



<400> 777 

tgtgtaatgt 

ggcgttcagt 

aaaagcctgg 

aaatgtgata 

tgagccaagt 

catggagggc 

ggtgcagggc 

<210> 778 
<211> 440 
<212> DNA 
<213> Homo 

<400> 778 

tgtagtgttt 

caacttgcca 

tgtcacccag 

ctgggattac 

ggtttcacca 

cggcct ccca 

tttttgatct 

ttatagaaca 



tgaatctgga 
gctgcggasc 
tgatacagct 
cttagaatac 
gagtgagttt 
cattcctgcc 
tgctggtgtc 



sapiens 



gcagtagcgt 
cctcagcggc 
cctggtgtgc 
aggcgcccac 
tgttggccag 
gagttttgcg 
atcaaaacaa 
taaaaaaaaa 



aattgatcag 
tgttaaaggt 
ctggctttct 
tttggctgct 
gcagaaagca 
cggcaacagc 
agagcaaga 



tt ctcacctt 
agaagttgag 
agtggttcar 
caccaagccc 
gctggtctca 
attacaggcg 
cctcattgtt 



cattaaaggg 
cactcagatg 
gagcacacta 
aaggaaactt 
ggtggtgagc 
accgtcctgc 



cacatgaagc 
tgcaggtgtt 
cggatctgga 
cctctccatt 
tcctgcctgc 
agggagccac 



ccctgatgaa 
tgttgttcgt 
gccattctcc 
agctaatttt 
aactcctgac 
tgacaccacg 
ctcaatgact 



gggtcttttc 
gt caccacag 
tgcctcagcc 
tgtatgttcg 
ctcaagcgac 
cccagcggaa 
gaccttataa 



agtgtctgca 
aatcttctct 
aaatactgga 
gcagaatagc 
tggaggttgc 
ttggcagaag 



agttgcaaac 
agtctcgttc 
tcccaagtag 
gtagagacgg 
ccacccgcct 
tttcttattc 
taagtaaatg 



60 
120 
180 
240 
300 
360 
389 



60 
120 
180 
240 
300 
360 
420 
440 
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<210> 779 
<211> 418 
<212> DNA 
<213> Homo sapiens 

<400> 779 

atagattaca acggtgatgg cgggcaagcg gtccggctgg agccgggcgg ctctcctcca 60 

gctccttctc ggcgtgaacc tggtggtgat gccgcccacc cgggcccgga gtctgcgctt 120 

cgttaccttg gaggggatgc tacagcactg ggaactgggc caggccctgc ggcasgctat 180 

cacggcttcc taaacacctc ttatcaccgg caagaggttt atgtgcgasr gacttgaccg 240 

gactctcatg agtgctgagg ccaacctggc tggactcttc cctcccaacg gatgcagcgc 300 

ttcaacccga acatctcgtg gcagcctatt cctgtgcaca ctgtsccatc actgaggaca 360 

ggctgctgaa gttcccgttg ggcccatgtc cccgttatga gcagctgcag aacgagac 418 

<210> 780 
<211> 362 
<212> DNA 
<213> Homo sapiens 

<400> 780 

aaacttgccc ttctcaaaca tggccgccac ggcgcctctg gaagggaacc gctctgggcc 60 

ccgcctttga tctcgttggt ggggctgggg gatgagagct gcaccgcgcg ggacaagtcg 120 

ccggcggccc gacggagcag aagagagagc atggagctgg agaggatcgt cagtgcagcc 180 

ctccttgcct ttgtccagac acacctcccg gagccgacct cagtggcttg gatgaggtca 240 

tcttctccta tgtgcttggg gtcctggagg actgggcccc tcggcattca gaggagaact 300 

tcgatatgga ggctttcact gagatgatgg aggcctatgt gcctggcttc gccacatccc 360 

ca 3 62 

<210> 781 
<211> 379 
<212> DNA 
<213> Homo sapiens 

<400> 781 

agattatttc tacttggttg ccttccagat gtttcacact tggacagcaa actgatttca 60 

aaccactcyt tttcaaagat ctctgaggga gacattgcac ctggccactg cagcccagag 120 

caggtctggc cacggccatg agcatgctga gccatcatgc ccaccgtgga tgacattctg 180 

gagcaggttg gggagtctgg ctggttccag aagcaagcct tcctcatctt atgcctgctg 240 

tcggctgcct ttgcgcccat ctgtgtgggc atcgtcttcc tgggtttcac acctgaccac 300 

cactgccaga gtcctggggt ggctgagctg agccagcgct gtggctggag ccctgcgagg 360 

agctgaacta tacagtgcc 379 

<210> 782 
<211> 453 
<212> DNA 
<213> Homo sapiens 

<400> 782 

ttctcatctt ttgtaatgtg tggtcctcac ttggcctctg ggacacctcc aagtcttagt 60 

tgtcttccta actcactagt tgctccttct cactctcctt tgctggttct tccttttctc 120 

cttgacctct taatgttaga gtcccagagt gtagtcatag tccttctctg tgatctattt 180 

ttaccctgtc tttgtgatct agtataatgg ccaccaccta taaacttata gctcacgaat 240 

ttatatcact agctctgaga tctctcacga actccagatg tgcacatctg actgcttact 300 

caacatctcg aatgcctgtt cttcaagagt ttgagtcatg aagtgtgaag cctagagatg 360 

gtgtgataca cagagatgaa actggttttc acaggtgata gatggaaggc tgttttcaaa 420 

agatcaagct tgaccacata ttatcccctc cac 453 

<210> 783 
<211> 553 
<212> DNA 
<213> Homo sapiens 



<400> 783 
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aactacttcc gggggagcgg cgcggcggcg sggagttggt tctaaagagt ggtgagtcag 60 

aagagacgtc aggcagcaag cgacttgggc catggctctg acctagactt ctcacctccg 120 

cagtgcccga gcccactttc ctggagaacc tgctacggta cggactcttc ctgggagcca 180 

tcttccagct catctgtgtg ctggccatca tcgtacccat tcccaagtcc cacgaggcgg 240 

aggctgaacc gtctgagccc agaagtgctg aggtgacgag gaagcccaag gctgctgttc 300 

cttctgtgaa caagaggccc aagaaagaga ctaagaagaa gcggtagaag aggaggcctg 360 

aggagctggg cgggcaggga gagggtcttg gggacagccc tcctgggaat ctacattgtg 420 

ttcccccgca ttccaggctc agggtctgag gaggctgtga cgccctatga ccgcagagat 480 

ctagacagtc gtaacagtcc ccaggctcba gctgggcaat ccaccacttc ctcttccttc 540 

tgcttctgtg acg 553 



<210> 784 
<211> 472 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> 445 

<223> n=a, g, c or t 



<400> 784 

ttccctttcg aattccaggg tatatctggg aggccggaga cgtgtctggt tattacacag 60 

atgcacagct ggacgtggga tccacacagc tcagaacagt tggatcttgc tcagtctctg 120 

tcagaggaag atcccttgga caagaggacs ctgccttggt gtgagagtga gggaagarga 180 

agctggaacg agggttaagg aaaaccttcc agtctggaca gtgactggag agctccaagg 240 

aaagcccctc ggtaacccag ccgctggcac catgaaccca gagagcagta tctttattga 300 

ggattacctt aagtatttcc aggaccaagt gagcagagag aatctgctac aactgctgac 360 

tgatgatgaa gcctggaatg gattcgtggc tgctgctgaa ctgcccaggg atgaggcaga 420 

tgagctccgt aaagctctga acaangcttg caagtcacat ggtcatgaag ga 472 



<210> 785 
<211> 157 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -29. .-1 



<220> 

<221> UNSURE 
<222> 120 

<223> Xaa - His, Gin 



<400> 785 



Met 


Ala 


Thr 


Met 


Val 
-25 


Pro 


Ser 


Val 


Leu 


Trp 
-20 


Pro 


Arg 


Ala 


Cys 


Trp 
-15 


Thr 


Leu 


Leu 


Val 


Cys 
-10 


Cys 


Leu 


Leu 


Thr 


Pro 
-5 


Gly 


Val 


Gin 


Gly 


Gin 
1 


Glu 


Phe 


Leu 


Leu 
5 


Arg 


Val 


Glu 


Pro 


Gin 
10 


Asn 


Pro 


Val 


Leu 


Ser 
15 


Ala 


Gly 


Gly 


Ser 


Leu 


Phe 


Val 


Asn 


Cys 


Ser 


Thr 


Asp 


Cys 


Pro 


Ser 


Ser 


Glu 


Lys 


He 


Ala 


20 










25 










30 










35 


Leu 


Glu 


Thr 


Ser 


Leu 
40 


Ser 


Lys 


Glu 


Leu 


Val 
45 


Ala 


Ser 


Gly 


Met 


Gly 
50 


Trp 


Ala 


Ala 


Phe 


Asn 

55 


Leu 


Ser 


Asn 


Val 


Thr 
60 


Gly 


Asn 


Ser 


Arg 


He 
65 


Leu 


Cys 


Ser 


Val 


Tyr 
70 


Cys 


Asn 


Gly 


Ser 


Gin 
75 


He 


Thr 


Gly 


Ser 


Ser 
80 


Asn 


He 


Thr 


Val 


Tyr 
85 


Arg 


Leu 


Pro 


Glu 


Arg 
90 


Val 


Glu 


Leu 


Ala 


Pro 
95 


Leu 


Pro 


Pro 


Trp 


Gin 


Pro 


Val 


Gly 


Gin 


Asn 


Phe 


Thr 


Pro 


Ala 


Leu 


Pro 


Ser 


Gly 


Gly 


Trp 
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100 105 110 115 

Val Ala Pro Asp Xaa Pro His Gly Gly Ala Ala Ser Leu 
120 125 



<210> 786 
<211> 151 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -16. .-1 



<400> 786 
Met Val Met Leu 
-15 

Ala Ala Glu Gly 
1 

Val Gin Glu Asn 
20 

He Glu Lys He 
35 

Asn Tyr Ser Leu 
50 

Leu Arg Ala Asp 
65 

Val Asn Leu Thr 

Met Pro Ser Ala 
100 

Ala Leu Val Tyr 
115 

Phe Ala Trp He 
130 



Leu Leu Leu Leu 
-10 

Gin Ala Phe His 
5 

Phe Asp Val Asn 

Pro Thr Thr Phe 
40 

Met Glu Asn Gly 
55 

Gly Thr Val Asn 
70 

Glu Pro Ala Lys 

85 

Pro Tyr Trp He 

Ser Cys Thr Cys 
120 

Leu Ala Arg 
135 



Ser 


Ala 


Leu 


Ala 








-5 


Leu 


Gly 


Lys 


Cys 




10 






Lys 


Tyr 


Leu 


Gly 


25 








Glu 


Asn 


Gly 


Arg 


Lys 


He 


Lys 


Val 








60 


Gin 


He 


Glu 


Gly 






75 




Leu 


Glu 


Val 


Lys 




90 






Leu 


Ala 


Thr 


Asp 


105 








He 


He 


Gin 


Leu 



Gly Leu Phe Gly 

Pro Asn Pro Pro 
15 

Arg Trp Tyr Glu 
30 

Cys He Gin Ala 
45 

Leu Asn Gin Glu 

Glu Ala Thr Pro 
80 

Phe Ser Trp Phe 
95 

Tyr Glu Asn Tyr 
110 

Phe His Val Asp 
125 



<210> 787 
<211> 143 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -17 . .-1 



<400> 787 



Met 


Ala 


Leu 
-15 


Glu 


Val 


Leu 


Met 


Leu 
-10 


Leu 


Ala 


Val 


Leu 


He 
-5 


Trp 


Thr 


Gly 


Ala 


Glu 
1 


Asn 


Leu 


His 


Val 
5 


Lys 


He 


Ser 


Cys 


Ser 
10 


Leu 


Asp 


Trp 


Leu 


Met 
15 


Val 


Ser 


Val 


He 


Pro 
20 


Val 


Ala 


Glu 


Ser 


Arg 

25 


Asn 


Leu 


Tyr 


He 


Phe 
30 


Ala 


Asp 


Glu 


Leu 


His 
35 


Leu 


Gly 


Met 


Gly 


Cys 
40 


Pro 


Ala 


Asn 


Arg 


He 
45 


His 


Thr 


Tyr 


Val 


Tyr 
50 


Glu 


Phe 


He 


Tyr 


Leu 
55 


Val 


Arg 


Asp 


Cys 


Gly 
60 


He 


Arg 


Thr 


Arg 


Val 


Val 


Ser 


Glu 


Glu 


Thr 


Leu 


Leu 


Phe 


Gin 


Thr 


Glu 


Leu 


Tyr 


Phe 


65 










70 










75 










Thr 


Pro 


Arg 


Asn 


He 


Asp 


His 


Asp 


Pro 


Gin 


Glu 


He 


His 


Leu 


Glu 


Cys 


80 










85 










90 










95 


Ser 


Thr 


Ser 


Arg 


Lys 
100 


Ser 


Val 


Trp 


Leu 


Thr 
105 


Pro 


Val 


Ser 


Thr 


Glu 
110 


Asn 


Glu 


He 


Lys 


Leu 
115 


Asp 


Pro 


Ser 


Pro 


Phe 

120 


He 


Ala 


Asp 


Phe 


Gin 
125 


Thr 
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<210> 788 
<211> 148 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -17 . .-1 



<400> 788 



Met 


Ala 


Leu 


Glu 






-15 




Ala 


Glu 


Asn 


Leu 




1 






Val 


Ser 


Val 


He 


Asp 


Glu 


Leu 


His 








35 


Tyr 


Val 


Tyr 


Glu 






50 




Arg 


Val 


Val 


Ser 




65 






Thr 


Pro 


Arg 


Asn 


80 








Ser 


Thr 


Ser 


Arg 


Glu 


He 


Lys 


Leu 








115 


Ala 


Glu 


Glu 


Leu 






130 





Val Leu Met Leu 
-10 

His Val Lys He 

5 

Pro Val Ala Glu 

20 

Leu Gly Met Gly 

Phe He Tyr Leu 
55 

Glu Glu Thr Leu 
70 

He Asp His Asp 
85 

Lys Ser Val Trp 
100 

Asp Pro Ser Pro 



Leu Ala Val Leu 

Ser Cys Ser Leu 
10 

Ser Arg Asn Leu 
25 

Cys Pro Ala Asn 
40 

Val Arg Asp Cys 

Leu Phe Gin Thr 
75 

Pro Gin Glu He 
90 

Leu Thr Pro Val 
105 

Phe He Ala Asp 
120 



He Trp Thr Gly 
-5 

Asp Trp Leu Met 
15 

Tyr He Phe Ala 
30 

Arg He His Thr 
45 

Gly He Arg Thr 
60 

Glu Leu Tyr Phe 

His Leu Glu Cys 
95 

Ser Thr Glu Asn 
110 

Phe Gin Thr Thr 
125 



<210> 789 
<211> 75 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -24 . .-1 



<220> 

<221> UNSURE 
<222> 51 

<223> Xaa = Glu, Lys 
<400> 789 

Met Leu Pro Pro Leu Pro Ser Arg 
-20 

Leu Cys Pro Ala His Val Gly Gly 
-5 

Leu Val Met Asp Pro Thr Ser He 

10 15 
Gly Arg Gin Ala Glu Leu Cys Gin 
25 30 
Leu Ala Arg Gly Ala Arg Leu Gly 
45 



Leu Gly Leu Leu Leu Leu Leu Leu 

-15 -10 
Leu Trp Trp Ala Val Gly Ser Pro 
1 5 
Cys Arg Lys Ala Arg Arg Leu Ala 
20 

Ala Glu Pro Glu Val Val Ala Glu 

35 40 
Val Arg Xaa 
50 



<210> 790 
<211> 64 
<212> PRT 

<213> Homo sapiens 
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<220> 

<221> SIGNAL 
<222> -22. .-1 

<400> 790 

Met Arg Gly Met Lys Leu Leu Gly Ala Leu Leu Ala Leu Ala Ala Leu 

-20 -15 -10 

Leu Gin Gly Ala Val Ser Leu Lys He Ala Ala Phe Asn He Gin Thr 

-5 " 1 5 10 

Phe Gly Glu Thr Lys Met Ser Asn Ala Thr Leu Val Ser Tyr lie Val 

15 20 25 

Gin He Leu Ser Arg Tyr Asp He Ala Leu Val Gin Glu Val Arg Asp 
30 35 4 0 

<210> 791 
<211> 117 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -33. .-1 

<220> 

<221> UNSURE 
<222> 82 

<223> Xaa = Glu, Gly 
<400> 791 



Met 


Ser 


Ala 


Gin 
-30 


Glu 


Ser 


Cys 


Leu 


Ser 
-25 


Leu 


He 


Lys 


Tyr 


Phe 
-20 


Leu 


Phe 


Val 


Phe 


Asn 
-15 


Leu 


Phe 


Phe 


Phe 


Val 
-10 


Leu 


Gly 


Ser 


Leu 


He 
-5 


Phe 


Cys 


Phe 


Gly 


He 
1 


Trp 


He 


Leu 


He 
5 


Asp 


Lys 


Thr 


Ser 


Phe 
10 


Val 


Ser 


Phe 


Val 


Gly 
15 


Leu 


Ala 


Phe 


Val 


Pro 
20 


Leu 


Gin 


He 


Trp 


Ser 
25 


Lys 


Val 


Leu 


Ala 


He 
30 


Ser 


Gly 


He 


Phe 


Thr 
35 


Met 


Gly 


He 


Ala 


Leu 
40 


Leu 


Gly 


Cys 


Val 


Gly 
45 


Ala 


Ser 


Arg 


Ser 


Ser 
50 


Ala 


Ala 


Ser 


Trp 


Ala 
55 


Cys 


He 


Leu 


Gly 


Cys 
60 


Cys 


Cys 


Ser 


Cys 


Leu 

65 


Pro 


His 


Arg 


Ser 


Pro 
70 


Trp 


Glu 


Ser 


Ser 


Ser 
75 


Pro 


Leu 


Ser 


Gly 


Pro 


Ala 


Xaa 


Ala 


Lys 

























80 

<210> 792 
<211> 82 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -32. .-1 

<220> 

<221> UNSURE 
<222> 35 

<223> Xaa = His, Gin 



<400> 792 

Met Ala Lys Met Phe Asp Leu Arg Thr Lys He Met He Gly He Gly 
-30 -25 -20 
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Ser Ser Leu Leu Val Ala Ala Met Val 

-15 -10 
Leu Tyr Phe Lys Val Ala Lys Ala Leu 
1 5 

Ala Val Ala Val Lys Asn His Asn Pro 

20 25 
Asn Ser Xaa Ala Lys Ala Thr Thr Met 
35 40 

Cys Cys 
50 
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Leu Leu Ser Val Val Phe Cys 
-5 

Lys Ala Ala Lys Asp Pro Asp 
10 15 
Asp Lys Val Cys Trp Ala Thr 
30 

Glu Ser Cys Pro Ser Leu Gin 
45 



<210> 793 
<211> 60 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -19. .-1 



<400> 793 

Met Lys lie Leu 

Leu Phe Ala Glu 
1 

Asp Trp Tyr Asp 
15 

Glu His Leu Leu 
30 



Val Ala Leu Ala 
-15 

Glu lie Gly Ala 
5 

Ser Asp Gin lie 
20 

Gin Arg lie Ala 
35 



Val Phe Phe Leu 
-10 

Asn Asp Asp Leu 

Lys Glu Glu Leu 
25 

Arg Arg Pro Lys 
40 



Val Ser Thr Gin 
-5 

Asn Tyr Trp Ser 
10 

Pro Glu Pro Phe 



<210> 794 
<211> 79 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -28 . .-1 



<400> 794 
Met lie Tyr Thr 
-25 

Leu Leu Leu Asn 
-10 

Asp Glu Glu His 
5 

Leu Met His Ser 

Ala lie Met Lys 
40 



Met Lys Lys Val 

Leu Ala Pro Ala 
-5 

Thr lie lie Thr 
10 

Phe Cys Ala Phe 
25 

Arg Phe Ser Ser 



His Ala Leu Trp 
-20 

Pro Leu Asn Ala 

Asp Thr Glu Leu 
15 

Lys Ala Asp Asp 
30 

lie Phe Ser Leu 
45 



Ala Ser Val Cys 
-15 

Asp Ser Glu Glu 
1 

Pro Pro Leu Lys 
20 

Gly Pro Cys Lys 
35 

Asp Ser Ala 
50 



<210> 795 
<211> 76 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -24 . . -1 



<400> 795 

Met Gly Ser Ala Cys lie Lys Val Thr Lys Tyr Phe Leu Phe Leu Phe 
-20 -15 -10 
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Asn 


Leu 


He 


Phe 
-5 


Phe 


He 


Leu 


Gly 


Ala 
1 


Val 


He 


Leu 


Gly 
5 


Phe 


Gly Val 


Trp 


He 


Leu 


Ala 


Asp 


Lys 


Ser 


Ser 


Phe 


He 


Ser 


Val 


Leu 


Gin 


Thr Ser 




10 










15 










20 








Ser 


Ser 


Ser 


Leu 


Arg 


Met 


Gly 


Ala 


Tyr 


Val 


Phe 


He 


Gly 


Val 


Gly Ala 


25 










30 










35 








40 


Val 


Thr 


Met 


Leu 


Met 


Gly 


Phe 


Leu 


Gly 


Cys 


He 


Gly 









45 50 



<210> 796 
<211> 151 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -18. .-1 



<220> 

<221> UNSURE 
<222> 117 

<223> Xaa = Ser, Thr 
<400> 796 

Met Lys Ala Leu He Val Leu Gly Leu Val Leu Leu Ser Val Thr Val 

-15 -10 -5 

Gin Gly Lys Val Phe Glu Arg Cys Glu Leu Ala Arg Thr Leu Lys Arg 

1 5 10 

Leu Gly Met Asp Gly Tyr Arg Gly He Ser Leu Ala Asn Trp Met Cys 
15 20 ~ 25 30 

Leu Ala Lys Trp Glu Ser Gly Tyr Asn Thr Arg Ala Thr Asn Tyr Asn 

35 40 45 

Ala Gly Asp Arg Ser Thr Asp Tyr Gly He Phe Gin lie Asn Ser Arg 

50 55 60 

Tyr Trp Cys Asn Asp Gly Lys Thr Pro Gly Ala Val Asn Ala Cys His 

"65 " 70 75 

Leu Ser Cys Ser Ala Leu Leu Gin Asp Asn lie Ala Asp Ala Val Ala 

80 " 85 90 

Cys Ala Lys Arg Val Val Arg Asp Pro Pro Thr Ser Ala Ser Gin Ser 
95 100 105 110 

Ala Gly He Thr Gly Val Xaa Asn Cys Ala Arg Pro His Ser Val Leu 

115 120 125 

lie Lys Glu He Thr Gin Thr 
130 



<210> 797 

<211> 82 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -29. .-1 



<400> 797 

Met Asp Leu Gin Gly 
-25 

Leu Leu Met Leu Phe 
-10 

Val Glu Asn Leu Ser 
5 

Val Val Arg Glu Asn 

20 



Arg Gly Val Pro Ser 
-20 

His Thr Met Ala Gin 
-5 

Gly Leu Ser Thr Asn 
10 

Gly Thr Thr Cys Leu 
25 



He Asp Arg Leu Arg Val 
-15 

lie Met Ala Glu Gin Glu 
1 

Pro Glu Lys Asp He Phe 
15 

Met Ala Glu Phe Ala Ala 
30 35 
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Lys Phe lie Val Pro Tyr Asp Val Trp Ala Ser Asn Tyr Val Asp Leu 
40 * 45 50 

He Thr 



<210> 798 
<211> 60 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -24. .-1 



<400> 798 

Met Glu Glu Gly Gly Asn Leu Gly Gly Leu He Lys Met Val His Leu 

-20 ^ -15 -10 

Leu Val Leu Ser Gly Ala Trp Gly Met Gin Met Trp Val Thr Phe Val 

-5 15 

Ser Gly Phe Leu Leu Phe Arg Ser Leu Pro Arg His Thr Phe Gly Leu 

10 15 20 

Val Gin Ser Lys Leu Phe Pro Phe Tyr Phe His He 

25 30 35 



<210> 799 
<211> 142 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -26. .-1 



<220> 

<221> UNSURE 
<222> 47 

<223> Xaa = Gly, Arg 



<400> 799 
Met Lys Thr Glu 
-25 

Ala Ser Leu Phe 
-10 

Arg Arg Gin Cys 
10 

Pro Phe Thr Val 
25 

Pro Arg Ala Leu 
40 

Val Thr Leu Gly 

55 

Arg Glu Val Leu 

Leu Leu Lys Ser 
90 

Ser Leu He Glu 
105 



Gin Leu Asn Arg 
-20 

Thr Glu Asn Val 
-5 

Gin Val Asn Tyr 

He Asn He Met 
30 

Trp Lys Gly Met 
45 

Ala Glu Gly He 

60 

His Lys Trp Ser 
75 

Leu Thr Tyr Val 

Thr Val Gin Ser 
110 



Phe Ala Gly Phe 
-15 

Leu Ala His Pro 
1 

His Ala Gin His 
15 

Tyr Ser Phe Asn 

Xaa Ser Thr Phe 
50 

He Ser Glu Phe 
65 

Pro Lys Gin He 
80 

Val Ala Met Leu 
95 

Glu He He Arg 



Gly He Gly Leu 

Cys He Val Leu 
5 

Tyr His Leu Thr 
20 

Lys Thr Gin Gly 
35 

He Val Gin Gly 

Thr Pro Leu Pro 
70 

Gly Glu His Leu 
85 

Phe Tyr Ser Ala 

100 
Asp Asn 
115 



<210> 800 
<211> 126 
<212> PRT 

<213> Homo sapiens 



<220> 
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<221> SIGNAL 
<222> -44 . . -1 



<400> 800 

Met His Glu Glu 

Pro Asp Thr Tyr 
-25 

Cys Cys Leu Val 
-10 

Thr Ala Ser lie 

5 

Ala Met Gin Gin 

Asn Phe Thr Glu 
40 

Gin Ala Phe Met 
55 

Cys Pro Asn Asn 
70 



Glu lie Tyr Thr 
-40 

Gin Lys Cys Leu 

Met Val lie Ser 
-5 

Phe Leu Gly Val 
10 

Gin Glu Lys Leu 
2 5 

Trp Lys Arg Ser 

Gin Asn Ser Leu 
60 

Trp lie Gin Asn 
75 



Ser Leu Gin Trp 
-35 

Ser Ser Asn Lys 
-20 

Cys Val Phe Cys 

Lys Leu Leu Gin 
15 

He Gin Gin Glu 
30 

Cys Ala Leu Gin 
45 

Ser Ser Ala His 

Arg Glu Ser Cys 
80 



Asp Ser Pro Ala 
-30 

Cys Ser Gly Ala 
-15 

Met Gly Leu Leu 

1 

Val Ser Thr He 
20 

Arg Ala Leu Leu 

35 

Met Lys Tyr Cys 

50 

Asn Ser Ser Pro 

65 

Tyr Tyr 



<210> 801 
<211> 52 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -20. .-1 



<400> 801 

Met Gly Arg Ala Arg Glu Val Gly 
-20 -15 
Gly Ala Gly Ala Cys Tyr Cys Val 
1 

Asp Ser Glu Lys Leu Glu Glu Glu 

15 20 
Gin Glu Leu Asp 
30 



Trp Met Ala Ala Gly Leu Met lie 

-10 -5 
Tyr Lys Leu Thr lie Gly Arg Asp 
5 10 
Gly Glu Glu Glu Trp Asp Asp Asp 
25 



<210> 802 
<211> 154 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -15. .-1 



<400> 802 




Met 


Asn 


He 


Leu 


-15 








Val 


Thr 


Lys 


Gly 








5 


Gly 


His 


Cys 


Arg 






20 




Cys 


Arg 


Asn 


Lys 




35 






Thr 


Arg 


Arg 


Pro 


50 








Asp 


Tyr 


Ser 


Asp 


Asn 


Asp 


Leu 


He 



Met Leu Thr Phe 
-10 

Ser Phe Glu Pro 

Arg Arg Cys Leu 
25 

Leu Ser Cys Cys 
40 

Ala Phe Pro Val 
55 

Val Asp Ser Phe 
70 

Thr Phe Asp Thr 



He He Cys Gly 
-5 

Gin Lys Cys Trp 
10 

Asp Thr Glu Arg 

He Ser He He 
45 

He His Leu Glu 
60 

Thr Gly Ser Pro 
75 

Thr Lys Phe Gly 



Leu Leu Thr Arg 
1 

Lys Asn Asn Val 
15 

Tyr He Leu Leu 
30 

Ser His Glu Tyr 

Asp He Thr Leu 
65 

Val Ser Met Leu 
80 

Glu Thr Met Thr 
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85 90 
Pro Glu Thr Asn Thr Pro Glu Thr Thr 

100 105 
Thr Pro Glu Thr Thr Met Pro Pro Ser 

115 120 
Met Pro Pro Pro Ser Gin Thr Ala Leu 
130 135 
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95 

Met Pro Pro Ser Glu Ala Thr 
110 

Glu Thr Ala Thr Ser Glu Thr 
125 

Thr 



<210> 803 
<211> 165 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -39. .-1 



<400> 803 

Met Met Thr lie Thr Phe Leu Pro Tyr Thr Phe Ser Leu Met Val Thr 

-35 -30 -25 

Phe Pro Asp Val Pro Leu Gly lie Phe Leu Phe Cys Val Cys Val lie 

-20 -15 -10 

Ala lie Gly Val Val Gin Ala Leu lie Val Gly Tyr Ala Phe His Phe 

-5 15 
Pro His Leu Leu Ser Pro Gin lie Gin Arg Ser Ala His Arg Ala Leu 
10 15 20 25 

Tyr Arg Arg His Val Leu Gly lie Val Leu Gin Gly Pro Ala Leu Cys 

30 35 4 0 

Phe Ala Ala Ala lie Phe Ser Leu Phe Phe Val Pro Leu Ser Tyr Leu 

45 50 55 

Leu Met Val Thr Val lie Leu Leu Pro Tyr Val Ser Lys Val Thr Gly 

60 65 70 

Trp Cys Arg Asp Arg Leu Leu Gly His Arg Glu Pro Ser Ala His Pro 

75 80 85 

Val Glu Val Phe Ser Phe Asp Leu His Glu Pro Leu Ser Lys Glu Arg 
90 95 100 105 

Val Glu Ala Phe Ser Asp Gly Val Tyr Ala lie Val Ala Thr Leu Leu 

110 115 120 

lie Leu Asp lie Trp 
125 



<210> 804 
<211> 145 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -22 . . -1 



<220> 

<221> UNSURE 
<222> 84 

<223> Xaa = Ala , Pro, Ser , Thr 



<220> 

<221> UNSURE 
<222> -1 

<223> Xaa = Ala, Ser 



<220> 

<221> UNSURE 
<222> 21 
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<223> Xaa = Cys , Phe, lie, Asn, Ser , Thr , Tyr 
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<220> 

<221> UNSURE 
<222> 46 

<223> Xaa = Cys , Phe, Ser , Tyr 



<400> 804 
Met Gly Arg Trp 
-20 

Leu Pro Met Val 
-5 

Thr Ser Trp Asp 

Gin Leu lie Ser 
30 

Pro Thr Thr Xaa 
45 

Asp Thr Met Gin 
60 

Glu Pro Ala Arg 
75 

Thr Ser Cys Cys 

Gly Thr lie Leu 

110 

Glu 



Ser Leu Thr Ala 
-15 

Thr Xaa Gly Pro 

1 

Thr Val Leu Thr 
15 

Thr Thr Leu Gly 

Cys Ser Gly Thr 
50 

Arg Glu Glu Leu 
65 

Ala Glu Asp Leu 
80 

Pro Thr Leu Ala 
95 

Met Thr lie Met 



Ser Pro Val Thr 

Ala Ala Gly Lys 
5 

Ala lie Xaa Cys 
20 

Ala Ser Ala Ser 
35 

Thr Val Phe Leu 

Val Lys Asn Ala 
7 0 

Ser Xaa Ala Pro 
85 

Pro Ala Ala Pro 
100 

Phe Ala Ser Val 
115 



Leu Thr Ser Leu 
-10 

Ser Ser Ser Ser 
10 

Ala Ser Thr Val 
25 

Gly Ala Arg Met 
40 

Thr Ala Leu Gin 
55 

Thr Pro Pro Ala 

His Val Thr Pro 
90 

Pro Ala Ser Leu 
105 

Gly Phe Ser Asn 
120 



<210> 805 
<211> 109 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -30. .-1 



<220> 

<221> UNSURE 
<222> 64 

<223> Xaa = He, Met 
<220> 

<221> UNSURE 
<222> 58 

<223> Xaa - Leu, Val 
<220> 

<221> UNSURE 
<222> 42 

<223> Xaa = Lys, Arg 
<400> 805 

Met Ala Gly Lys Arg Ser Gly Trp Ser Arg Ala Ala Leu Leu Gin Leu 
-30 -25 ~ " -20 -15 

Leu Leu Gly Val Asn Leu Val Val Met Pro Pro Thr Arg Ala Arg Ser 

-10 -5 1 

Leu Arg Phe Val Thr Leu Glu Gly Met Leu Gin His Trp Glu Leu Gly 

5 10 15 

Gin Ala Leu Arg Gin Arg Tyr His Gly Phe Leu Asn Thr Ser Tyr His 

20 25 30 

Arg Gin Glu Val Tyr Val Arg Xaa Tyr Phe Asp Arg Thr Leu Met Ser 
35 40 45 50 
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Ala Glu Ala Asn Leu Ala Gly Xaa Phe Pro Pro Asn Gly Xaa Gin Arg 

55 ^ 60 65 

Phe Asn Pro Asn lie Ser Trp Gin Pro lie Pro Val His 
70 75 



<210> 806 
<211> 57 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -32. .-1 



<400> 806 

Met Ala Thr Pro Pro Phe Arg Leu 
-30 -25 
Val Ser Arg Trp Met Gly Leu Ala 

-15 -10 
Ser Pro Ser lie Arg Gin Lys Lys 
1 5 

Asn Ala Phe Arg Glu Glu Met Lys 
20 



lie Arg Lys Met Phe Ser Phe Lys 
-20 

Cys Phe Arg Ser Leu Ala Ala Ser 
-5 

Leu Met His Lys Leu Gin Glu Glu 
10 15 

He 
25 



<210> 807 
<211> 79 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -19. .-1 



<400> 807 

Met Val Gly He 

Cys Cys Ser Ser 
1 

Val Arg Val Pro 
15 

Cys Ser Leu Pro 
30 

Phe Ser Ser Trp 



Leu Pro Leu Cys 
-15 

Tyr Val Pro Ser 
5 

His Ser Ala Gly 
20 

Gin Val Phe Leu 

35 

Leu Ser Leu Glu 
50 



Cys Ser Gly Cys 
-10 

Val Ala Pro Thr 

His Cys Gly Gin 

25 

Lys Pro Trp He 
40 

Leu Phe Ser Phe 
55 



Val Pro Ser Leu 
-5 

Ala Ala His Ser 
10 

Arg Val Leu Ala 

Phe Val Glu His 
45 

Leu Arg Tyr 
60 



<210> 808 
<211> 147 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -20. . -1 



<400> 808 

Met Asp Ser Ser Thr 
-20 

Glu He Thr Ala Ser 
1 

Phe Ser Thr Gin Ser 
15 

He Leu Gly Thr He 
30 



Ala His Ser Pro Val Phe 
-15 -10 
Glu Tyr Glu Ser Thr Glu 
5 

Pro Leu Gin Lys Leu Phe 
20 

Gin He Leu Phe Gly He 
35 



Leu Val Phe Pro Pro 
-5 

Leu Ser Ala Thr Thr 
10 

Ala Arg Lys Met Lys 
25 

Met Thr Phe Ser Phe 
40 
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Gly Val 


He 


Phe 


Leu 


Phe 


Thr 


Leu 


Leu 


Lys 


Pro 


Tyr 


Pro 


Arg 


Phe 


Pro 


45 








50 










55 










60 


Phe lie 


Phe 


Leu 


Ser 


Gly 


Tyr 


Pro 


Phe 


Trp 


Gly 


Ser 


Val 


Leu 


Phe 


He 








65 










70 










75 




Asn Ser 


Gly 


Ala 


Phe 


Leu 


He 


Ala 


Val 


Lys 


Arg 


Lys 


Thr 


Thr 


Glu 


Thr 






80 










85 










90 






Leu lie 


He 


Leu 


Ser 


Arg 


He 


Met 


Asn 


Phe 


Leu 


Ser 


Ala 


Leu 


Gly 


Ala 




95 










100 










105 








He Ala 


Gly 


He 


He 


Leu 


Leu 


Thr 


Phe 


Gly 


Phe 


His 


Pro 


Arg 


Ser 


Lys 


110 










115 










120 










Leu His 


Leu 




























125 






























<210> 809 




























<211> 96 




























<212> PRT 




























<213> Homo sapiens 
























<220> 






























<221> SIGNAL 


























<222> -33. .- 


-1 


























<400> 809 




























Met Lys 


Leu 


Lys 


Leu 


Lys 


Asn 


Val 


Phe 


Leu 


Ala 


Tyr 


Phe 


Leu 


Val 


Ser 






-30 










-25 










-20 






He Ala 


Gly 


Leu 


Leu 


Tyr 


Ala 


Leu 


Val 


Gin 


Leu 


Gly 


Gin 


Pro 


Cys 


Asp 




-15 










-10 










-5 








Cys Leu 


Pro 


Pro 


Leu 


Arg 


Ala 


Ala 


Ala 


Glu 


Gin 


Leu 


Arg 


Gin 


Lys 


Asp 


1 








5 










10 










15 


Leu Arg 


He 


Ser 


Gin 


Leu 


Gin 


Ala 


Glu 


Leu 


Arg 


Arg 


Pro 


Pro 


Pro 


Ala 








20 










25 










30 




Pro Ala 


Gin 


Pro 


Pro 


Glu 


Pro 


Glu 


Ala 


Leu 


Pro 


Thr 


He 


Tyr 


Val 


Val 






35 










40 










45 






Thr Pro 


Thr 


Tyr 


Ala 


Arg 


Leu 


Val 


Gin 


Lys 


Ala 


Glu 


Leu 


Val 


Arg 


Leu 




50 










55 










60 








<210> 810 




























<211> 169 




























<212> PRT 




























<213> Homo sapiens 
























<220> 






























<221> SIGNAL 


























<222> -16. 


-1 


























<400> 810 




























Met Ser 


Gly 


Cys 


Gly 


Leu 


Phe 


Leu 


Arg 


Thr 


Thr 


Ala 


Ala 


Ala 


Arg 


Ala 


-15 










-10 










-5 










Cys Arg 


Gly 


Leu 


Val 


Val 


Ser 


Thr 


Ala 


Asn 


Arg 


Arg 


Leu 


Leu 


Arg 


Thr 


1 






5 
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Ala 


Gly 


Val 


Glu 








20 










25 










30 






He 


Ser 


Pro 


Ala 


Ala 


Asp 


Gly 


Pro 


Ser 


Pro 


Glu 


Gly 


Pro 


He 


Ala 


Glu 






35 










40 










45 








Pro 


Asp 


Gly 


Glu 


Ser 


Asp 


Gly 


Glu 


Glu 


Glu 


Val 


Glu 


Pro 


Pro 


Gly 


Ala 




50 










55 










60 










Asn 


Asp 


Gly 


Glu 


Thr 


Val 


Gly 


Glu 


Phe 


Glu 


Thr 


Gly 


Asp 


Glu 


Asp 


Leu 


65 










70 










75 










80 


Gly 


Ala 


He 


Asp 


Pro 


Gly 


Gly 


Glu 


Val 


He 


Gly Ala 


Glu 


Asp 


Gly 


Gly 










85 










90 










95 




lie 


Val 


Glu 


Asp 


Asp 


Leu 


Gly 


Ala 


He 


Gly Ala 


Gly 


Gly 


Glu 


Val 


He 








100 










105 










110 






Gly 


Ala 


Glu 


Asp 


Gly 


Glu 


Thr 


Gly 


Asp 


Val 


Asp 


Leu 


Gly 


Gly 


Ala 


Gly 






115 










120 










125 








Gly 


Glu 


Val 


Val 


Gly 


Glu 


Thr 


Gly 


Glu 


Leu 















130 135 



<210> 824 
<211> 74 



WO 02/083898 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 53 

<223> Xaa = * , Ala, Asp, Glu, Phe, Leu, Ser , Val , Tyr 
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421 



<220> 

<221> UNSURE 
<222> 28 

<223> Xaa = His,Asn,Tyr 
<220> 

<221> UNSURE 
<222> 26 

<223> Xaa = lie, Lys , Met , Asn 



<220> 

<221> UNSURE 
<222> 27 

<223> Xaa = any one of the twenty amino acids 
<400> 824 

Met Arg Ser His Thr lie Thr Met Thr Thr Thr Ser Val Ser Ser Trp 

15 10 15 

Pro Tyr Ser Ser His Arg Met Arg Phe Xaa Xaa Xaa His Ser Asp Gin 

20 25 30 

Pro Pro Gin Asn Phe Ser Ala Thr Pro Asn Val Thr Thr Cys Pro Met 

35 40 45 

Asp Glu Lys Leu Xaa Thr Thr Val Leu Thr Thr Ser Tyr Ser Val lie 

50 55 60 

Phe lie Val Gly Leu Val Gly Asn lie lie 
65 70 



<210> 
<211> 
<212> 
<213> 



825 
135 
PRT 
Homo 



sapiens 



<220> 

<221> UNSURE 
<222> 8 

<223> Xaa = Glu, Gin 
<220> 

<221> UNSURE 
<222> 63 

<223> Xaa = Glu, Gly 
<220> 

<221> UNSURE 
<222> 133 

<223> Xaa = His, Pro 



<400> 825 

Met Lys Glu Ala 

1 

Arg Ser Val Ser 
20 

Leu Ser Lys Val 
35 

Phe Pro Ala Glu 



Gly Gin Met Xaa 
5 

Thr Gin Thr Gly 

Asn Leu Phe Thr 
40 

Ala Gin Arg Gin 



Asn Leu Glu Ser 
10 

Ser Met Thr Gly 
25 

Leu Leu Ser Leu 
Lys Ser Gin Lys 



Ala Arg Ala Gly 
15 

Gin lie Pro Arg 
30 

Trp Met Glu Leu 
45 

Asn Glu Xaa Gly 
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50 55 60 



Lys 


His 


Gly 


Pro 


Leu 


Gly 


Asp 


Asn 


Glu 


Glu 


Arg 


Thr 


Arg 


Val 


Ser 


Thr 


65 










70 










75 










80 


Asp 


Lys 


Arg 


Gin 


Val 


Lys 


Arg 


Thr 


Gly 


Leu 


Val 


Val 


Val 


Lys 


Asn 


Met 










85 










90 










95 




Lys 


He 


Val 


Gly 


Leu 


His 


Cys 


Ser 


Ser 


Glu 


Asp 


Leu 


His 


Ala 


Gly 


Gin 








100 










105 










110 






He 


Ala 


Leu 


He 


Lys 


His 


Gly 


Ser 


Arg 


Leu 


Lys 


Asn 


Cys 


Asp 


Pro 


Tyr 






115 










120 










125 








Phe 


Ser 


Arg 


Lys 


Xaa 


Val 


Leu 






















130 










135 




















<210> 826 




























<211> 61 




























<212> PRT 




























<213> Homo sapiens 
























<220> 






























<221> UNSURE 


























<222> 44 




























<223> Xaa = 


* 


Glu, 


Gin 






















<400> 826 




























Met 


Pro 


Gly Ala 


Ala 


Gly 


Arg 


Arg 


Cys 


Pro 


Asp 


Arg 


Ser 


Pro 


Pro 


Phe 


1 








5 










10 










15 




Gly 


Pro 


Asp 


Phe 


Gly 


Ser 


Ser 


Gly Val 


Leu 


Met 


Leu 


Leu 


Lys 


Pro 


Glu 








20 










25 










30 






Ser 


Thr 


Cys 


Pro 


Asp 


Pro 


Ser 


Leu 


Tyr 


Asp 


Ala 


Xaa 


Leu 


Cys 


Pro 


Val 






35 










40 










45 








Pro 


Glu 


Pro 


Lys 


Pro 


Cys 


Arg 


Gly 


Val 


Ala 


Ser 


Asp 


Pro 










50 










55 










60 










<210> 827 




























<211> 130 




























<212> PRT 




























<213> Homo : 


sapiens 
























<400> 827 




























Met 


Arg 


Ser 


His 


Thr 


He 


Thr 


Met 


Thr 


Thr 


Thr 


Ser 


Val 


Ser 


Ser 


Trp 


1 








5 










10 










15 




Pro 


Tyr 


Ser 


Ser 


His 


Arg 


Met 


Arg 


Phe 


He 


Thr 


Asn 


His 


Ser 


Asp 


Gin 








20 










25 










30 






Pro 


Pro 


Gin 


Asn 


Phe 


Ser 


Ala 


Thr 


Pro 


Asn 


Val 


Thr 


Thr 


Cys 


Pro 


Met 






35 










40 










45 








Asp 


Glu 


Lys 


Leu 


Leu 


Ser 


Thr 


Val 


Leu 


Thr 


Thr 


Ser 


Tyr 


Ser 


Val 


He 




50 










55 










60 










Phe 


He 


Val 


Gly 


Leu 


Val 


Gly 


Asn 


He 


He 


Ala 


Leu 


Tyr 


Val 


Phe 


Leu 


65 










70 










75 










80 


Gly 


He 


His 


Arg 


Lys 


Arg 


Asn 


Ser 


He 


Gin 


He 


Tyr 


Leu 


Leu 


Asn 


Val 










85 










90 










95 




Ala 


He 


Ala 


Asp 


Leu 


Leu 


Leu 


He 


Phe 


Cys 


Leu 


Pro 


Phe 


Arg 


He 


Met 








100 










105 










110 






Tyr 


His 


He 


Asn 


Gin 


Asn 


Lys 


Trp 


Thr 


Leu 


Gly 


Val 


He 


Leu 


Cys 


Lys 






115 










120 










125 








Val 


Val 
































130 































<210> 828 
<211> 67 
<212> PRT 

<213> Homo sapiens 



<400> 828 
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Met Asn Phe Leu Ser Ala Leu Gly Ala lie Ala Gly lie lie Leu Leu 

1 5 10 15 

Thr Phe Gly Phe lie Leu Asp Gin Asn Tyr lie Cys Gly Tyr Ser His 

20 25 30 

Gin Asn Ser Gin Cys Lys Ala Val Thr Val Leu Phe Leu Gly lie Leu 
35 - 4Q 45 

lie Thr Leu Met Thr Phe Ser lie lie Glu Leu Phe lie Ser Leu Leu 

50 55 60 

Ser Gin Phe 
65 



<210> 829 
<211> 55 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 26 

<223> Xaa = His, Gin 



<400> 829 

Met Pro Lys Gly Gly Arg Lys Gly 
1 5 

Tyr Thr Ser Pro Glu Glu lie Asp 
20 

Arg Pro Gly Lys Lys Arg Ser Lys 

35 40 
Asp Pro Lys Lys Lys Lys Lys 
50 55 



Gly His Lys Gly Arg Ala Arg Gin 

10 15 
Ala Xaa Cys Arg Leu Arg Ser Arg 
25 30 
Lys Gly Gly Asp Gly Ala Ala Gly 
45 



<210> 830 
<211> 55 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 26 

<223> Xaa = His, Gin 



<400> 830 

Met Pro Lys Gly 

1 

Tyr Thr Ser Pro 
20 

Arg Pro Gly Lys 
35 

Asp Pro Lys Lys 
50 



Gly Arg Lys Gly 

5 

Glu Glu lie Asp 

Lys Arg Ser Lys 
4 0 

Lys Lys Lys 
55 



Gly His Lys Gly 
10 

Ala Xaa Cys Arg 
25 

Lys Gly Gly Asp 



Arg Ala Arg Gin 
15 

Leu Arg Ser Arg 

30 

Gly Ala Ala Gly 
45 



<210> 831 
<211> 86 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> UNSURE 
<222> 9,72 

<223> Xaa = Gly,Arg,Trp 



<220> 

<221> UNSURE 
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<222> 67 

<223> Xaa = Lys, Met , Arg, Thr 
<220> 

<221> UNSURE 
<222> 2 

<223> Xaa = Phe,Leu 



<220> 

<221> UNSURE 
<222> 85 

<223> Xaa = Ser,Tyr 



<400> 831 




























Met 


Xaa 


Leu 


Val 


Asp 


Trp 


Phe 


Tyr 


Xaa 


Val 


Leu 


Ser 


Ser 


Leu 


Gly 


Leu 


1 








5 










10 










15 




Trp 


Gin 


Lys 


Glu 


Ala 


Lys 


He 


Leu 


Phe 


Leu 


Gly 


Leu 


Asp 


Asn 


Ala 


Gly 








20 










25 










30 






Lys 


Thr 


Thr 


Leu 


Leu 


His 


Met 


Leu 


Lys 


Asp 


Glu 


Arg 


Leu 


Val 


Gin 


His 






35 










40 










45 








Gin 


Pro 


Thr 


Gin 


Tyr 


Pro 


Thr 


Ser 


Glu 


Glu 


Leu 


Ser 


He 


Gly 


Arg 


He 




50 










55 










60 










Lys 


Phe 


Xaa 


Ala 


Phe 


Asp 


Leu 


Xaa 


Gly 


His 


Gin 


He 


Ala 


Arg 


Arg 


Val 


65 










70 










75 










80 


Trp 


Lys 


Asp 


Tyr 


Xaa 


Ala 























85 



<210> 832 
<211> 131 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> UNSURE 
<222> 129 

<223> Xaa = Leu, Val 



<400> 832 

Met Ser Pro Leu 

1 

Arg Thr Tyr Thr 
20 

Arg Lys Arg Asp 
35 

Val Leu Pro Gin 
50 

Leu Tyr Ser Thr 
65 

Ser Arg Arg Met 

Ser Val Pro Leu 
100 

Glu Leu Asp Tyr 
115 

Xaa Phe Gin 
130 



Glu Cys Ser Glu 
5 

Trp Gin Leu Thr 

Thr Arg Ser Glu 
4 0 

Arg Gly Thr Ala 
55 

Pro Arg Cys Ala 
70 

Ala Ser Gin His 
85 

Asp Pro Met Glu 

Asn Pro Pro Arg 
120 



Cys Phe Gly Asp 
10 

Leu His Ser Arg 
25 

Ser Leu Glu He 

Glu Pro Phe Pro 
60 

Gin Gin Ala Ala 
75 

Ser Tyr Pro Leu 
90 

Arg Pro Met Ser 
105 

Val Gin Leu Ser 



Gin Leu Leu His 
15 

Pro Asn Tyr Thr 

30 

Pro He Ser Val 
45 

Arg Leu His Asn 

Leu Pro Arg Leu 
80 

Asn Arg Phe Ser 
95 

Gin Ala Asp Leu 
110 

Asp Glu Met Phe 
125 



<210> 833 
<211> 99 
<212> PRT 

<213> Homo sapiens 



<220> 
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<221> UNSURE 
<222> 32 

<223> Xaa = * , Lys, Leu, Met 
<220> 

<221> UNSURE 
<222> 26 

<223> Xaa = Asp,Glu 
<220> 

<221> UNSURE 

<222> 70,78 

<223> Xaa = His, Gin 

<220> 

<221> UNSURE 
<222> 27 

<223> Xaa = Leu, Val 



<400> 833 



Met 


Ala 


Thr 


Val 


Met 


Ala 


Ala 


Thr 


Ala 


Ala 


Glu 


Arg 


Ala 


Val 


Leu 


Glu 


1 








5 










10 










15 




Glu 


Glu 


Phe 


Arg 


Trp 


Leu 


Leu 


His 


Asp 


Xaa 


Xaa 


His 


Ala 


Val 


Leu 


Xaa 








20 










25 










30 






Gin 


Leu 


Gin 


Asp 


He 


Leu 


Lys 


Glu 


Ala 


Ser 


Leu 


Arg 


Phe 


Thr 


Leu 


Pro 






35 










40 










45 








Gly 


Ser 


Gly 


Thr 


Glu 


Gly 


Pro 


Ala 


Lys 


Gin 


Glu 


Asn 


Phe 


He 


Leu 


Gly 




50 










55 










60 










Ser 


Cys 


Gly 


Thr 


Asp 


Xaa 


Val 


Lys 


Gly 


Val 


Leu 


Thr 


Leu 


Xaa 


Gly Asp 


65 










70 










75 










80 


Ala 


Leu 


Ser 


Gin 


Ala 


Asp 


Val 


Asn 


Leu 


Lys 


Met 


Pro 


Arg 


Asn 


Asn 


Gin 










85 










90 










95 




Leu 


Leu 


His 





























<210> 834 
<211> 96 
<212> PRT 

<213> Homo sapiens 
<400> 834 

Met Val Ser His Thr Phe His Met Arg Thr Glu Glu Ser Asp Ala Ser 

15 10 15 

Gin Glu Gly Asp Asp Leu Pro Lys Ser Ser Ala Asn Thr Ser His Pro 

20 25 30 

Lys Gin Asp Asp Ser Pro Lys Ser Ser Glu Glu Thr He Gin Pro Lys 

35 4 0 4 5 

Glu Gly Asp He Pro Lys Ala Pro Glu Glu Thr He Gin Ser Lys Lys 

50 55 60 

Glu Asp Leu Pro Lys Ser Ser Glu Lys Ala He Gin Pro Lys Glu Ser 
65 70 75 80 

Asn He Pro Lys Ser Ser Ala Lys Pro He Gin Pro Lys Leu Ala lie 
85 90 95 

<210> 835 
<211> 97 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 94 

<223> Xaa = Leu, Met 
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<400> 835 



Met Ser 


Leu 


Leu 


Asp 


Gly 


Leu 


Ala 


1 






5 








Ser Ser 


Lys 


Ala 


Arg 


Met 


Lys 


Lys 






20 










Lys Tyr 


Arg 


Leu 


Lys 


Tyr 


Leu 


Arg 




35 










40 


Val Phe 


Glu 


Asn 


Ala 


Ala 


He 


Cys 


50 










55 




Lys Phe 


Leu 


Lys 


Ala 


Lys 


Glu 


Glu 


65 








70 






Leu Gin 


Leu 


Gin 


Ala 


Leu 


Thr 


Glu 








85 








Leu 














<210> 836 












<211> 107 












<212> PRT 












<213> Homo sapiens 








<400> 836 












Met Ala 


Gin 


Glu 


Val 


Ser 


Glu 


Tyr 


1 






5 








Ala Trp 


Val 


Glu 


Ala 


Leu 


Arg 


Cys 






20 










Arg His 


Arg 


Arg 


Asp 


Phe 


Leu 


Leu 




35 










40 


Ala Gly 


Gly 


Ala 


Ala 


Ser 


Ala 


Ser 


50 










55 




Ser Gly 


Thr 


Arg 


Asn 


Arg 


Gin 


Leu 


65 








70 






Ala Trp 


Ala 


Asn 


His 


Val 


Phe 


Leu 








85 








Met Asp 


Lys 


He 


Leu 


Ser 


Met 


Ala 






100 











<210> 


837 










<211> 


65 










<212> 


PRT 










<213> 


Homo 


sapiens 








<220> 












<221> 


UNSURE 








<222> 


20 










<223> 


Xaa = 


Pro, Arg 








<400> 


837 










Met Arg Leu 


Leu Gin 


Pro 


Arg 


Ser 


1 




5 








Trp Leu Phe 


Xaa Leu 


His 


Gly 


Asp 






20 








Gly Ser Arg 


Lys Asn 


Met 


Asp 


Cys 




35 








40 


Cys Arg Arg 


Thr Ser 


Leu 


Trp 


Asn 


50 






55 




Gly 












65 












<210> 


838 










<211> 


81 










<212> 


PRT 










<213> 


Homo 


sapiens 









426 



Ser 


Ser 


Pro 


Arg 


Ala 


Pro 


Leu 


Gin 




10 










15 




Leu 


Pro 


Lys 


Lys 


Ser 


Gin 


Asn 


Glu 


25 










30 






Leu 


Arg 


Lys 


Ala 


Ala 


Lys 


Ala 


Thr 










45 








Asp 


Glu 


He 


Ala 


Arg 


Leu 


Glu 


Glu 








60 










Arg 


Arg 


Tyr 


Leu 


Leu 


Lys 


Lys 


Leu 






75 










80 


Gly Val 


Ser 


Thr 


Gly 


Xaa 


Gin 


Leu 




90 










95 





Leu 


Ser 


Gin 


Asn 


Pro 


Arg 


Val 


Ala 




10 










15 




Asp 


Gly 


Glu 


Thr 


Asp 


Lys 


His 


Trp 


25 










30 






Arg 


Asn 


Ala 


Gly Asp 


Leu 


Ala 


Pro 










45 








Thr 


Asp 


Glu 


Ala 


Ala 


Asp 


Ala 


Glu 








60 










Gin 


Gin 


Leu 


He 


Ser 


Phe 


Ser 


Met 






75 










80 


Gly 


Cys 


Arg 


Tyr 


Pro 


Gin 


Lys 


Val 




90 










95 




Glu 


Gly 


He 












105 

















Arg 


Leu 


Gin 


Arg 


Thr 


Ser 


Gin 


Lys 




10 










15 




Glu 


Ser 


Leu 


Ala 


Pro 


Lys 


His 


Pro 


25 










30 






Tyr 


Trp 


Gin 


Asn 


Thr 


Ser 


Val 


His 










45 








Leu 


His 


Leu 


Pro 


Gly 


Lys 


Thr 


Leu 



60 
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<220> 

<221> UNSURE 
<222> 10 

<223> Xaa = * , Trp 
<220> 

<22l> UNSURE 
<222> 9 

<223> Xaa = Ala, Pro 
<220> 

<221> UNSURE 
<222> 16 

<223> Xaa = Arg, Ser 
<220> 

<221> UNSURE 
<222> 12 

<223> Xaa = Gly, Trp 
<220> 

<221> UNSURE 
<222> 8 

<223> Xaa = Lys,Arg 



<400> 838 

Met Leu Ala Arg 

1 

Pro Val Phe Thr 
20 

lie Ser Pro Ser 
35 

Gly Phe Gin Arg 
50 

Pro Ser Thr Gin 

65 

Arg 



Gly Leu Val Xaa 

5 

lie Ser Phe His 

Leu Ser Phe Leu 
40 

Val Val Lys Asn 
55 

Asn Arg Leu Phe 
70 



Xaa Xaa Asn Xaa 
10 

His Leu Pro Thr 
25 

lie Cys Glu Thr 

Pro Cys Glu His 
60 

His Trp Arg Arg 
75 



Gly Asn Arg Xaa 
15 

Trp Asn Leu Tyr 
30 

Ala Ala Asn Thr 
45 

Leu Leu Gin Leu 

Arg Arg Asp Gly 
80 



<210> 839 
<211> 50 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 47 

<223> Xaa = Phe, Ser 



<400> 839 

Met Val Arg Pro 

1 

Leu Gly Gly Arg 
20 

Arg Ser Arg Ser 
35 

Asp Pro 
50 



Arg Arg Ala Pro 
5 

Gly Arg Pro Pro 

Trp Pro Ala Ser 
40 



Tyr Arg Ser Gly 
10 

Arg Pro Leu Val 
25 

Pro Pro Arg Pro 



Ala Gly Gly Pro 
15 

Val Arg Ala Val 
30 

Ala Ala Xaa Ala 
45 



<210> 840 
<211> 85 
<212> PRT 

<213> Homo sapiens 
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<400> 840 




























Met 


Pro 


Ser 


Phe 


Val 


Asp 


Arg 


He 


Phe 


Gly 


Gly 


Glu 


Leu 


Thr 


Ser 


Met 


1 








5 










10 










15 




lie 


Met 


Tyr 


Asp 
20 


Gin 


Cys 


Arg 


Thr 


Val 
25 


Ser 


Leu 


Val 


His 


Glu 
30 


Ser 


Phe 


Leu 


Asp 


Leu 
35 


Ser 


Leu 


Pro 


Val 


Leu 
40 


Asp 


Asp 


Gin 


Ser 


Gly 
45 


Lys 


Lys 


Ser 


Val 


Asn 
50 


Asp 


Lys 


Asn 


Leu 


Lys 
55 


Lys 


Thr 


Val 


Glu 


Asp 
60 


Glu 


Asp 


Gin 


Asp 


Ser 


Glu 


Glu 


Glu 


Lys 


Asp 


Asn 


Asp 


Ser 


Tyr 


He 


Lys 


Glu 


Arg 


Ser 


Asp 


65 










70 










75 










80 


lie 


Pro 


Ser 


Gly 


Thr 
85 

























<210> 841 
<211> 66 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 59 

<223> Xaa = Lys, Asn 
<400> 841 

Met Ala Ser He Phe Ser Lys Leu 
1 5 

Leu Phe Ala Thr Met Gly Thr Ser 
20 

Asn Arg Gin Lys Val Cys Ala Glu 

35 40 
Pro Pro Ser Ala Asp Tyr Pro Asp 

50 " " 55 

Pro Ser 
65 



Leu 


Thr 


Gly Arg 


Asn 


Ala 


Ser 


Leu 




10 








15 




Val 


Leu 


Thr Thr 


Gly 


Tyr 


Leu 


Leu 


25 








30 






Val 


Arg 


Glu Gin 


Pro 


Arg 


Leu 


Phe 








45 








Leu 


Arg 


Xaa Thr 


Thr 


Thr 


Ala 


Trp 



60 



<210> 842 
<211> 74 
<212> PRT 

<213> Homo sapiens 
<400> 842 

Met Ala Ala Ala Ala Gly Leu Gly Leu Gin Arg Glu Ala Ala Glu Gly 

1 5 10 15 

Lys Trp Ser Ala Trp Arg Met Thr Glu Glu Thr Arg He Val Tyr Trp 

20 25 30 

lie Lys Asp Arg Gin Leu Thr Asn Arg Asp Ser Thr He Leu Glu Leu 

35 4 0 4 5 

Gin Lys Val Leu Lys Thr Cys Trp Phe Val Cys Trp Ser He Phe Gin 

50 55 60 

Leu Lys Phe Leu Phe Leu Phe Phe Phe Phe 
65 70 

<210> 843 
<211> 68 
<212> PRT 

<213> Homo sapiens 
<400> 843 

Met Lys Lys Asn He Trp Leu Leu Ser Gly Ser Arg Gin Gin Tyr Leu 
15 10 15 

He Ala Leu Trp Arg Leu Leu Leu Leu Glu He Tyr He Ser Glu Gly 
20 25 30 
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Thr His Ser 


Thr Glu 


Glu 


Asp 


lie 


Asn 


Lys 


Gin 


He 


Asn 


Asp 


Lys 


Glu 


35 








40 










45 








Arg Val Ala 


Ala Ala 


Met 


Glu 


Asn 


Pro 


Asn 


Leu 


Arg 


Glu 


He 


Val 


Asp 


50 
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Trp 
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Asp 


Tyr 
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Trp 


Val 


Thr 


Pro 
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Pro 
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Thr 


Glu 


Leu 


Ser 


Gin 
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25 
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Leu 


Glu 
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Thr 
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Lys 


Phe 
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Thr 


Gin 


Val 
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45 
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Lys 


Thr 


Leu 


Arg 
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Asp 


Lys 


Thr 


Val 


Leu 


Leu 


Glu 
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Glu lie Tyr 


Thr 


His 


Lys 


Asn 


Leu 


Phe 


Thr 


Glu 


Arg 


Leu 


Asn 


Lys 


He 
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Ser Asp Gly 


Leu 


Lys 


Glu 


Lys 


Asp 


He 


Gin 


Ser 


Thr 


Leu 


Lys 


Val 


Thr 
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90 
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Ser Cys Ala 


Asp 


Cys 


Arg 


Thr 


His 


Phe 


Leu 


Ser 'Cys 


Asn 


Asp 


Pro 


Thr 
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110 








Phe Cys Pro 


Ala 


Arg 


Asn 


Arg 


Arg 


Thr 


Ser 


Leu 


Trp 


Ala 


Val 


Ser 


Leu 
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Ser Ser Ala 


Leu 


Leu 


Leu 


Ala 


He 


Ala 


Gly Asp 


Val 


Xaa 


Phe 


Thr 


Gly 
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Lys Gly Arg 
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<400> 862 

Met Val Lys Val Met Trp 
-25 

Leu Leu Leu Thr Val Pro 
-10 -5 
Gin Leu Ala Ser Cys Ala 
10 

Glu Ala Val Ala Gin Ser 
25 



Val Ser Val Leu Ala 
-20 

Val Ala Glu Gly Val 
1 

Ala Ala Met Thr Ser 
15 



Leu Ala Ala Ala lie 
-15 

Thr Cys Ser Pro Met 
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Ser Ser Pro Pro Ser 
20 
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Met 


Lys Pro 


Val 


Ser 


Arg 


Arg 


Thr 


Leu 


Asp 


Trp 


He 


Tyr 


Ser 


Val 
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Leu 


Leu Ala 


He 


Val 


Leu 


He 


Ser 


Trp 


Gly 


Cys 


He 


He 


Tyr 


Ala 


-10 








-5 
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5 


Met 


Val Ser 


Ala 


Arg 


Arg 


Gin 


Leu 


Arg 


Lys 


Lys 


Tyr 


Pro 
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Lys 
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Glu 


Asn 
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Arg 


Thr 


Leu 
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Ala 


Arg 


Leu 


Ser 
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Arg 


Ser 


Arg 


Leu 


Gly 


Thr 


Lys 


Ala 








30 


Pro 


Ala 


He 


Ala 






45 




Glu 


Pro 


Ala 


Ser 




60 






Ala 


Glu 


Lys 


Glu 



Ala He Leu Ala 
-15 

His Ser Arg Gin 
1 

Gin Arg Thr Ser 
15 

Trp Leu Gin Ser 

Glu Tyr Gin Arg 
50 

Thr Arg Glu Asp 
65 

Lys 



Ala He Leu Leu 

Glu Leu Met Arg 
5 

Gin Lys Trp Leu 
20 

He Gin Ala Gin 
35 

Ala Leu Gin Glu 

Ser Gly His Ser 
70 



Val Ala Leu Gin 
-10 

Leu Leu Gin Pro 
10 

Phe Pro Leu His 
25 

Gly Lys Thr Trp 
40 

Asn Val Ala Met 
55 

Ala Ala Glu Leu 
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<400> 865 



Met Val Leu 


Leu 


Ser 


Ser 


Thr 


Met 


Ser 


Val 


Leu 


lie 


Pne 


Cys 


Leu 


Leu 
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Asn Leu Ala 


He 


Ser 


Asp 


Arg 


Arg 


Val 


Met 


Lys 


Ser 


Pro 


Thr 


He 


He 


-5 






1 








5 










10 




Val He His 


Leu 


Phe 


Phe 


Pro 


His 


Ser 


Ser 


Thr 


Ser 


He 


Cys 


He 


Glu 
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Leu 
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Gly 


Thr 
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Phe 


Trp 
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Leu Ala Ser 


Gly 


Trp 


Thr 


Pro 


Ala 


Phe 


Ala 


Tyr 


Ser 


Pro 


Arg 


Thr 


Pro 
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Asp Arg Val 


Ser 


Glu 


Ala 


Asp 


He 


Gin 


Arg 


Leu 


Leu 


His 


Gly 


Val 


Met 
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15 










20 
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He 


Ala 


Arg 


Pro 


Arg 


Val 


Glu 


Tyr 


Pro 


Ala 


His 


Gin 
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30 










35 








Ala Met Asn 


Leu 


Val 


Gly 


Pro 


Gin 


Ser 


He 


Glu 


Gly 


Gly 


Ala 


His 


Glu 


40 








4 D 
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Gly Leu Gin 


His 


Leu 


Gly 


Pro 


Phe 


Gly Asn 


He 


Pro 


Asn 


He 


Val 


Ala 


55 
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Glu Leu Thr 


Gly 


Asp 


Asn 


He 


Pro 


Lys 


Asp 


Phe 
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Glu 


Asp 


Gin 


Gly 
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val 
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Met Ala Ala 


Ala 
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Ala 


Ala 
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Ala 
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Gly 


Ser 
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Ala 
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Gly 


Ser 


Leu 


Ala 


Gly Arg 


Arg 


Ala 


Gly 


Pro 


Ala 
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He Trp Ala 
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Gly 


Trp 


Val 


Pro 


Ala 


Ala 


Gly 


Gly 


Pro 
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Pro 
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Val 
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Gly 


Lys 
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Glu 


Ala 


Asp 


Glu 


Leu 


O Q Y~ 


Gly Asp 


Ala 




65 










70 










75 






Ser Val Glu 


Asp 


Asp 


Ala 


Phe 


He 


Lys 


Val 


Lys 


Cys 










80 










o O 


















<210> 869 




























<211> 60 




























<212> PRT 




























<213> Homo , 


sapiens 
























<220> 




























<221> SIGNAL 


























<222> -41. . 


-1 


























<400> 869 




























Met Pro Glu 


Ser 


Leu 


Trp 


Gin 


His 


Ser 


His 


Trp 


Cys 


Gly 


Glu 


Ser 


Phe 


-40 








-35 










-30 










Gin Leu Lys 


Glu 


Ala 


<jj_y 


r ne 


Leu 


\j jl y 


i nr 


Leu 


Glu 


Val 


Phe 


Trp 


jxie z. 


-25 






-20 










-15 










-10 


Ser Phe Tyr 


Leu 


Ser 


Arg 


Val 


Asn 


Cys 


Thr 


Thr 


Lys 


Glu 


Thr 


Ser 


Pro 






-5 










1 








5 






Leu Pro Lys 


Pro 


Ala 


Arg 


His 


Leu 


Gly 


Leu 


Glu 


Pro 










10 










-L — ' 


















<210> 870 




























<211> 60 




























<212> PRT 




























<213> Homo 


sapiens 
























<220> 




























<221> SIGNAL 


























<222> -45. . 


-1 


























<400> 870 




























Met His Ser 


Leu 


Phe 


He 


Ala 


Ser 


Leu 


Lys 


Val 


Leu 


Phe 


Tyr 


Tyr 


Ser 


-45 






-40 










-35 










-30 


Phe Ser Phe 


Arg 


Phe 


Asn 


Trp 


Phe 


Asp 


Cys 


Leu 


Leu 


His 


Asn 


Leu 


Gly 






-25 










-20 










-15 





WO 02/083898 PCT/IBO 1/009 14 

439 

Glu Asn Phe Leu Ser Leu Leu Ser Lys Ser Cys Ser Ala Asp Pro Ser 

-10 -5 1 

Gly Ser Thr Phe Met Arg Asp lie Glu Thr Asn Lys 
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Glu 
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Arg 
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Leu 


Met 


Leu 
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Ser 


Gin 


Leu 


Arg 



Val Thr Leu Val 

Asp lie Gly Ser 
10 

Gly Gly Gly Ser 
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Cys Ser Ser Pro 
40 

Glu Thr Val Gin 

Thr Pro Glu Leu 
75 

Leu Gly Gin lie 
90 

Lys Pro Gly Lys 
105 

Asn 
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Gin Leu Ser He 
-5 

Lys Ser Asp Pro 
15 

Trp Ala Glu Leu 
30 

Glu Phe Ser Lys 
45 

Lys Leu Arg Phe 
60 

Arg Asp Asp Asp 

Val Ser Ser Gin 
95 

Pro Ala Gly Arg 
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Ser Cys Asp His 
1 

Leu Cys Val Leu 

Gly Arg Thr Glu 
35 

Thr Leu Gin Leu 
50 

Gly He Tyr Asp 
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Phe Leu Gly Gly 
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Met Val Lys Ser Ser 
-45 

Arg Glu Lys Glu Gly 
-30 

Thr Met Pro Leu Leu 
-15 

Ser Ser Arg Val Ser 
5 

Arg Trp Cys Ser 
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Leu Gin Arg He Leu Asn 
-40 

Asp Lys Pro Ser Ala Thr 
-2 5 

Ser Leu His Ser Arg Gly 
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10 
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<400> 873 
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Ala 
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Trp 
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Leu 


Gly Arg 


Trp 
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Arg 
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-15 
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Glu 


Gly 


Gly 


Arg 


Glu Gly 
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Pro 


Lys 
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Pro 


Ser 
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Met Ala Ala Ala 

Val Ser Ala lie 
1 

Ala Ala Cys Gly 
15 

Ser Gin Ala Lys 
30 

Val Leu Gin Lys 
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Val Gly Arg Leu 
-15 

Pro Trp Gly lie 
5 

Arg Thr Ser Leu 
20 

lie lie Ser Ser 

35 

Leu Leu Leu Leu 
50 



Leu Arg Ala Ser 
-10 

Ser Ala Thr Ala 

Thr Asn Leu Leu 
25 

lie Leu Trp lie 
40 

Val Thr Lys Leu 
55 



Val Ala Arg His 
-5 

Ala Leu Arg Pro 
10 

Cys Ser Gly Ser 

Ser Pro Leu Cys 
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Thr Asn Phe Thr 
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Met Glu Gly Arg 

-35 

Val Asn Gin Val 

Val Arg Ser Asp 
1 

Asp Leu Leu Met 
15 



Ser Arg Ser Arg 
-30 

Phe Val Gly Leu 
-15 

Gin lie Lys Asn 
5 

Asp 



Lys Ala Cys Leu 
-25 

Val Glu Val Val 
-10 

Met Tyr Leu Lys 



Glu Val Ala Thr 
-20 

Ala Val Gly Val 
-5 

Tyr Ser Gin His 
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Met Gin Glu Met 
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Glu Asn Gly Trp 



Ser Arg Pro lie 
5 

Pro His Glu Ala 
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10 

Thr Thr Thr Gly 



Leu Ser Leu Ala 
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Gly Lys Glu Gly 



WO 02/083898 PCT/IBO 1/009 14 

441 

20 25 30 

Cys Ser Thr Pro Arg Leu lie Gly Lys Pro Asn Gin lie Trp Asp Leu 

35 40 45 

Tyr Ser Val Ala Glu Ala Gly Arg 
50 55 
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Met Lys Lys Pro 

1 

lie Asp Leu Leu 
20 

Ser Leu Ser Tyr 
35 

Phe Ser Ser lie 
50 



Gin Ser Glu Asn 
5 

Val Val Phe Pro 

Tyr Ser Leu Lys 
40 

Leu 



Phe Gly Gly Gly 
10 

Lys Val His Val 
25 

Lys Lys Met Leu 



Pro Arg Thr Trp 
15 

Leu Glu lie Phe 
30 

Thr Val Tyr Leu 
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<400> 878 

Met Ser Thr Lys 

1 

Gin Phe Leu Ser 
20 

Leu His Asn Asp 
35 

Ser Pro Ala Pro 
50 



Arg Arg Leu Glu 
5 

Glu Glu Asn Met 

Gin Pro Leu Leu 
40 

Thr Gin Glu Pro 

55 



Glu Glu Gin Glu 
10 

Ala Thr His Phe 
25 

Gin Pro His His 
Leu Leu 



Pro Leu Arg Lys 
15 

Ser Gin Leu Ser 
30 

Asp Leu Leu Pro 
45 
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Met Ala Ser Asp 

1 

Ser Trp Arg Thr 
20 

Ser Ser Ser Val 
35 

Arg Arg Arg Leu 
50 



Leu Asp Phe Ser 
5 

Cys Tyr Gly Thr 

Cys Trp Pro Ser 
40 

Asn Arg Leu Ser 
55 



Pro Pro Glu Cys 
10 

Asp Ser Ser Trp 
25 

Ser Tyr Pro Phe 

Pro Glu Val Leu 
60 



Pro Ser Pro Leu 
15 

Glu Pro Ser Ser 
30 

Pro Ser Pro Thr 

45 

Arg 
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<400> 880 

Met Leu Ser Leu Asp Phe Leu Asp 
1 5 
Gin Leu Tyr Tyr Gin Val Leu Asn 
2 0 

Leu Met lie Trp Lys Gly Leu Met 
35 40 



Asp Val Arg Arg Met Asn Lys Arg 

10 15 

Phe Gly Met lie Val Ser Ser Ala 
25 30 

Val lie Thr Gly Ser Glu Ser Pro 
45 
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lie Val Val Val Ser Gly Lys Met Gly lie Ser Ser Phe 
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<400> 881 

Met lie Val Ser 

1 

Thr Gly Ser Glu 
20 

Pro Ala Phe His 
35 

Asp Pro lie Arg 
50 

Glu He Pro He 
65 

Pro Tyr He Gly 

Lys Tyr Ala Val 
100 

Glu 



Ser Ala Leu Met 
5 

Ser Pro He Val 

Arg Gly Asp Leu 
4 0 

Val Gly Glu He 
55 

Val His Arg Val 
70 

He Val Thr He 
85 

Leu Phe Leu Leu 



He Trp Lys Gly 
10 

Val Val Leu Ser 
25 

Leu Phe Leu Thr 

Val Val Phe Arg 
60 

Leu Lys He His 
75 

Leu Met Asn Asp 
90 

Gly Leu Phe Val 
105 



Leu Met Val He 
15 

Gly Ser Met Glu 
30 

Asn Arg Val Glu 
45 

He Glu Gly Arg 

Glu Lys Phe Val 
80 

Tyr Pro Lys Phe 
95 

Leu Val His Arg 
110 
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<223> Xaa = Cys , Gly , Arg, Ser 
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<223> Xaa = His, Gin 
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Gly 


Ser 


Ser 


Arg 


Asp 


Thr 


Ser 


1 
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Xaa 


Ser 


Ser 


Leu 


Asn 


Gly 


Phe 


Gly 
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Leu 


Gly 


Cys 


His 


Ser 


Ala 


Thr 


Asp 






35 










40 


Gly 


Phe 


Leu 


His 


Phe 


Cys 


Leu 


Tyr 




50 










55 




His 


Asp 


He 


Tyr 


Leu 


Ser 


Val 


Met 


65 










70 







Arg Ser Leu Gly Gly Cys Ser Pro 

10 15 
Phe Lys Asp Asp His Leu Arg Leu 
25 30 
His Leu Gin Leu Leu Ala Thr Leu 
45 

Val Leu Leu Leu Xaa Arg Leu Ser 
60 
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<400> 883 

Met Trp Tyr Ala Lys Lys Gin Gly Gin Lys Asn Lys Glu Ala Glu Arg 

1 5 10 15 

Gin Glu Arg Ala Val Val His Met Asp Glu Arg Arg Glu Glu Gin lie 

20 25 30 

Val Gin Leu Leu Asn Ser Val Gin Ala Lys Asn Asp Lys Glu Ser Glu 
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35 40 45 

Ala Gin lie Ser Thr Pro Asp Leu Ala Val Arg Ser Ala Pro Ser Gly 

50 55 60 

Gin Pro Phe Arg Ala Cys 
65 7 0 
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<213> Homo sapiens 



<400> 884 

Met Ser Gly Phe Glu Asn Leu Asn 
1 5 

Ser lie Asp Asp Gin Ser Gin Gin 
20 

Gly Pro Tyr Asn Ser Met Leu Ala 

35 40 
Leu Ser Leu Gin Thr 
50 



Thr Asp Phe Tyr Gin Thr Ser Tyr 

10 15 
Ser Tyr Asp Tyr Gly Gly Ser Gly 
25 ^ 30 
Met Thr lie Arg Ser Lys Ala Asp 
45 



<210> 885 
<211> 56 
<212> PRT 

<213> Homo sapiens 



<400> 885 

Met lie Phe Cys 

1 

His Thr Pro Gly 
20 

Arg Val Phe Thr 
35 

Leu Phe His Leu 
50 



Ser His Lys Leu 
5 

Val Cys Lys Tyr 

Ser Val Thr Phe 
40 

Thr Val He He 
55 



Ala Leu Leu Ser 
10 

Ser Leu Pro Leu 
25 

Cys Ala Tyr Cys 



He Phe Pro Glu 
15 

Leu His Thr His 
30 

Ser Ala Pro Cys 
45 



<210> 886 
<211> 103 
<212> PRT 
<213> Homo sapiens 

<400> 886 



Met 


Met 


Gly 


Ser 


Lys 


Met 


Ala 


Ser 


Ala 


Ser 


Arg 


Val 


Val 


Gin 


Val 


Val 


1 








5 










10 










15 




Lys 


Pro 


His 


Thr 


Pro 


Leu 


He 


Arg 


Phe 


Pro 


Asp 


Arg 


Arg 


Asp 


Asn 


Pro 








20 










25 










30 






Lys 


Pro 


Asn 


Val 


Ser 


Glu 


Ala 


Leu 


Arg 


Ser 


Ala 


Gly 


Leu 


Pro 


Ser 


His 




35 










40 










45 








Ser 


Ser 


Val 


He 


Ser 


Gin 


His 


Ser 


Lys 


Gly 


Ser 


Lys 


Ser 


Pro 


Asp 


Leu 




50 










55 










60 










Leu 


Met 


Tyr 


Gin 


Gly 


Pro 


Pro 


Asp 


Thr 


Ala 


Glu 


He 


He 


Lys 


Thr 


Leu 


65 










70 










75 










80 


Pro 


Gin 


Lys 


Tyr 


Arg 


Arg 


Lys 


Leu 


Val 


Ser 


Gin 


Glu 


Glu 


Met 


Glu 


Phe 










85 










90 










95 




He 


Gin 


Arg 


Gly 


Gly 


Pro 


Glu 





















100 



<210> 887 
<211> 124 
<212> PRT 

<213> Homo sapiens 



<400> 887 



WO 02/083898 



PCT/IB01/00914 



Met Ala Arg Val 
1 

Leu Leu lie Phe 
20 

Arg Phe Ala Tyr 
35 

Ser lie Asn Pro 
50 

Val Ser Gly Leu 
65 

Asn Thr Leu Ser 

Asp Ser Pro Gin 
100 

Tyr Leu Ala Ala 
115 



lie lie Asp Asn 
5 

Gly Ala Phe Val 

Tyr Lys Leu Leu 
4 0 

Gly Glu Thr Ala 
55 

Gly Phe Gly lie 
70 

Asp Ser Leu Gly 
85 

Phe Phe Leu Leu 

Cys lie Leu Gly 
120 



444 



Lys 


Asp 


Gly 


Pro 




10 






Ser 


Val 


Tyr 


He 


25 








Lys 


Lys 


Ala 


Ser 


Pro 


Ser 


Met 


Arg 








60 


Met 


Ser 


Gly 


Val 






75 




Pro 


Gly 


Thr 


Val 




90 






Phe 


Ser 


Phe 


His 


105 








His 


Cys 


He 


Phe 



Thr Gin Lys Tyr 
15 

Gin Glu Met Phe 
30 

Glu Gly Leu Lys 
45 

Leu Leu Ala Tyr 

Phe Ser Phe Val 
80 

Gly He His Gly 
95 

Asp Ala Gly His 
110 



<210> 888 
<211> 62 
<212> PRT 

<213> Homo sapiens 



<400> 888 

Met Ala Ser Ser 

1 

Thr Pro Glu He 
20 

Pro Leu Pro Leu 
35 

Gly Thr Leu Cys 
50 



Thr Val Pro Val 
5 

Pro Asp Asn Val 

He Gly Met Thr 
40 

Cys Gly Ser Leu 

55 



Ser Ala Ala Gly 
10 

Gly Asp Trp Leu 
25 

Ser Gly Glu Leu 

Ala Gly Gin Glu 
60 



Ser Ala Asn Glu 
15 

Arg Gly Val Tyr 
30 

Asp Thr Lys Phe 
4 5 

Leu Glu 



<210> 889 
<211> 65 
<212> PRT 

<213> Homo sapiens 



<400> 889 



Met 


Leu 


Arg 


His 


Ser 


Pro 


Ser 


Leu 


Trp 


Glu 


Leu 


Val 


Glu 


Glu 


His 


Val 


1 








5 










10 










15 




Pro 


Leu 


Arg 


Glu 


Arg 


Arg 


Glu 


Val 


Lys 


Arg 


He 


Leu 


Gly 


Gly 


Gly 


Gly 








20 










25 










30 






Gly 


Gly 


Pro 


Glu 


Pro 


Gly 


Ala 


Ala 


Gly 


Gly 


Gly 


Gly 


Asp 


Val 


Thr 


Gly 






35 










40 










45 








Thr 


Ala 


Pro 


Arg 


Gly 


Ser 


He 


Leu 


Ser 


Ser 


Pro 


Gin 


Leu 


Pro 


Pro 


His 



50 55 60 

Leu 
65 



<210> 890 
<211> 140 
<212> PRT 

<213> Homo sapiens 



<400> 890 

Met Gly Gin Asn 

1 

Arg Arg Glu Gly 
20 

Met Asp Met Arg 
35 

Asn Phe Arg Ala 
50 



Arg Glu Met Leu 
5 

Trp Gin Arg Gly 

Gly He Phe Leu 
40 

Ala Pro He Arg 
55 



Pro Phe Trp Met 
10 

Trp His Gly Tyr 
25 

Ala Phe Gly Pro 

Ser Val Asp Val 
60 



Asn Ser Thr Gly 
15 

Asp Asn Glu Leu 
30 

Asp Phe Lys Ser 
4 5 

Tyr Asn Val Met 



WO 02/083898 

Cys Asn Val Val Gly lie 
65 70 
Arg Val Met Cys Met Leu 
85 

Gly Pro Ala Thr Val Pro 
100 

Thr Asp His lie Ala Cys 

115 

Pro Pro Lys Pro Asp Asp 

130 



445 

Thr Pro Leu Pro Asn Asn 
75 

Lys Gly Arg Ala Ala Leu 
90 

Trp His Asp Ser Ser Leu 
105 

Leu Arg Lys Lys His His 
120 

Phe His Phe Met Cys Glu 
135 140 



PCT/IB01/00914 



Gly 


Ser 


Trp 


Ser 








80 


Pro 


Arg 


Leu 


Ser 






95 




Pro 


Ala 


Cys 


He 




110 






Gin 


Gin 


Ser 


Gly 



125 



<210> 891 
<211> 62 
<212> PRT 

<213> Homo sapiens 
<400> 891 

Met Glu Met Ala Ser Ser Ala Gly 
1 5 
Pro Leu Val Phe Leu Arg Leu Ser 
20 

Asp Ala Gly Lys Phe His Val Ala 

35 40 
Leu Cys Pro Leu Ser Leu Trp Pro 
50 55 



Ser Trp Leu Ser Gly Cys Leu He 

10 15 
Val His Val Ser Gly His Ala Gly 
25 30 
Leu Leu Gly Gly Thr Ala Glu Leu 
45 

Gly Thr Tyr Pro Arg Arg 
60 



<210> 892 
<211> 114 
<212> PRT 

<213> Homo sapiens 



<400> 892 



Met 


Asp 


He 


Val 


Ala 


His 


Cys 


Glu 


Lys 


Ala 


Met 


Ala 


Glu 


Ser 


Val 


Lys 


1 








5 










10 










15 




Lys 


His 


Gin 


Phe 
20 


Ser 


Thr 


Ser 


Glu 


Ser 
25 


Tyr 


Leu 


Pro 


Ala 


Thr 
30 


Leu 


He 


Leu 


Asn 


Ser 
35 


He 


Ser 


Phe 


Asn 


Thr 
40 


Asp 


Gly 


Ser 


He 


Gin 
45 


Tyr 


Lys 


Glu 


Lys 


Leu 
50 


Tyr 


Phe 


Ser 


Ala 


Ser 
55 


Glu 


Ala 


Leu 


Asp 


Ala 
60 


Tyr 


He 


Asp 


Asp 


Phe 


His 


Leu 


Asn 


Tyr 


Glu 


Pro 


Pro 


Asp 


He 


Asp 


Thr 


Lys 


Val 


Asn 


Leu 


65 










70 










75 










80 


Asp 


Gin 


Ser 


Pro 


Leu 
85 


Glu 


Phe 


Leu 


Ala 


Lys 
90 


Ser 


Asn 


Ser 


Gly 


Val 
95 


His 


Gly 


He 


Ala 


Pro 
100 


Gly 


Gin 


Ser 


Gly 


Met 
105 


Glu 


Lys 


He 


He 


Gin 
110 


Trp 


Tyr 



Ala Ala 



<210> 893 
<211> 55 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 27 

<223> Xaa = Asn, Thr 
<220> 

<221> UNSURE 
<222> 39 

<223> Xaa = Leu, Val 
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446 

<220> 

<221> UNSURE 

<222> 34,37 

<223> Xaa = Lys,Asn 

<400> 893 



Met Arg Cys 


Arg 


Pro 


Ser 


Ser 


Val 


His 


Val 


Ala 


He 


Leu 


Thr 


Glu 


Ala 


1 




5 










1 U 










J. D 




lie Pro Pro 


Lys 


Met 


lie 


Ser 


Phe 


Ser 


Thr 


Xaa 


Ser 


Lys 


Arg 


He 


Phe 




20 










25 










30 






Lys Xaa Ser 


Pro 


Xaa 


Met 


Xaa 


Lys 


Ser 


Gin 


Val 


Cys 


Pro 


Leu 


He 


Gly 


35 










40 










45 








Glu lie Ser 


Gly 


Ser 


Trp 


Met 




















50 








55 




















<210> 894 




























<211> 123 




























<212> PRT 




























<213> Homo sapiens 
























<220> 




























<221> UNSURE 


























<222> 26 




























<223> Xaa = 


His, 


Gin 
























<400> 894 




























Met Pro Lys 


Gly 


Gly Arg 


Lys 


Gly 


Gly 


His 


Lys 


Gly 


Arg 


Ala 


Arg 


Gin 


1 




5 










10 










15 




Tyr Thr Ser 


Pro 


Glu 


Glu 


He 


Asp 


Ala 


Xaa 


Cys 


Arg 


Leu 


Arg 


Ser 


Arg 




20 










25 










30 






Arg Pro Gly 


Lys 


Lys 


Arg 


Ser 


Lys 


Lys 


Gly 


Gly 


Asp 


Gly 


Ala 


Ala 


Gly 


35 










40 










45 








Asp Pro Lys 


Lys 


Glu 


Lys 


Lys 


Ser 


Leu 


Asp 


Ser 


Asp 


Glu 


Ser 


Glu 


Asp 


50 








55 










60 










Glu Glu Asp 


Asp 


Tyr 


Gin 


Gin 


Lys 


Arg 


Lys 


Gly 


Val 


Glu 


Gly 


Leu 


He 


65 






70 










/ o 










o u 


Asp lie Glu 


Asn 


Pro 


Asn 


Arg 


Val 


Ala 


Gin 


Thr 


Thr 


Lys 


Lys 


Val 


Thr 






85 










90 










95 




Gin Leu Asp 


Leu 


Asp 


Gly 


Pro 


Arg 


Ser 


Phe 


Arg 


Gly 


Glu 


Asn 


Glu 


Lys 




100 










105 










110 






Arg Leu Arg 


Ser 


Arg 


Arg 


bin 


Lys 


oer 


v ax 


i nr 












115 










120 


















<210> 895 




























<211> 123 




























<212> PRT 




























<213> Homo sapiens 
























<220> 




























<221> UNSURE 


























<222> 26 




























<223> Xaa = 


His, 


Gin 
























<400> 895 




























Met Pro Lys 


Gly 


Gly 


Arg 


Lys 


Gly 


Gly 


His 


Lys 


Gly 


Arg 


Ala 


Arg 


Gin 


1 




5 










10 










15 




Tyr Thr Ser 


Pro 


Glu 


Glu 


He 


Asp 


Ala 


Xaa 


Cys 


Arg 


Leu 


Arg 


Ser 


Arg 




20 










25 










30 






Arg Pro Gly 


Lys 


Lys 


Arg 


Ser 


Lys 


Lys 


Gly 


Gly 


Asp 


Gly Ala 


Ala 


Gly 


35 










40 










45 








Asp Pro Lys 


Lys 


Glu 


Lys 


Lys 


Ser 


Leu 


Asp 


Ser 


Asp 


Glu 


Ser 


Glu 


Asp 


50 








55 










60 










Glu Glu Asp 


Asp 


Tyr 


Gin 


Gin 


Lys 


Arg 


Lys 


Gly 


Val 


Glu 


Gly 


Leu 


He 



WO 02/083898 

65 

Asp lie Glu Asn Pro 
85 

Gin Leu Asp Leu Asp 
100 

Arg Leu Arg Ser Arg 
115 

<210> 896 
<211> 81 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 78 

<223> Xaa = Ser f Thr 



<400> 896 



Met 


Val 


Gly 


Leu 


Lys 


Lys 


Lys 


Ser 


Ala 


Asp Asp 


Val 


Lys 


Lys 


Val 


Phe 


1 








5 










10 








15 




His 


Met 


Leu 


Asp 


Lys 


Asp 


Lys 


Ser 


Gly 


Phe He 


Glu 


Glu 


Asp 


Glu 


Leu 








20 










25 








30 






Gly 


Phe 


He 


Leu 


Lys 


Gly 


Phe 


Ser 


Pro Asp Ala 


Arg 


Asp 


Leu 


Ser 


Ala 






35 










40 








45 








Lys 


Glu 


Thr 


Lys 


Met 


Leu 


Met 


Ala 


Ala 


Gly Asp 


Lys 


Asp 


Gly 


Asp 


Gly 




50 










55 








60 










Lys 


He 


Gly 


Val 


Asp 


Glu 


Phe 


Ser 


Thr 


Leu Val 


Ala 


Glu 


Xaa 


Lys 


Lys 


65 










70 








75 










80 



His 

<210> 897 
<211> 143 
<212> PRT 

<213> Homo sapiens 
<400> 897 



Met 


Tyr 


Thr 


Thr 


Val 


Asp 


Ala 


Asn 


Gly 


Tyr 


Leu 


Lys 


Asn 


Gly 


Ser 


Ala 


1 








5 










10 










15 




Gly 


Gin 


Leu 


Ser 


Gin 


Ser 


Ser 


His 


Leu 


Ala 


Leu 


Gin 


Leu 


Pro 


Tyr 


Asn 






20 










25 










30 






Val 


Leu 


Gly 
35 


Leu 


Gly 


Arg 


Ser 


Ala 
40 


Asn 


Phe 


Leu 


Asp 


His 
45 


Leu 


Tyr 


Val 


Gly 


He 
50 


Pro 


Arg 


Pro 


Ser 


Gly 
55 


Glu 


Lys 


Ser 


He 


Arg 
60 


Lys 


Gin 


Glu 


Trp 


Thr 


Ala 


He 


He 


Pro 


Asn 


Ser 


Gin 


Leu 


He 


Val 


He 


Pro 


Tyr 


Pro 


His 


65 










70 










75 










80 


Asn 


Val 


Pro 


Arg 


Ser 
85 


Trp 


Ser 


Ala 


Lys 


Leu 
90 


Tyr 


Leu 


Thr 


Pro 


Ser 
95 


Asn 


He 


Val 


Leu 


Leu 
100 


Thr 


Ala 


He 


Ala 


Leu 
105 


He 


Gly 


Val 


Cys 


Val 
110 


Phe 


He 


Leu 


Ala 


He 
115 


He 


Gly 


He 


Leu 


His 
120 


Trp 


Gin 


Glu 


Lys 


Lys 
125 


Ala 


Asp 


Asp 


Arg 


Glu 
130 


Lys 


Arg 


Gin 


Glu 


Ala 
135 


His 


Arg 


Phe 


His 


Phe 
140 


Asp 


Ala 


Met 





<210> 898 
<211> 68 
<212> PRT 

<213> Homo sapiens 
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70 75 80 

Asn Arg Val Ala Gin Thr Thr Lys Lys Val Thr 

90 95 
Gly Pro Arg Ser Phe Arg Gly Glu Asn Glu Lys 

105 110 
Arg Gin Lys Ser Val Thr 
120 



<400> 898 

Met Met Asp Thr Met Asp Pro Phe Trp Phe Asn Ala Phe Lys Arg Thr 
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10 15 
s Leu Arg Met Ser Lys His Thr Asp Ala Ala 

25 30 
u Lys Lys Gly Cys Thr Gly Val lie Thr Leu 

40 45 
u Asn Ala Leu Thr Leu Asn Met lie Arg Gin 
55 60 

5 Tyr His Ser 
65 



1 








5 


Asn 


Thr 


He 


Leu 


His 








20 




Glu 


Glu 


Val 


Leu 


Leu 






35 






Asn 


Arg 


Pro 


Lys 


Phe 




50 








He 


Tyr 


His 


Ser 





<210> 899 
<211> 74 
<212> PRT 

<213> Homo sapiens 



<400> 899 

Met Ala Ser Ser Thr Val Pro Val 
1 5 

Thr Pro Glu lie Pro Asp Asn Val 
20 

Arg Phe Ala Thr Asp Arg Asn Asp 

35 40 
Leu Gly Leu Phe Ala Ala Gly Val 

50 55 
He Asp Leu Met Ala Pro Gin Pro 
65 * 7 0 



Ser Ala Ala Gly Ser Ala Asn Glu 

10 15 
Gly Asp Trp Leu Arg Gly Val Tyr 
25 30 
Phe Arg Arg Asn Leu He Leu Asn 
45 

Trp Leu Ala Arg Asn Leu Ser Asp 
60 

Gly Val 



<210> 900 
<211> 81 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 19 

<223> Xaa = Arg, Ser 
<220> 

<221> UNSURE 
<222> 75 

<223> Xaa = Leu, Arg 
<220> 

<221> UNSURE 
<222> 74 

<223> Xaa = Lys, Asn 



<400> 900 












Met Gly 


Ala 


Thr 


Gly 


Pro 


Lys 


Leu 


1 






5 








Gly Arg 


Xaa 


Gly 


Gly 


Ser 


Glu 


Ala 






20 










Arg Pro 


Arg 


Gly 


Arg 


Ala 


Val 


Pro 




35 










40 


Ser Met 


Phe 


Tyr 


Asp 


Glu 


Asp 


Gly 


50 










55 




Glu Thr 


He 


Val 


Thr 


Lys 


Asn 


Gly 


65 








70 






Ser 















Gly Ala Cys Gly Leu Arg Leu Gly 

10 15 
Ala Gly Ala Glu Gin Ala Leu Val 
25 30 
Pro Phe Val Phe Thr Arg Arg Gly 
45 

Asp Leu Ala His Glu Phe Tyr Glu 
60 

Gin Xaa Xaa Ala Lys Leu Arg Ala 
75 80 



<210> 901 
<211> 68 
<212> PRT 
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<213> Homo sapiens 



449 
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<400> 901 

Met Trp Ala Phe 

1 

lie Val Ala Val 
20 

Val Val Leu Gin 
35 

Gin Pro Met Gin 
50 

Pro Pro Lys Lys 
65 



Pro Trp Glu Glu 
5 

Glu Ala Lys Trp 

Val lie Leu Glu 
40 

Lys Pro Val Tyr 
55 



Pro Leu Tyr Val 
10 

Thr Glu Ser Pro 
25 

Lys Leu Thr Tyr 

Val Gin Ser Ala 
60 



lie Thr Phe Tyr 
15 

Arg Gin His Glu 
30 

His Thr Thr His 
45 

Glu Cys Leu Gly 



<210> 
<211> 
<212> 
<213> 



902 
107 
PRT 
Homo 



sapiens 



<220> 

<221> UNSURE 
<222> 100 

<223> Xaa = * , Trp 
<220> 

<221> UNSURE 
<222> 27 

<223> Xaa = * , Tyr 
<220> 

<221> UNSURE 
<222> 23 

<223> Xaa - Arg, Ser 
<220> 

<221> UNSURE 
<222> 19 

<223> Xaa = Glu, Gly 
<220> 

<221> UNSURE 
<222> 74, 107 
<223> Xaa = Gly, Val 

<220> 

<221> UNSURE 
<222> 102,104 
<223> Xaa = His, Gin 

<220> 

<221> UNSURE 
<222> 28 

<223> Xaa = Lys, Thr 



<400> 902 

Met Pro Gin Leu 

1 

Tyr Thr Xaa Thr 
20 

Pro Met Asp lie 
35 

Pro Gin Gly Asn 
50 



Lys Leu Thr Ser 
5 

Arg Thr Xaa Ser 

Gin Met Ala Asn 
40 

Asp Cys Asp Leu 
55 



Val Trp Thr Thr 
10 

Pro Gly Xaa Xaa 
25 

Asn Phe Thr Pro 

Tyr Ala His His 
60 



Thr His Ser Tyr 
15 

Pro Gly Pro Pro 
30 

Pro Ser Ala Thr 

45 

Ser Thr Ala Ser 
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lie Val Met Pro Leu His Tyr Ser Leu Xaa Phe lie lie Gly Leu Val 
65 70 75 80 

Gly Asn Leu Leu Ala Leu Val Val lie Val Gin Asn Arg Lys Lys lie 

85 90 95 

lie Ser Pro Xaa Ala Xaa Pro Xaa Pro lie Xaa 
100 105 



<210> 903 

<211> 72 

<212> PRT 

<213> Homo sapiens 

<400> 903 

Met Leu Asp Ser lie Trp Lys Pro 
1 5 

Gly Ala Asn Phe His Glu Val Thr 
20 

Phe Lys Asn Gly Asn Val Leu Tyr 

35 40 
Ser Cys Pro Met Asp Leu Lys Asn 

50 55 
lie Met Gin Leu Glu Ser Leu Val 
65 70 



Asp Leu Phe Phe Ala Asn Glu Lys 

10 15 
Thr Asp Asn Lys Leu Leu Arg lie 
25 30 
Ser lie Arg Leu Thr Leu Thr Leu 
45 

Phe Pro Met Asp Val Gin Thr Cys 
60 



<210> 904 
<211> 57 
<212> PRT 

<213> Homo sapiens 



<400> 904 

Met Gly Lys Gin 

1 

Glu Thr Val Phe 

20 

Tyr Ser Phe Val 
35 

His lie Ser Leu 
50 



Arg Asn Leu Thr 
5 

Leu Asn His Leu 

Glu Pro Leu Leu 
40 

Leu Phe Gin Leu 
55 



Leu Phe Phe Ser 
10 

Leu Asn Ser Val 

25 

Ser Tyr Leu Arg 
Tyr 



Phe Pro Phe Cys 
15 

Phe Phe Pro Ser 
30 

Phe Pro Gin Thr 
45 



<210> 905 
<211> 102 
<212> PRT 

<213> Homo sapiens 



<400> 905 



Met 


Glu 


Arg 


Leu 


Thr 


Leu 


Pro 


Leu 


Gly 


Gly Ala 


Ala 


Ala 


Val 


Asp 


Glu 


1 








5 










10 










15 




Tyr 


Leu 


Glu 


Tyr 


Arg 


Arg 


He 


Val 


Gly 


Glu 


Asp 


Asp 


Gly 


Gly 


Lys 


Leu 








20 










25 










30 






Phe 


Thr 


Pro 


Glu 


Glu 


Tyr 


Glu 


Glu 


Tyr 


Lys 


Arg 


Lys 


Val 


Leu 


Pro 


Leu 






35 










40 










45 








Arg 


Leu 


Gin 


Asn 


Arg 


Leu 


Phe 


Val 


Ser 


Trp 


Arg 


Ser 


Pro 


Thr 


Gly 


Met 




50 










55 










60 










Asp 


Cys 


Lys 


Leu 


Val 


Gly 


Pro 


Glu 


Thr 


Leu 


Cys 


Phe 


Cys 


Thr 


His 


Arg 


65 










70 










75 










80 


Tyr 


Lys 


Gin 


His 


Lys 


Thr 


Asp 


Leu 


Glu 


Ala 


He 


Pro 


Gin 


Gin 


Cys 


Pro 










85 










90 










95 




lie 


Asp 


Pro 


Pro 


Ala 


Lys 























100 



<210> 906 
<211> 104 
<212> PRT 
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<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 64 

<223> Xaa « His, Gin 
<400> 906 



Met 


Ser 


Thr 


Leu 


Leu 


He 


Ser 


Ser 


Ser 


Asp 


Ala 


Leu 


Leu 


Glu 


Glu 


Leu 


1 








5 










10 










15 




Glu 


Arg 


Ser 


Thr 
20 


Leu 


Gin 


Asp 


Ser 


Asp 
25 


Glu 


Tyr 


Ser 


Asn 
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Ser Ser Asn Ser 
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Gly Lys Ser Cys 
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Pro Pro Val Gly 
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Gin Glu Lys He 
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